
copilot: Correct o3-mini context length #24152

Merged (2 commits) on Feb 4, 2025

Conversation

@chapel (Contributor) commented Feb 3, 2025

It should be 200k (with 100k output). I can't find anything that puts it at 20k, and the changeover in 2f82374 only changed the name from o1-mini to o3-mini.

References:

Release Notes:

  • Corrected GitHub Copilot o3-mini context length
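
For illustration, the shape of the fix is roughly the following. This is a minimal sketch, not Zed's actual source; the type and method names are hypothetical stand-ins for wherever the Copilot model metadata lives:

    // Hypothetical sketch (not Zed's real code): per-model token limits in
    // the Copilot integration. The o3-mini entry had carried over o1-mini's
    // 20k limit when only the name was changed in 2f82374.
    enum Model {
        O3Mini,
    }

    impl Model {
        fn max_token_count(&self) -> usize {
            match self {
                // was 20_000; corrected to the 200k advertised for o3-mini
                Model::O3Mini => 200_000,
            }
        }
    }

    fn main() {
        assert_eq!(Model::O3Mini.max_token_count(), 200_000);
    }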

@maxdeviant changed the title from "Correcting o3-mini context length" to "Correct o3-mini context length" on Feb 3, 2025
@maxdeviant changed the title from "Correct o3-mini context length" to "copilot: Correct o3-mini context length" on Feb 3, 2025
@cla-bot added the cla-signed label ("The user has signed the Contributor License Agreement") on Feb 3, 2025

@maxdeviant (Member)

> I can't find anything that puts it at 20k, and the changeover in 2f82374 only changed the name from o1-mini to o3-mini.

Here's the context for where the 20k limit came from: #20362

@chapel (Contributor, Author) commented Feb 3, 2025

> Here's the context for where the 20k limit came from: #20362

Ah, I didn't find that, but I appreciate the context. I haven't tested the API; if they are limiting it to that, then obviously close the PR. I wish they publicly posted what their limits were, regardless.

@maxdeviant (Member)

> > Here's the context for where the 20k limit came from: #20362
>
> Ah, I didn't find that, but I appreciate the context. I haven't tested the API; if they are limiting it to that, then obviously close the PR. I wish they publicly posted what their limits were, regardless.

That was for o1-mini, so the question is whether o3-mini has a higher token count and, if so, what it is.

It looks like there is an API endpoint to retrieve the information: #20362 (comment)

It doesn't seem to work unauthenticated for me, so I'd need to figure out how to auth against it.
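
(For anyone who wants to poke at it: a minimal sketch of the auth dance, pieced together from what the editor plugins appear to do rather than from any official docs, so treat both endpoints as assumptions. The idea is to exchange a GitHub OAuth token for a short-lived Copilot bearer token, then hit the models endpoint with that:)

    // Sketch only; endpoints taken from editor-plugin reverse engineering,
    // not official documentation. Requires reqwest (blocking + json
    // features) and serde_json.
    use reqwest::blocking::Client;

    fn main() -> Result<(), Box<dyn std::error::Error>> {
        let github_token = std::env::var("GITHUB_TOKEN")?;
        let client = Client::new();

        // 1. Exchange the GitHub OAuth token for a short-lived Copilot token.
        let resp: serde_json::Value = client
            .get("https://api.github.com/copilot_internal/v2/token")
            .header("Authorization", format!("token {github_token}"))
            .header("User-Agent", "token-probe")
            .send()?
            .json()?;
        let copilot_token = resp["token"].as_str().ok_or("no token in response")?;

        // 2. List the models (and their limits) visible to this account.
        let models: serde_json::Value = client
            .get("https://api.githubcopilot.com/models")
            .header("Authorization", format!("Bearer {copilot_token}"))
            .send()?
            .json()?;
        println!("{models:#}");
        Ok(())
    }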

@chapel (Contributor, Author) commented Feb 3, 2025

> It doesn't seem to work unauthenticated for me, so I'd need to figure out how to auth against it.

Sadly, my work account for Copilot doesn't give us access to the new models right now. Later tonight I can see if the free version of Copilot lets you use o3-mini and whether the context is higher.

@itsaphel commented Feb 4, 2025

Can confirm 200k input and 100k output tokens, according to the /models endpoint anyway:

    {
        "id": "azureml://registries/azure-openai/models/o3-mini/versions/2025-01-31",
        "registry": "azure-openai",
        "name": "o3-mini",
        "original_name": "o3-mini",
        "friendly_name": "OpenAI o3-mini",
        "task": "chat-completion",
        "publisher": "OpenAI",
        "license": "custom",
        "summary": "o3-mini includes the o1 features with significant cost-efficiencies for scenarios requiring high performance.",
        "model_family": "OpenAI",
        "model_version": "2025-01-31",
        "popularity": 55.01,
        "tags": [
            "reasoning",
            "multilingual",
            "coding"
        ],
        "rate_limit_tier": "custom",
        "supported_languages": [
            "en",
            "it",
            "af",
            "es",
            "de",
            "fr",
            "id",
            "ru",
            "pl",
            "uk",
            "el",
            "lv",
            "zh",
            "ar",
            "tr",
            "ja",
            "sw",
            "cy",
            "ko",
            "is",
            "bn",
            "ur",
            "ne",
            "th",
            "pa",
            "mr",
            "te"
        ],
        "max_output_tokens": 100000,
        "max_input_tokens": 200000,
        "training_data_date": null,
        "license_description": "Use of Azure OpenAI Service is subject to applicable Microsoft\nProduct Terms <https://www.microsoft.com/licensing/terms/welcome/welcomepage> including the Universal License Terms for Microsoft Generative AI Services and the service-specific terms for the Azure OpenAI product offering.",
        "static_model": false,
        "supported_input_modalities": [
            "text"
        ],
        "supported_output_modalities": [
            "text"
        ]
    },

I have not personally tried to input 200k tokens, however.
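
(Side note: if you'd rather read those limits programmatically than eyeball the JSON, a small sketch along these lines works; serde/serde_json are assumed as dependencies, and the entry is abridged to the fields that matter:)

    // Pull just the limit fields out of a catalog entry like the one above.
    // serde ignores the many extra fields in the real response by default.
    use serde::Deserialize;

    #[derive(Deserialize, Debug)]
    struct CatalogModel {
        name: String,
        max_input_tokens: u64,
        max_output_tokens: u64,
    }

    fn main() -> serde_json::Result<()> {
        let raw = r#"{
            "name": "o3-mini",
            "max_input_tokens": 200000,
            "max_output_tokens": 100000
        }"#;
        let m: CatalogModel = serde_json::from_str(raw)?;
        // Prints: o3-mini: 200000 in / 100000 out
        println!("{}: {} in / {} out", m.name, m.max_input_tokens, m.max_output_tokens);
        Ok(())
    }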

@notpeter merged commit 2853649 into zed-industries:main on Feb 4, 2025 (13 checks passed)
@notpeter (Member) commented Feb 4, 2025

Thanks!

@chapel deleted the patch-1 branch on February 4, 2025 at 18:04
@SirSilver (Contributor)

[screenshot of the error]
Got this error with 0.173.1-pre using Copilot o3-mini, so I believe the context length was correct before this PR, or the wrong model is being used under the o3-mini name.

@itsaphel commented Feb 5, 2025

Hm. I've just sniffed the request made by Copilot in VSCode, and I get this:

{
	"data": [{
		"capabilities": {
			"family": "gpt-3.5-turbo",
			"limits": {
				"max_context_window_tokens": 16384,
				"max_output_tokens": 4096,
				"max_prompt_tokens": 12288
			},
			"object": "model_capabilities",
			"supports": {
				"streaming": true,
				"tool_calls": true
			},
			"tokenizer": "cl100k_base",
			"type": "chat"
		},
		"id": "gpt-3.5-turbo",
		"model_picker_enabled": false,
		"name": "GPT 3.5 Turbo",
		"object": "model",
		"preview": false,
		"vendor": "Azure OpenAI",
		"version": "gpt-3.5-turbo-0613"
	}, {
		"capabilities": {
			"family": "gpt-3.5-turbo",
			"limits": {
				"max_context_window_tokens": 16384,
				"max_output_tokens": 4096,
				"max_prompt_tokens": 12288
			},
			"object": "model_capabilities",
			"supports": {
				"streaming": true,
				"tool_calls": true
			},
			"tokenizer": "cl100k_base",
			"type": "chat"
		},
		"id": "gpt-3.5-turbo-0613",
		"model_picker_enabled": false,
		"name": "GPT 3.5 Turbo",
		"object": "model",
		"preview": false,
		"vendor": "Azure OpenAI",
		"version": "gpt-3.5-turbo-0613"
	}, {
		"capabilities": {
			"family": "gpt-4",
			"limits": {
				"max_context_window_tokens": 32768,
				"max_output_tokens": 4096,
				"max_prompt_tokens": 32768
			},
			"object": "model_capabilities",
			"supports": {
				"streaming": true,
				"tool_calls": true
			},
			"tokenizer": "cl100k_base",
			"type": "chat"
		},
		"id": "gpt-4",
		"model_picker_enabled": false,
		"name": "GPT 4",
		"object": "model",
		"preview": false,
		"vendor": "Azure OpenAI",
		"version": "gpt-4-0613"
	}, {
		"capabilities": {
			"family": "gpt-4",
			"limits": {
				"max_context_window_tokens": 32768,
				"max_output_tokens": 4096,
				"max_prompt_tokens": 32768
			},
			"object": "model_capabilities",
			"supports": {
				"streaming": true,
				"tool_calls": true
			},
			"tokenizer": "cl100k_base",
			"type": "chat"
		},
		"id": "gpt-4-0613",
		"model_picker_enabled": false,
		"name": "GPT 4",
		"object": "model",
		"preview": false,
		"vendor": "Azure OpenAI",
		"version": "gpt-4-0613"
	}, {
		"capabilities": {
			"family": "gpt-4o",
			"limits": {
				"max_context_window_tokens": 128000,
				"max_output_tokens": 4096,
				"max_prompt_tokens": 64000,
				"vision": {
					"max_prompt_image_size": 3145728,
					"max_prompt_images": 1
				}
			},
			"object": "model_capabilities",
			"supports": {
				"parallel_tool_calls": true,
				"streaming": true,
				"tool_calls": true
			},
			"tokenizer": "o200k_base",
			"type": "chat"
		},
		"id": "gpt-4o",
		"model_picker_enabled": true,
		"name": "GPT 4o",
		"object": "model",
		"preview": false,
		"vendor": "Azure OpenAI",
		"version": "gpt-4o-2024-05-13"
	}, {
		"capabilities": {
			"family": "gpt-4o",
			"limits": {
				"max_context_window_tokens": 128000,
				"max_output_tokens": 4096,
				"max_prompt_tokens": 64000,
				"vision": {
					"max_prompt_image_size": 3145728,
					"max_prompt_images": 1
				}
			},
			"object": "model_capabilities",
			"supports": {
				"parallel_tool_calls": true,
				"streaming": true,
				"tool_calls": true
			},
			"tokenizer": "o200k_base",
			"type": "chat"
		},
		"id": "gpt-4o-2024-05-13",
		"model_picker_enabled": false,
		"name": "GPT 4o",
		"object": "model",
		"preview": false,
		"vendor": "Azure OpenAI",
		"version": "gpt-4o-2024-05-13"
	}, {
		"capabilities": {
			"family": "gpt-4o",
			"limits": {
				"max_context_window_tokens": 128000,
				"max_output_tokens": 4096,
				"max_prompt_tokens": 64000
			},
			"object": "model_capabilities",
			"supports": {
				"parallel_tool_calls": true,
				"streaming": true,
				"tool_calls": true
			},
			"tokenizer": "o200k_base",
			"type": "chat"
		},
		"id": "gpt-4-o-preview",
		"model_picker_enabled": false,
		"name": "GPT 4o",
		"object": "model",
		"preview": false,
		"vendor": "Azure OpenAI",
		"version": "gpt-4o-2024-05-13"
	}, {
		"capabilities": {
			"family": "gpt-4o",
			"limits": {
				"max_context_window_tokens": 128000,
				"max_output_tokens": 16384,
				"max_prompt_tokens": 64000
			},
			"object": "model_capabilities",
			"supports": {
				"parallel_tool_calls": true,
				"streaming": true,
				"tool_calls": true
			},
			"tokenizer": "o200k_base",
			"type": "chat"
		},
		"id": "gpt-4o-2024-08-06",
		"model_picker_enabled": false,
		"name": "GPT 4o",
		"object": "model",
		"preview": false,
		"vendor": "Azure OpenAI",
		"version": "gpt-4o-2024-08-06"
	}, {
		"capabilities": {
			"family": "text-embedding-ada-002",
			"limits": {
				"max_inputs": 256
			},
			"object": "model_capabilities",
			"supports": {},
			"tokenizer": "cl100k_base",
			"type": "embeddings"
		},
		"id": "text-embedding-ada-002",
		"model_picker_enabled": false,
		"name": "Embedding V2 Ada",
		"object": "model",
		"preview": false,
		"vendor": "Azure OpenAI",
		"version": "text-embedding-ada-002"
	}, {
		"capabilities": {
			"family": "text-embedding-3-small",
			"limits": {
				"max_inputs": 512
			},
			"object": "model_capabilities",
			"supports": {
				"dimensions": true
			},
			"tokenizer": "cl100k_base",
			"type": "embeddings"
		},
		"id": "text-embedding-3-small",
		"model_picker_enabled": false,
		"name": "Embedding V3 small",
		"object": "model",
		"preview": false,
		"vendor": "Azure OpenAI",
		"version": "text-embedding-3-small"
	}, {
		"capabilities": {
			"family": "text-embedding-3-small",
			"object": "model_capabilities",
			"supports": {
				"dimensions": true
			},
			"tokenizer": "cl100k_base",
			"type": "embeddings"
		},
		"id": "text-embedding-3-small-inference",
		"model_picker_enabled": false,
		"name": "Embedding V3 small (Inference)",
		"object": "model",
		"preview": false,
		"vendor": "Azure OpenAI",
		"version": "text-embedding-3-small"
	}, {
		"capabilities": {
			"family": "gpt-4o-mini",
			"limits": {
				"max_context_window_tokens": 128000,
				"max_output_tokens": 4096,
				"max_prompt_tokens": 12288
			},
			"object": "model_capabilities",
			"supports": {
				"parallel_tool_calls": true,
				"streaming": true,
				"tool_calls": true
			},
			"tokenizer": "o200k_base",
			"type": "chat"
		},
		"id": "gpt-4o-mini",
		"model_picker_enabled": false,
		"name": "GPT 4o Mini",
		"object": "model",
		"preview": false,
		"vendor": "Azure OpenAI",
		"version": "gpt-4o-mini-2024-07-18"
	}, {
		"capabilities": {
			"family": "gpt-4o-mini",
			"limits": {
				"max_context_window_tokens": 128000,
				"max_output_tokens": 4096,
				"max_prompt_tokens": 12288
			},
			"object": "model_capabilities",
			"supports": {
				"parallel_tool_calls": true,
				"streaming": true,
				"tool_calls": true
			},
			"tokenizer": "o200k_base",
			"type": "chat"
		},
		"id": "gpt-4o-mini-2024-07-18",
		"model_picker_enabled": false,
		"name": "GPT 4o Mini",
		"object": "model",
		"preview": false,
		"vendor": "Azure OpenAI",
		"version": "gpt-4o-mini-2024-07-18"
	}, {
		"capabilities": {
			"family": "o1-ga",
			"limits": {
				"max_context_window_tokens": 200000,
				"max_prompt_tokens": 20000
			},
			"object": "model_capabilities",
			"supports": {
				"tool_calls": true
			},
			"tokenizer": "o200k_base",
			"type": "chat"
		},
		"id": "o1",
		"model_picker_enabled": true,
		"name": "o1 (Preview)",
		"object": "model",
		"preview": true,
		"vendor": "Azure OpenAI",
		"version": "o1-2024-12-17"
	}, {
		"capabilities": {
			"family": "o1-ga",
			"limits": {
				"max_context_window_tokens": 200000,
				"max_prompt_tokens": 20000
			},
			"object": "model_capabilities",
			"supports": {
				"tool_calls": true
			},
			"tokenizer": "o200k_base",
			"type": "chat"
		},
		"id": "o1-2024-12-17",
		"model_picker_enabled": false,
		"name": "o1 (Preview)",
		"object": "model",
		"preview": true,
		"vendor": "Azure OpenAI",
		"version": "o1-2024-12-17"
	}, {
		"capabilities": {
			"family": "o3-mini",
			"limits": {
				"max_context_window_tokens": 200000,
				"max_output_tokens": 100000,
				"max_prompt_tokens": 20000
			},
			"object": "model_capabilities",
			"supports": {
				"streaming": true,
				"tool_calls": true
			},
			"tokenizer": "o200k_base",
			"type": "chat"
		},
		"id": "o3-mini",
		"model_picker_enabled": true,
		"name": "o3-mini (Preview)",
		"object": "model",
		"preview": true,
		"vendor": "Azure OpenAI",
		"version": "o3-mini-2025-01-31"
	}, {
		"capabilities": {
			"family": "o3-mini",
			"limits": {
				"max_context_window_tokens": 200000,
				"max_output_tokens": 100000,
				"max_prompt_tokens": 20000
			},
			"object": "model_capabilities",
			"supports": {
				"streaming": true,
				"tool_calls": true
			},
			"tokenizer": "o200k_base",
			"type": "chat"
		},
		"id": "o3-mini-2025-01-31",
		"model_picker_enabled": false,
		"name": "o3-mini (Preview)",
		"object": "model",
		"preview": true,
		"vendor": "Azure OpenAI",
		"version": "o3-mini-2025-01-31"
	}, {
		"capabilities": {
			"family": "o3-mini",
			"limits": {
				"max_context_window_tokens": 200000,
				"max_output_tokens": 100000,
				"max_prompt_tokens": 20000
			},
			"object": "model_capabilities",
			"supports": {
				"streaming": true,
				"tool_calls": true
			},
			"tokenizer": "o200k_base",
			"type": "chat"
		},
		"id": "o3-mini-paygo",
		"model_picker_enabled": false,
		"name": "o3-mini (Preview)",
		"object": "model",
		"preview": true,
		"vendor": "Azure OpenAI",
		"version": "o3-mini-paygo"
	}, {
		"capabilities": {
			"family": "claude-3.5-sonnet",
			"limits": {
				"max_context_window_tokens": 128000,
				"max_output_tokens": 4096,
				"max_prompt_tokens": 128000,
				"vision": {
					"max_prompt_image_size": 3145728,
					"max_prompt_images": 1
				}
			},
			"object": "model_capabilities",
			"supports": {
				"parallel_tool_calls": true,
				"streaming": true,
				"tool_calls": true
			},
			"tokenizer": "o200k_base",
			"type": "chat"
		},
		"id": "claude-3.5-sonnet",
		"model_picker_enabled": true,
		"name": "Claude 3.5 Sonnet (Preview)",
		"object": "model",
		"policy": {
			"state": "enabled",
			"terms": "Enable access to the latest Claude 3.5 Sonnet model from Anthropic. [Learn more about how GitHub Copilot serves Claude 3.5 Sonnet](https://docs.github.com/copilot/using-github-copilot/using-claude-sonnet-in-github-copilot)."
		},
		"preview": true,
		"vendor": "Anthropic",
		"version": "claude-3.5-sonnet"
	}],
	"object": "list"
}

Namely, for o3-mini:

"limits": {
    "max_context_window_tokens": 200000,
    "max_output_tokens": 100000,
    "max_prompt_tokens": 20000
},

So indeed, it looks like only 20k tokens can be sent in the prompt. Assuming Copilot even allows a full 200k to be used (as in, assuming it's not restricting beyond the model's capabilities), I presume it's not directly via the chat context. I can see file contexts being chunked and sent to a separate API endpoint, which I presume is the 'correct' way to use the entire available token window. I'll try to look into it a bit more.
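
(To make the distinction concrete, here is a rough sketch of what honoring these limits client-side looks like: clamp outgoing prompts to max_prompt_tokens even though the context window is ten times larger. The token estimate is a crude placeholder for the o200k_base tokenizer the endpoint advertises:)

    // Illustrative only: enforce max_prompt_tokens on the client side,
    // independently of the much larger max_context_window_tokens.
    const MAX_PROMPT_TOKENS: usize = 20_000;

    /// Crude ~4-chars-per-token estimate; a real client would count with
    /// the o200k_base tokenizer.
    fn estimate_tokens(text: &str) -> usize {
        text.chars().count() / 4
    }

    fn fits_in_prompt(messages: &[&str]) -> bool {
        let total: usize = messages.iter().map(|m| estimate_tokens(m)).sum();
        total <= MAX_PROMPT_TOKENS
    }

    fn main() {
        // ~100k characters ≈ 25k tokens: over the 20k prompt cap even
        // though it is far under the 200k context window.
        let big = "x".repeat(100_000);
        assert!(!fits_in_prompt(&[big.as_str()]));
        println!("prompt cap enforced independently of context window");
    }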

I presume this change should be reverted, though. My apologies for the mistake the first time; it seems the different methods of calling Copilot give different results and limits.

notpeter added a commit that referenced this pull request on Feb 5, 2025:

    Reverts #24152
    See comment: #24152 (comment)
    Manually confirmed >20k generates error.
osiewicz pushed a commit to RemcoSmitsDev/zed that referenced this pull request Feb 5, 2025
@chapel (Contributor, Author) commented Feb 6, 2025

Thanks @SirSilver and @itsaphel for getting to the bottom of this; it is unfortunate that Copilot doesn't actually document this.

It is an interesting distinction, though: you can potentially use up to 200k tokens of context but only send up to 20k at a time. I wonder if it counts cached tokens (things it has seen before, as OpenAI does).
