AI context windows

Every model’s context window, in one place

The largest current production context window is 10,000,000 tokens — Meta Llama 4 Scout, as of June 23, 2026. Across 121 models from 8 providers; the historical chart below traces the running maximum from 2018 onward.

As of June 23, 2026.

Largest current
10,000,000
Llama 4 Scout
Median current
400,000
tokens, current models
2020 baseline
2,048
tokens (GPT-3)
Growth since 2020
~4,883×
on the maximum

Maximum context window over time

One marker per public model release, color-coded by provider. The dashed line traces the running maximum across all providers — the “largest context window in production” at each point in time. Y-axis is log-scale so a 5,000× range still reads.

1K8K32K128K1M10M201820192020202120222023202420252026OpenAI · GPT-1 · 512 tokens · Jun 11, 2018OpenAI · GPT-2 · 1,024 tokens · Feb 14, 2019OpenAI · GPT-3 · 2,048 tokens · May 28, 2020OpenAI · InstructGPT (text-davinci-002) · 4,097 tokens · Jan 27, 2022OpenAI · GPT-3.5 / ChatGPT launch · 4,096 tokens · Nov 30, 2022Meta · LLaMA 1 · 2,048 tokens · Mar 3, 2023OpenAI · GPT-4 · 8,192 tokens · Mar 14, 2023Anthropic · Claude 1 · 9,000 tokens · Mar 14, 2023Google · Bard (LaMDA-based) · 8,192 tokens · Mar 21, 2023Google · PaLM 2 · 8,192 tokens · May 10, 2023OpenAI · GPT-3.5 Turbo 16K · 16,385 tokens · Jun 13, 2023OpenAI · GPT-4-32K · 32,768 tokens · Jun 13, 2023Anthropic · Claude 2 · 100,000 tokens · Jul 11, 2023Meta · Llama 2 · 4,096 tokens · Jul 18, 2023Alibaba · Qwen-7B · 8,192 tokens · Aug 3, 2023Anthropic · Claude Instant 1.2 · 100,000 tokens · Aug 9, 2023Meta · Code Llama · 16,384 tokens · Aug 24, 2023Alibaba · Qwen-14B / Qwen 1.5 · 8,192 tokens · Sep 25, 2023Mistral · Mistral 7B · 8,192 tokens · Sep 27, 2023DeepSeek · DeepSeek-LLM 7B / 67B · 4,096 tokens · Nov 2, 2023DeepSeek · DeepSeek-Coder · 16,384 tokens · Nov 2, 2023xAI · Grok 1 · 8,192 tokens · Nov 4, 2023OpenAI · GPT-4 Turbo · 128,000 tokens · Nov 6, 2023Anthropic · Claude 2.1 · 200,000 tokens · Nov 21, 2023Google · Gemini 1.0 Pro · 32,768 tokens · Dec 6, 2023Mistral · Mixtral 8x7B · 32,768 tokens · Dec 11, 2023Google · Gemini 1.0 Ultra · 32,768 tokens · Feb 8, 2024Google · Gemini 1.5 Pro · 1,000,000 tokens · Feb 15, 2024Mistral · Mistral Large · 32,768 tokens · Feb 26, 2024Anthropic · Claude 3 Opus · 200,000 tokens · Mar 4, 2024Anthropic · Claude 3 Sonnet · 200,000 tokens · Mar 4, 2024Anthropic · Claude 3 Haiku · 200,000 tokens · Mar 13, 2024xAI · Grok 1.5 · 131,072 tokens · Mar 28, 2024Meta · Llama 3 (8B / 70B) · 8,192 tokens · Apr 18, 2024DeepSeek · DeepSeek-V2 · 128,000 tokens · May 6, 2024OpenAI · GPT-4o · 128,000 tokens · May 13, 2024Google · Gemini 1.5 Flash · 1,000,000 tokens · May 14, 2024Alibaba · Qwen2 family · 131,072 tokens · Jun 7, 2024Anthropic · Claude 3.5 Sonnet · 200,000 tokens · Jun 20, 2024OpenAI · GPT-4o mini · 128,000 tokens · Jul 18, 2024Meta · Llama 3.1 (8B / 70B / 405B) · 131,072 tokens · Jul 23, 2024Mistral · Mistral Large 2 · 128,000 tokens · Jul 24, 2024xAI · Grok 2 · 131,072 tokens · Aug 13, 2024OpenAI · o1-preview · 128,000 tokens · Sep 12, 2024OpenAI · o1-mini · 128,000 tokens · Sep 12, 2024Alibaba · Qwen2.5 family · 131,072 tokens · Sep 19, 2024Google · Gemini 1.5 (002 refresh, 2M) · 2,000,000 tokens · Sep 24, 2024Meta · Llama 3.2 (Vision + Edge) · 131,072 tokens · Sep 25, 2024Google · Gemini 1.5 Flash-8B · 1,000,000 tokens · Oct 3, 2024Anthropic · Claude 3.5 Sonnet (new) · 200,000 tokens · Oct 22, 2024Anthropic · Claude 3.5 Haiku · 200,000 tokens · Nov 4, 2024OpenAI · o1 · 200,000 tokens · Dec 5, 2024Meta · Llama 3.3 70B Instruct · 131,072 tokens · Dec 6, 2024Google · Gemini 2.0 Flash · 1,000,000 tokens · Dec 11, 2024DeepSeek · DeepSeek-V3 · 128,000 tokens · Dec 26, 2024DeepSeek · DeepSeek-R1 · 128,000 tokens · Jan 20, 2025Mistral · Mistral Small 3 · 32,768 tokens · Jan 30, 2025OpenAI · o3-mini · 200,000 tokens · Jan 31, 2025Google · Gemini 2.0 Pro Experimental · 2,000,000 tokens · Feb 5, 2025xAI · Grok 3 · 131,072 tokens · Feb 17, 2025Anthropic · Claude 3.7 Sonnet · 200,000 tokens · Feb 24, 2025OpenAI · GPT-4.5 (Orion) · 128,000 tokens · Feb 27, 2025Google · Gemini 2.5 Pro · 1,048,576 tokens · Mar 25, 2025Meta · Llama 4 Scout · 10,000,000 tokens · Apr 5, 2025Meta · Llama 4 Maverick · 1,000,000 tokens · Apr 5, 2025OpenAI · GPT-4.1 · 1,000,000 tokens · Apr 14, 2025OpenAI · o3 · 200,000 tokens · Apr 16, 2025OpenAI · o4-mini · 200,000 tokens · Apr 16, 2025Alibaba · Qwen3 family · 131,072 tokens · Apr 28, 2025Anthropic · Claude Opus 4 · 200,000 tokens · May 22, 2025Anthropic · Claude Sonnet 4 · 200,000 tokens · May 22, 2025OpenAI · o3-pro · 200,000 tokens · Jun 10, 2025Google · Gemini 2.5 Flash · 1,048,576 tokens · Jun 17, 2025xAI · Grok 4 · 256,000 tokens · Jul 9, 2025Google · Gemini 2.5 Flash-Lite · 1,048,576 tokens · Jul 22, 2025Alibaba · Qwen3-Coder · 262,144 tokens · Jul 22, 2025Anthropic · Claude Opus 4.1 · 200,000 tokens · Aug 5, 2025OpenAI · GPT-5 · 400,000 tokens · Aug 7, 2025DeepSeek · DeepSeek-V3.1 · 128,000 tokens · Aug 21, 2025Alibaba · Qwen3-Next-80B-A3B · 262,144 tokens · Sep 11, 2025Alibaba · Qwen3-Max · 1,000,000 tokens · Sep 15, 2025xAI · Grok 4 Fast · 2,000,000 tokens · Sep 19, 2025Alibaba · Qwen3-VL family · 262,144 tokens · Sep 23, 2025Anthropic · Claude Sonnet 4.5 · 200,000 tokens · Sep 29, 2025Anthropic · Claude Haiku 4.5 · 200,000 tokens · Oct 15, 2025OpenAI · GPT-5.1 · 400,000 tokens · Nov 12, 2025xAI · Grok 4.1 · 256,000 tokens · Nov 17, 2025Google · Gemini 3 Pro · 1,048,576 tokens · Nov 18, 2025xAI · Grok 4.1 Fast · 2,000,000 tokens · Nov 19, 2025Anthropic · Claude Opus 4.5 · 200,000 tokens · Nov 24, 2025DeepSeek · DeepSeek-V3.2 (+ Speciale) · 128,000 tokens · Dec 1, 2025Mistral · Mistral Large 3 · 256,000 tokens · Dec 2, 2025OpenAI · GPT-5.2 · 400,000 tokens · Dec 11, 2025Google · Gemini 3 Flash · 1,048,576 tokens · Dec 17, 2025Alibaba · Qwen3-Coder-Next · 262,144 tokens · Feb 4, 2026OpenAI · GPT-5.3-Codex · 400,000 tokens · Feb 5, 2026Anthropic · Claude Opus 4.6 · 1,000,000 tokens · Feb 5, 2026Alibaba · Qwen3.5 family + Plus · 262,144 tokens · Feb 16, 2026Anthropic · Claude Sonnet 4.6 · 1,000,000 tokens · Feb 17, 2026Google · Gemini 3.1 Pro · 1,048,576 tokens · Feb 19, 2026OpenAI · GPT-5.3 Instant · 400,000 tokens · Mar 3, 2026Google · Gemini 3.1 Flash-Lite · 1,048,576 tokens · Mar 3, 2026OpenAI · GPT-5.4 · 1,050,000 tokens · Mar 5, 2026xAI · Grok 4.20 · 2,000,000 tokens · Mar 10, 2026Mistral · Mistral Small 4 · 256,000 tokens · Mar 16, 2026Alibaba · Qwen 3.6-Max-Preview · 1,000,000 tokens · Apr 2, 2026Meta · Muse Spark · 262,144 tokens · Apr 8, 2026Anthropic · Claude Opus 4.7 · 1,000,000 tokens · Apr 16, 2026Alibaba · Qwen3.6-35B-A3B · 262,144 tokens · Apr 16, 2026xAI · Grok 4.3 · 1,000,000 tokens · Apr 17, 2026Alibaba · Qwen3.6-27B · 262,144 tokens · Apr 22, 2026OpenAI · GPT-5.5 · 1,050,000 tokens · Apr 23, 2026DeepSeek · DeepSeek-V4-Pro · 1,000,000 tokens · Apr 24, 2026DeepSeek · DeepSeek-V4-Flash · 1,000,000 tokens · Apr 24, 2026Mistral · Mistral Medium 3.5 · 256,000 tokens · Apr 28, 2026xAI · Grok Build 0.1 · 256,000 tokens · May 14, 2026Google · Gemini 3.5 Flash · 1,048,576 tokens · May 19, 2026Alibaba · Qwen3.7-Max · 1,000,000 tokens · May 20, 2026Anthropic · Claude Opus 4.8 · 1,000,000 tokens · May 28, 2026Alibaba · Qwen3.7-Plus · 1,000,000 tokens · May 31, 2026Anthropic · Claude Fable 5 · 1,000,000 tokens · Jun 9, 2026tokens (log)
AnthropicOpenAIGooglexAIMetaDeepSeekMistralAlibaba

Every model

Sort by any column. Filter by provider or by minimum context window.

Min context:
Llama 4 Scout
10M context, the published max as of 2026-06-23.
Meta(Llama)
10M
Apr 5, 2025
Available
Gemini 1.5 (002 refresh, 2M)
First mainstream 2M context.
Google(Gemini)
2M
8.2K
Sep 24, 2024
Legacy
Gemini 2.0 Pro Experimental
Experimental SKU only; never reached a stable gemini-2.0-pro id. Superseded by Gemini 2.5 Pro seven weeks later.
Google(Gemini)
2M
8.2K
Feb 5, 2025
Legacy
Grok 4 Fast
First Grok 2M context. Retired May 15, 2026.
xAI(Grok)
2M
Sep 19, 2025
Deprecated
Grok 4.1 Fast
Retired May 15, 2026.
xAI(Grok)
2M
Nov 19, 2025
Deprecated
Grok 4.20
Three SKUs (multi-agent / reasoning / non-reasoning); multi-agent SKU documented at 2M context on docs.x.ai.
xAI(Grok)
2M
Mar 10, 2026
Available
GPT-5.4
1M-token context across the line.
OpenAI(ChatGPT)
1.1M
128K
Mar 5, 2026
Available
GPT-5.5
1M-token context; current flagship.
OpenAI(ChatGPT)
1.1M
128K
Apr 23, 2026
Current
Gemini 2.5 Pro
Stable 1M, 2M roadmap.
Google(Gemini)
1M
65.5K
Mar 25, 2025
Available
Gemini 2.5 Flash
Google(Gemini)
1M
65.5K
Jun 17, 2025
Available
Gemini 2.5 Flash-Lite
Google(Gemini)
1M
65.5K
Jul 22, 2025
Available
Gemini 3 Pro
Shut down Mar 9, 2026; migrate to Gemini 3.1 Pro.
Google(Gemini)
1M
65.5K
Nov 18, 2025
Deprecated
Gemini 3 Flash
Google(Gemini)
1M
65.5K
Dec 17, 2025
Available
Gemini 3.1 Pro
Replaced Gemini 3 Pro after Mar 9, 2026 shutdown.
Google(Gemini)
1M
65.5K
Feb 19, 2026
Current
Gemini 3.1 Flash-Lite
Google(Gemini)
1M
65.5K
Mar 3, 2026
Available
Gemini 3.5 Flash
Most intelligent model for sustained frontier performance.
Google(Gemini)
1M
65.5K
May 19, 2026
Current
Gemini 1.5 Pro
First mainstream 1M context.
Google(Gemini)
1M
8.2K
Feb 15, 2024
Legacy
Gemini 1.5 Flash
Google(Gemini)
1M
8.2K
May 14, 2024
Legacy
Gemini 1.5 Flash-8B
Google(Gemini)
1M
8.2K
Oct 3, 2024
Legacy
Gemini 2.0 Flash
Shut down Jun 1, 2026.
Google(Gemini)
1M
8.2K
Dec 11, 2024
Legacy
Llama 4 Maverick
Meta(Llama)
1M
Apr 5, 2025
Current
GPT-4.1
First OpenAI 1M-context production model.
OpenAI(ChatGPT)
1M
32.8K
Apr 14, 2025
Legacy
Qwen3-Max
Trillion-parameter MoE; superseded by Qwen 3.6-Max-Preview.
Alibaba(Qwen)
1M
Sep 15, 2025
Legacy
Claude Opus 4.6
1M GA Mar 13, 2026 (no long-context premium); 128K max output.
Anthropic(Claude)
1M
128K
Feb 5, 2026
Available
Claude Sonnet 4.6
1M context now default.
Anthropic(Claude)
1M
64K
Feb 17, 2026
Available
Qwen 3.6-Max-Preview
Proprietary preview, DashScope only; superseded by Qwen3.7-Max.
Alibaba(Qwen)
1M
Apr 2, 2026
Legacy
Claude Opus 4.7
1M default on Claude API / Bedrock / Vertex; 128K max output.
Anthropic(Claude)
1M
128K
Apr 16, 2026
Available
Grok 4.3
Native video input; in-chat doc generation.
xAI(Grok)
1M
Apr 17, 2026
Current
DeepSeek-V4-Pro
First DeepSeek 1M; Thinking + Non-Thinking modes; 384K max output.
DeepSeek(DeepSeek)
1M
384K
Apr 24, 2026
Current
DeepSeek-V4-Flash
384K max output.
DeepSeek(DeepSeek)
1M
384K
Apr 24, 2026
Current
Qwen3.7-Max
Closed-weights; native extended-thinking.
Alibaba(Qwen)
1M
May 20, 2026
Current
Claude Opus 4.8
1M default (200K on Microsoft Foundry); 128K output.
Anthropic(Claude)
1M
128K
May 28, 2026
Current
Qwen3.7-Plus
Multimodal vision+language sibling to Qwen3.7-Max; 1M context, 65K max output; DashScope only.
Alibaba(Qwen)
1M
65.5K
May 31, 2026
Current
Claude Fable 5
Mythos-class flagship; 1M context, 128K output. Access suspended (removed from the API) Jun 12, 2026 by US export-control directive; queries route to Opus 4.8.
Anthropic(Claude)
1M
128K
Jun 9, 2026
Deprecated
GPT-5
Unified router across reasoning + chat. API shutdown scheduled Jul 23, 2026.
OpenAI(ChatGPT)
400K
128K
Aug 7, 2025
Legacy
GPT-5.1
OpenAI(ChatGPT)
400K
128K
Nov 12, 2025
Legacy
GPT-5.2
OpenAI(ChatGPT)
400K
128K
Dec 11, 2025
Legacy
GPT-5.3-Codex
Code-tuned variant.
OpenAI(ChatGPT)
400K
128K
Feb 5, 2026
Available
GPT-5.3 Instant
ChatGPT default update; superseded by GPT-5.5 Instant May 5, 2026; remains accessible to paid users.
OpenAI(ChatGPT)
400K
128K
Mar 3, 2026
Available
Qwen3-Coder
256K native, extrapolated to 1M; Apache 2.0.
Alibaba(Qwen)
262.1K
Jul 22, 2025
Available
Qwen3-Next-80B-A3B
Ultra-sparse MoE; 256K native, extensible to 1M.
Alibaba(Qwen)
262.1K
Sep 11, 2025
Available
Qwen3-VL family
Vision-language; 256K native interleaved context.
Alibaba(Qwen)
262.1K
Sep 23, 2025
Available
Qwen3-Coder-Next
256K native, extensible to 1M; coding-agent MoE.
Alibaba(Qwen)
262.1K
Feb 4, 2026
Available
Qwen3.5 family + Plus
1M extended via YaRN.
Alibaba(Qwen)
262.1K
Feb 16, 2026
Available
Muse Spark
262K context; closed weights, API in private preview.
Meta(Llama)
262.1K
Apr 8, 2026
Current
Qwen3.6-35B-A3B
First Qwen3.6 open-weights release; 35B-total / 3B-active MoE.
Alibaba(Qwen)
262.1K
Apr 16, 2026
Current
Qwen3.6-27B
Hybrid Gated DeltaNet + self-attention; 1M extensible.
Alibaba(Qwen)
262.1K
Apr 22, 2026
Current
Grok 4
Retired May 15, 2026.
xAI(Grok)
256K
Jul 9, 2025
Deprecated
Grok 4.1
xAI(Grok)
256K
Nov 17, 2025
Legacy
Mistral Large 3
Granular MoE flagship, Apache 2.0.
Mistral(Mistral)
256K
Dec 2, 2025
Current
Mistral Small 4
Mistral(Mistral)
256K
Mar 16, 2026
Current
Mistral Medium 3.5
Largest dense open-weights; merged chat/reasoning/code.
Mistral(Mistral)
256K
Apr 28, 2026
Current
Grok Build 0.1
Purpose-built agentic coding model.
xAI(Grok)
256K
May 14, 2026
Current
Claude 2.1
First 200K window. Retired Jul 21, 2025.
Anthropic(Claude)
200K
Nov 21, 2023
Deprecated
Claude 3 Opus
Retired Jan 5, 2026.
Anthropic(Claude)
200K
4.1K
Mar 4, 2024
Deprecated
Claude 3 Sonnet
Retired Jul 21, 2025.
Anthropic(Claude)
200K
4.1K
Mar 4, 2024
Deprecated
Claude 3 Haiku
Retired Apr 20, 2026.
Anthropic(Claude)
200K
4.1K
Mar 13, 2024
Deprecated
Claude 3.5 Sonnet
Retired Oct 28, 2025.
Anthropic(Claude)
200K
8.2K
Jun 20, 2024
Deprecated
Claude 3.5 Sonnet (new)
Computer-use beta. Retired Oct 28, 2025.
Anthropic(Claude)
200K
8.2K
Oct 22, 2024
Deprecated
Claude 3.5 Haiku
Retired Feb 19, 2026.
Anthropic(Claude)
200K
8.2K
Nov 4, 2024
Deprecated
o1
First production o-series. API shutdown Oct 23, 2026.
OpenAI(ChatGPT)
200K
100K
Dec 5, 2024
Deprecated
o3-mini
OpenAI(ChatGPT)
200K
100K
Jan 31, 2025
Deprecated
Claude 3.7 Sonnet
Extended thinking. Retired Feb 19, 2026.
Anthropic(Claude)
200K
64K
Feb 24, 2025
Deprecated
o3
Demoted to legacy following o3-deep-research deprecation Apr 22, 2026.
OpenAI(ChatGPT)
200K
100K
Apr 16, 2025
Legacy
o4-mini
API snapshot shutdown scheduled Oct 23, 2026.
OpenAI(ChatGPT)
200K
100K
Apr 16, 2025
Legacy
Claude Opus 4
Retired Jun 15, 2026.
Anthropic(Claude)
200K
32K
May 22, 2025
Deprecated
Claude Sonnet 4
Retired Jun 15, 2026.
Anthropic(Claude)
200K
64K
May 22, 2025
Deprecated
o3-pro
Deprecated Jun 11, 2026; API shutdown scheduled Dec 11, 2026.
OpenAI(ChatGPT)
200K
100K
Jun 10, 2025
Legacy
Claude Opus 4.1
Deprecated Jun 5, 2026; retires Aug 5, 2026.
Anthropic(Claude)
200K
32K
Aug 5, 2025
Legacy
Claude Sonnet 4.5
Superseded by Sonnet 4.6; still served via the API (retirement no sooner than Sep 29, 2026).
Anthropic(Claude)
200K
64K
Sep 29, 2025
Available
Claude Haiku 4.5
Anthropic(Claude)
200K
64K
Oct 15, 2025
Available
Claude Opus 4.5
Anthropic(Claude)
200K
64K
Nov 24, 2025
Available
Grok 1.5
xAI(Grok)
131.1K
Mar 28, 2024
Legacy
Qwen2 family
Apache 2.0 turn.
Alibaba(Qwen)
131.1K
Jun 7, 2024
Legacy
Llama 3.1 (8B / 70B / 405B)
First Llama 128K.
Meta(Llama)
131.1K
Jul 23, 2024
Available
Grok 2
xAI(Grok)
131.1K
Aug 13, 2024
Legacy
Qwen2.5 family
Alibaba(Qwen)
131.1K
Sep 19, 2024
Legacy
Llama 3.2 (Vision + Edge)
Meta(Llama)
131.1K
Sep 25, 2024
Available
Llama 3.3 70B Instruct
Meta(Llama)
131.1K
Dec 6, 2024
Available
Grok 3
131K context across grok-3 / grok-3-fast / grok-3-mini / grok-3-mini-fast; still served via the API.
xAI(Grok)
131.1K
Feb 17, 2025
Available
Qwen3 family
Hybrid Thinking / Non-Thinking; 256K extensible.
Alibaba(Qwen)
131.1K
Apr 28, 2025
Legacy
GPT-4 Turbo
DevDay 2023; first OpenAI 128K.
OpenAI(ChatGPT)
128K
4.1K
Nov 6, 2023
Deprecated
DeepSeek-V2
DeepSeek(DeepSeek)
128K
May 6, 2024
Legacy
GPT-4o
Multimodal flagship; replaced as ChatGPT default by GPT-5.
OpenAI(ChatGPT)
128K
16.4K
May 13, 2024
Legacy
GPT-4o mini
Replaced GPT-3.5 Turbo on the mini tier.
OpenAI(ChatGPT)
128K
16.4K
Jul 18, 2024
Legacy
Mistral Large 2
Mistral(Mistral)
128K
Jul 24, 2024
Legacy
o1-preview
Reasoning track preview.
OpenAI(ChatGPT)
128K
32.8K
Sep 12, 2024
Deprecated
o1-mini
Smaller, cheaper reasoning companion to o1-preview.
OpenAI(ChatGPT)
128K
65.5K
Sep 12, 2024
Deprecated
DeepSeek-V3
DeepSeek(DeepSeek)
128K
Dec 26, 2024
Legacy
DeepSeek-R1
Reasoning track flagship.
DeepSeek(DeepSeek)
128K
Jan 20, 2025
Legacy
GPT-4.5 (Orion)
Largest pretraining run; research preview. API access ended Jul 14, 2025; removed from ChatGPT Jun 27, 2026.
OpenAI(ChatGPT)
128K
16.4K
Feb 27, 2025
Legacy
DeepSeek-V3.1
DeepSeek(DeepSeek)
128K
Aug 21, 2025
Legacy
DeepSeek-V3.2 (+ Speciale)
DeepSeek(DeepSeek)
128K
Dec 1, 2025
Available
Claude 2
First mainstream 100K window. Retired Jul 21, 2025.
Anthropic(Claude)
100K
Jul 11, 2023
Deprecated
Claude Instant 1.2
Retired Nov 6, 2024.
Anthropic(Claude)
100K
Aug 9, 2023
Deprecated
GPT-4-32K
Extended GPT-4 variant.
OpenAI(ChatGPT)
32.8K
Jun 13, 2023
Legacy
Gemini 1.0 Pro
Google(Gemini)
32.8K
8.2K
Dec 6, 2023
Legacy
Mixtral 8x7B
Mistral(Mistral)
32.8K
Dec 11, 2023
Legacy
Gemini 1.0 Ultra
Gemini Advanced launch.
Google(Gemini)
32.8K
8.2K
Feb 8, 2024
Legacy
Mistral Large
Mistral(Mistral)
32.8K
Feb 26, 2024
Legacy
Mistral Small 3
Superseded by Mistral Small 4.
Mistral(Mistral)
32.8K
Jan 30, 2025
Legacy
GPT-3.5 Turbo 16K
First mainstream 16K model.
OpenAI(ChatGPT)
16.4K
Jun 13, 2023
Legacy
Code Llama
Code-specialized.
Meta(Llama)
16.4K
Aug 24, 2023
Legacy
DeepSeek-Coder
DeepSeek(DeepSeek)
16.4K
Nov 2, 2023
Legacy
Claude 1
Retired Nov 6, 2024.
Anthropic(Claude)
9K
Mar 14, 2023
Deprecated
GPT-4
32K variant later in 2023.
OpenAI(ChatGPT)
8.2K
Mar 14, 2023
Legacy
Bard (LaMDA-based)
Pre-Gemini chat product.
Google(Gemini)
8.2K
Mar 21, 2023
Legacy
PaLM 2
Google(Gemini)
8.2K
May 10, 2023
Legacy
Qwen-7B
Lab debut.
Alibaba(Qwen)
8.2K
Aug 3, 2023
Legacy
Qwen-14B / Qwen 1.5
Alibaba(Qwen)
8.2K
Sep 25, 2023
Legacy
Mistral 7B
Sliding-window attention.
Mistral(Mistral)
8.2K
Sep 27, 2023
Legacy
Grok 1
xAI(Grok)
8.2K
Nov 4, 2023
Legacy
Llama 3 (8B / 70B)
Meta(Llama)
8.2K
Apr 18, 2024
Legacy
InstructGPT (text-davinci-002)
RLHF-tuned.
OpenAI(ChatGPT)
4.1K
Jan 27, 2022
Legacy
GPT-3.5 / ChatGPT launch
ChatGPT launch; later 16K variant.
OpenAI(ChatGPT)
4.1K
Nov 30, 2022
Legacy
Llama 2
Meta(Llama)
4.1K
Jul 18, 2023
Legacy
DeepSeek-LLM 7B / 67B
DeepSeek(DeepSeek)
4.1K
Nov 2, 2023
Legacy
GPT-3
The 2020 baseline.
OpenAI(ChatGPT)
2K
May 28, 2020
Legacy
LLaMA 1
Meta(Llama)
2K
Mar 3, 2023
Legacy
GPT-2
Doubled GPT-1.
OpenAI(ChatGPT)
1K
Feb 14, 2019
Legacy
GPT-1
Original transformer paper baseline.
OpenAI(ChatGPT)
512
Jun 11, 2018
Legacy

Showing all 121 models.

By provider

Each provider’s context-window history at a glance. The current maximum and the lifetime maximum may differ when a provider has rolled an early extended-context experiment back into a smaller production window.

Anthropic
Claude family
Versions →
Current
1M
Claude Opus 4.8
Lifetime max
1M
First model
9K
Mar 14, 2023
Most recent
Jun 9, 2026Claude Fable 51M
May 28, 2026Claude Opus 4.81M
Apr 16, 2026Claude Opus 4.71M
Feb 17, 2026Claude Sonnet 4.61M
Feb 5, 2026Claude Opus 4.61M
Nov 24, 2025Claude Opus 4.5200K
OpenAI
ChatGPT family
Versions →
Current
1.1M
GPT-5.5
Lifetime max
1.1M
First model
512
Jun 11, 2018
Most recent
Apr 23, 2026GPT-5.51.1M
Mar 5, 2026GPT-5.41.1M
Mar 3, 2026GPT-5.3 Instant400K
Feb 5, 2026GPT-5.3-Codex400K
Dec 11, 2025GPT-5.2400K
Nov 12, 2025GPT-5.1400K
Google
Gemini family
Versions →
Current
1M
Gemini 3.5 Flash
Lifetime max
2M
First model
8.2K
Mar 21, 2023
Most recent
May 19, 2026Gemini 3.5 Flash1M
Feb 19, 2026Gemini 3.1 Pro1M
Dec 17, 2025Gemini 3 Flash1M
Nov 18, 2025Gemini 3 Pro1M
Jul 22, 2025Gemini 2.5 Flash-Lite1M
xAI
Grok family
Versions →
Current
256K
Grok Build 0.1
Lifetime max
2M
First model
8.2K
Nov 4, 2023
Most recent
May 14, 2026Grok Build 0.1256K
Apr 17, 2026Grok 4.31M
Mar 10, 2026Grok 4.202M
Nov 19, 2025Grok 4.1 Fast2M
Nov 17, 2025Grok 4.1256K
Sep 19, 2025Grok 4 Fast2M
Meta
Llama family
Versions →
Current
262.1K
Muse Spark
Lifetime max
10M
First model
2K
Mar 3, 2023
Most recent
Apr 8, 2026Muse Spark262.1K
Apr 5, 2025Llama 4 Scout10M
Apr 5, 2025Llama 4 Maverick1M
Dec 6, 2024Llama 3.3 70B Instruct131.1K
Sep 25, 2024Llama 3.2 (Vision + Edge)131.1K
Jul 23, 2024Llama 3.1 (8B / 70B / 405B)131.1K
DeepSeek
DeepSeek family
Versions →
Current
1M
DeepSeek-V4-Pro
Lifetime max
1M
First model
16.4K
Nov 2, 2023
Most recent
Apr 24, 2026DeepSeek-V4-Pro1M
Apr 24, 2026DeepSeek-V4-Flash1M
Aug 21, 2025DeepSeek-V3.1128K
Jan 20, 2025DeepSeek-R1128K
Dec 26, 2024DeepSeek-V3128K
Mistral
Mistral family
Versions →
Current
256K
Mistral Medium 3.5
Lifetime max
256K
First model
8.2K
Sep 27, 2023
Most recent
Apr 28, 2026Mistral Medium 3.5256K
Mar 16, 2026Mistral Small 4256K
Dec 2, 2025Mistral Large 3256K
Jan 30, 2025Mistral Small 332.8K
Jul 24, 2024Mistral Large 2128K
Feb 26, 2024Mistral Large32.8K
Alibaba
Qwen family
Versions →
Current
1M
Qwen3.7-Plus
Lifetime max
1M
First model
8.2K
Aug 3, 2023
Most recent
May 31, 2026Qwen3.7-Plus1M
May 20, 2026Qwen3.7-Max1M
Apr 22, 2026Qwen3.6-27B262.1K
Apr 16, 2026Qwen3.6-35B-A3B262.1K
Feb 16, 2026Qwen3.5 family + Plus262.1K

Notes and caveats

Effective vs. nominal context. Several long-context models advertise large windows but degrade past a certain length on real-world tasks — this page records the documented nominal capacity, not benchmark-measured effective length. The latter is benchmark-dependent and out of scope per the section’s no-benchmarks rule.

Input vs. output limits. Most providers document a single “context window” that includes both prompt and response tokens. A few (notably the OpenAI o-series and the Anthropic Claude 4 generation) document a separate output-token cap. Where separately documented, the Output column shows it; otherwise the row treats the input window as the total budget.

Beta and tier-gated context. Some providers ship a default context size for the standard API and a larger one behind a beta flag, batch endpoint, or paid tier. The headline number on this page is the standard-API value documented as generally available; the per-row notes call out when a beta or tier-gated extended window exists.

Open-weights inference. For open-weights models (Llama, DeepSeek, Mistral, Qwen) the “context window” is the value the model card claims; serving infrastructure (vLLM, Together, Fireworks, Hugging Face Inference) often caps the deployed window lower for memory reasons. Always check the specific endpoint’s docs before relying on the full nominal window.

Tokenizer differences. One token is not a fixed unit across providers. OpenAI’s o200k tokenizer, Anthropic’s tokenizer, Google’s SentencePiece, and Meta’s tiktoken-derived tokenizers all produce different token counts for identical text. Compare context windows in tokens, not in characters or pages, but treat them as a same-provider apples-to-apples comparison rather than a strict cross-provider one.

About this page

Cross-family comparison page in the /ai/ section. Each row’s context-window value is sourced from the provider’s own model documentation — OpenAI’s platform.openai.com/docs/models, Anthropic’s platform.claude.com, Google’s ai.google.dev, xAI’s docs.x.ai, Meta’s llama.com and huggingface.co/meta-llama, DeepSeek’s api-docs.deepseek.com, Mistral’s docs.mistral.ai, and Alibaba’s help.aliyun.com/zh/dashscope.

The model roster mirrors the per-family pages already on this site — Claude, ChatGPT, Gemini, Grok, Llama, DeepSeek, Mistral, Qwen — so each row links back to the matching version-page entry for the full per-release context.

Refreshed daily. Each refresh re-verifies every row against the provider’s current documentation; values that changed since the previous run are updated and the row’s “as of” date is bumped. See release cadence for the cross-family ship-cadence picture this page complements.

Last updated: June 23, 2026. 121 models · 8 providers.

Last refreshed 2026-06-23 by Callisto — corrected Grok 4.20 context to 2M.