AI model lineage

How the major AI models are actually related

The major AI model lineages span 130 models across 8 per-provider family trees, connected by 125 typed edges — successions, post-trainings, fine-tunes, distillations, retrainings, and the GPT line / o-series merge into GPT-5, as of June 23, 2026. Each per-provider tree below cites the provider documentation that establishes each edge.

As of June 23, 2026.

Current production flagships

One node per provider, color-coded. Click any flagship to jump into the per-provider family tree below.

How to read the trees

Each box is a model. Box fill indicates lifecycle status (Current emerald, Available sky, Legacy amber, Deprecated rose). Each box has a left-edge stripe in the provider color. Each arrow is a typed edge:

Succession
Next release in the family timeline. Provider has not formally documented base-model continuity, so no claim is made about whether the new release is a fine-tune, post-training, or full retraining of the previous one.
Post-training
Provider documents this release as a post-training update over the same base model — RLHF, instruction tuning, behavior changes — without a new base.
Fine-tune
Provider documents this release as a fine-tune of the source model (often for a domain — code, vision, instruction-following).
Distilled from
Provider documents this release as a student model distilled from a larger teacher model in the same family.
New base
Provider explicitly documents this release as a new base model, retrained from scratch or with substantial architecture / data changes that make it not a fine-tune of the previous flagship.
Lines merged
Provider unifies two or more previously-separate model tracks into one release (e.g. OpenAI merging the GPT line and the o-series reasoning track into GPT-5).

Hover any node or edge for its full label, ship date, and (for edges) the source the relationship was sourced to. The "Lineage as text" block under each tree contains the same information for screen readers and crawlers.

Anthropic · Claude

Versions page →
MythosOpusSonnetHaiku2023202420252026Succession — Claude 1 → Claude 2Distilled from — Claude 1 → Claude Instant 1.2 Anthropic positioned Claude Instant as a smaller, faster sibling of the Claude flagship; sourced to the Claude 2 announcement.DistillPost-training — Claude 2 → Claude 2.1 Anthropic documented Claude 2.1 as an iterative update on Claude 2 with the 200K context extension, not a new base.Post-trNew base — Claude 2.1 → Claude 3 Opus Anthropic introduced the Claude 3 family as a new generation of base models, not a Claude 2 fine-tune.New baseSuccession — Claude 3 Opus → Claude 3 Sonnet Anthropic shipped Opus / Sonnet / Haiku as the three tiers of the Claude 3 generation simultaneously.Succession — Claude 3 Opus → Claude 3 HaikuSuccession — Claude 3 Sonnet → Claude 3.5 SonnetPost-training — Claude 3.5 Sonnet → 3.5 Sonnet (new) Anthropic documented the October 2024 'new 3.5 Sonnet' as an upgraded version under the same model id.Post-trDistilled from — 3.5 Sonnet (new) → Claude 3.5 Haiku Anthropic documented Claude 3.5 Haiku as a smaller sibling sharing the 3.5 generation training.DistillSuccession — 3.5 Sonnet (new) → Claude 3.7 SonnetNew base — Claude 3.7 Sonnet → Claude Sonnet 4 Anthropic introduced Claude 4 as a new generation, with Opus 4 and Sonnet 4 as new bases rather than fine-tunes of 3.7.New baseNew base — Claude 3.7 Sonnet → Claude Opus 4New baseSuccession — Claude Opus 4 → Claude Opus 4.1Succession — Claude Opus 4.1 → Claude Opus 4.5Succession — Claude Opus 4.5 → Claude Opus 4.6Succession — Claude Opus 4.6 → Claude Opus 4.7Succession — Claude Sonnet 4 → Claude Sonnet 4.5Succession — Claude Sonnet 4.5 → Claude Sonnet 4.6Succession — Claude 3.5 Haiku → Claude Haiku 4.5Succession — Claude Opus 4.7 → Claude Opus 4.8Succession — Claude Opus 4.8 → Claude Fable 5 Anthropic introduced Claude Fable 5 (June 9, 2026) as the first generally-available model in the new Mythos class — a tier positioned above Opus in capability. Opus 4.8 is described as the 'next-most-capable model' that Fable 5 falls back to on safeguard-triggered queries; Anthropic does not disclose Fable 5 as a fine-tune, post-training, or distillation of Opus 4.8 — per the anthropic.com/news/claude-fable-5-mythos-5 announcement.Claude 1Claude 1 · Mar 14, 2023 · LegacyClaude 2Claude 2 · Jul 11, 2023 · LegacyClaude Instant 1.2Claude Instant 1.2 · Aug 9, 2023 · LegacyClaude 2.1Claude 2.1 · Nov 21, 2023 · LegacyClaude 3 OpusClaude 3 Opus · Mar 4, 2024 · LegacyClaude 3 SonnetClaude 3 Sonnet · Mar 4, 2024 · LegacyClaude 3 HaikuClaude 3 Haiku · Mar 13, 2024 · LegacyClaude 3.5 SonnetClaude 3.5 Sonnet · Jun 20, 2024 · Legacy3.5 Sonnet (new)3.5 Sonnet (new) · Oct 22, 2024 · LegacyClaude 3.5 HaikuClaude 3.5 Haiku · Nov 4, 2024 · LegacyClaude 3.7 SonnetClaude 3.7 Sonnet · Feb 24, 2025 · LegacyClaude Opus 4Claude Opus 4 · May 22, 2025 · LegacyClaude Sonnet 4Claude Sonnet 4 · May 22, 2025 · LegacyClaude Opus 4.1Claude Opus 4.1 · Aug 5, 2025 · LegacyClaude Sonnet 4.5Claude Sonnet 4.5 · Sep 29, 2025 · AvailableClaude Haiku 4.5Claude Haiku 4.5 · Oct 15, 2025 · CurrentClaude Opus 4.5Claude Opus 4.5 · Nov 24, 2025 · AvailableClaude Opus 4.6Claude Opus 4.6 · Feb 5, 2026 · AvailableClaude Sonnet 4.6Claude Sonnet 4.6 · Feb 17, 2026 · CurrentClaude Opus 4.7Claude Opus 4.7 · Apr 16, 2026 · AvailableClaude Opus 4.8Claude Opus 4.8 · May 28, 2026 · CurrentClaude Fable 5Claude Fable 5 · Jun 9, 2026 · Available
Lineage as text (21 edges) ↓

Every edge in the Claude tree above, in chronological order. Each line shows: destination model, edge type, source model, and the provider documentation that establishes the relationship (where one was disclosed).

OpenAI · ChatGPT

Versions page →
GPT lineo-seriesGPT-5Chat sub2018–222023202420252026Post-training — GPT-3 → GPT-3.5 OpenAI's GPT-3.5 (text-davinci series and ChatGPT launch) was an RLHF-fine-tuned descendant of the GPT-3 base, per OpenAI's InstructGPT paper and the ChatGPT launch post.Post-trNew base — GPT-3.5 → GPT-4 OpenAI introduced GPT-4 as a new base model, not a GPT-3.5 fine-tune.New basePost-training — GPT-4 → GPT-4 Turbo OpenAI documented GPT-4 Turbo at DevDay 2023 as the same generation with extended context and updated training data.Post-trNew base — GPT-4 Turbo → GPT-4o OpenAI introduced GPT-4o as a new natively-multimodal base model.New baseDistilled from — GPT-4o → GPT-4o mini OpenAI positioned GPT-4o mini as the smaller, distilled sibling of GPT-4o.DistillSuccession — GPT-4o → GPT-4.1New base — GPT-4o → o1-preview OpenAI introduced the o-series as a separate reasoning track; o1 was trained to use long chain-of-thought at inference time.New baseSuccession — o1-preview → o1Succession — o1 → o3-miniSuccession — o3-mini → o3Distilled from — o3 → o4-mini OpenAI positioned o4-mini as the smaller sibling shipped alongside o3.DistillLines merged — GPT-4.1 → GPT-5 OpenAI introduced GPT-5 as a unified router that combines the GPT chat line and the o-series reasoning track into one model id.MergedLines merged — o3 → GPT-5MergedSuccession — GPT-5 → GPT-5.1Succession — GPT-5.1 → GPT-5.2Succession — GPT-5.2 → GPT-5.3-CodexPost-training — GPT-5.2 → GPT-5.3 Instant OpenAI documented GPT-5.3 Instant (March 3, 2026) as a ChatGPT default update focused on conversational quality, separate from the Codex-specialized GPT-5.3-Codex shipped two days earlier — sourced to the GPT-5.3 Instant announcement at openai.com/index/gpt-5-3-instant/.Post-trSuccession — GPT-5.3-Codex → GPT-5.4Succession — GPT-5.4 → GPT-5.5GPT-3GPT-3 · May 28, 2020 · LegacyGPT-3.5GPT-3.5 · Nov 30, 2022 · LegacyGPT-4GPT-4 · Mar 14, 2023 · LegacyGPT-4 TurboGPT-4 Turbo · Nov 6, 2023 · LegacyGPT-4oGPT-4o · May 13, 2024 · LegacyGPT-4o miniGPT-4o mini · Jul 18, 2024 · LegacyGPT-4.1GPT-4.1 · Apr 14, 2025 · Legacyo1-previewo1-preview · Sep 12, 2024 · Legacyo1o1 · Dec 5, 2024 · Legacyo3-minio3-mini · Jan 31, 2025 · Legacyo3o3 · Apr 16, 2025 · Legacyo4-minio4-mini · Apr 16, 2025 · LegacyGPT-5GPT-5 · Aug 7, 2025 · LegacyGPT-5.1GPT-5.1 · Nov 12, 2025 · LegacyGPT-5.2GPT-5.2 · Dec 11, 2025 · AvailableGPT-5.3-CodexGPT-5.3-Codex · Feb 5, 2026 · AvailableGPT-5.3 InstantGPT-5.3 Instant · Mar 3, 2026 · AvailableGPT-5.4GPT-5.4 · Mar 5, 2026 · AvailableGPT-5.5GPT-5.5 · Apr 23, 2026 · Current
Lineage as text (19 edges) ↓

Every edge in the ChatGPT tree above, in chronological order. Each line shows: destination model, edge type, source model, and the provider documentation that establishes the relationship (where one was disclosed).

  • GPT-3.5 (Nov 30, 2022)post-training of GPT-3 — OpenAI's GPT-3.5 (text-davinci series and ChatGPT launch) was an RLHF-fine-tuned descendant of the GPT-3 base, per OpenAI's InstructGPT paper and the ChatGPT launch post.
  • GPT-4 (Mar 14, 2023)new base of GPT-3.5 — OpenAI introduced GPT-4 as a new base model, not a GPT-3.5 fine-tune.
  • GPT-4 Turbo (Nov 6, 2023)post-training of GPT-4 — OpenAI documented GPT-4 Turbo at DevDay 2023 as the same generation with extended context and updated training data.
  • GPT-4o (May 13, 2024)new base of GPT-4 Turbo — OpenAI introduced GPT-4o as a new natively-multimodal base model.
  • GPT-4o mini (Jul 18, 2024)distilled from of GPT-4o — OpenAI positioned GPT-4o mini as the smaller, distilled sibling of GPT-4o.
  • o1-preview (Sep 12, 2024)new base of GPT-4o — OpenAI introduced the o-series as a separate reasoning track; o1 was trained to use long chain-of-thought at inference time.
  • o1 (Dec 5, 2024)succession of o1-preview
  • o3-mini (Jan 31, 2025)succession of o1
  • GPT-4.1 (Apr 14, 2025)succession of GPT-4o
  • o4-mini (Apr 16, 2025)distilled from of o3 — OpenAI positioned o4-mini as the smaller sibling shipped alongside o3.
  • o3 (Apr 16, 2025)succession of o3-mini
  • GPT-5 (Aug 7, 2025)lines merged of GPT-4.1 — OpenAI introduced GPT-5 as a unified router that combines the GPT chat line and the o-series reasoning track into one model id.
  • GPT-5 (Aug 7, 2025)lines merged of o3
  • GPT-5.1 (Nov 12, 2025)succession of GPT-5
  • GPT-5.2 (Dec 11, 2025)succession of GPT-5.1
  • GPT-5.3-Codex (Feb 5, 2026)succession of GPT-5.2
  • GPT-5.3 Instant (Mar 3, 2026)post-training of GPT-5.2 — OpenAI documented GPT-5.3 Instant (March 3, 2026) as a ChatGPT default update focused on conversational quality, separate from the Codex-specialized GPT-5.3-Codex shipped two days earlier — sourced to the GPT-5.3 Instant announcement at openai.com/index/gpt-5-3-instant/.
  • GPT-5.4 (Mar 5, 2026)succession of GPT-5.3-Codex
  • GPT-5.5 (Apr 23, 2026)succession of GPT-5.4

Google · Gemini

Versions page →
ProFlashLite2023202420252026New base — Bard (LaMDA) → PaLM 2 Google introduced PaLM 2 as a new base model powering an upgraded Bard.New baseNew base — PaLM 2 → Gemini 1.0 Pro Google's Gemini 1.0 was a new natively-multimodal architecture, not a PaLM 2 fine-tune.New baseSuccession — Gemini 1.0 Pro → Gemini 1.0 UltraNew base — Gemini 1.0 Ultra → Gemini 1.5 Pro Google introduced Gemini 1.5 with a new mixture-of-experts architecture and the 1M context, not a 1.0 fine-tune.New baseDistilled from — Gemini 1.5 Pro → Gemini 1.5 Flash Google described Gemini 1.5 Flash as a distilled smaller sibling of 1.5 Pro.DistillSuccession — Gemini 1.5 Flash → Gemini 2.0 FlashSuccession — Gemini 1.5 Pro → Gemini 2.0 Pro ExpSuccession — Gemini 2.0 Pro Exp → Gemini 2.5 ProSuccession — Gemini 2.0 Flash → Gemini 2.5 FlashDistilled from — Gemini 2.5 Flash → 2.5 Flash-Lite Google positioned Flash-Lite as the smallest sibling of the 2.5 Flash family.DistillNew base — Gemini 2.5 Pro → Gemini 3 Pro Google introduced Gemini 3 as a new generation flagship.New baseSuccession — Gemini 2.5 Flash → Gemini 3 FlashSuccession — Gemini 3 Pro → Gemini 3.1 ProDistilled from — Gemini 3 Flash → 3.1 Flash-LiteDistillNew base — Gemini 3 Flash → Gemini 3.5 Flash Google introduced Gemini 3.5 Flash (May 19, 2026) at Google I/O as the first Flash to outperform the prior generation's Pro flagship on hard coding and agentic benchmarks — a new generation in the Flash track. Per the blog.google announcement, it powers Gemini Spark and is the new default behind the Gemini app and Search AI Mode.New baseBard (LaMDA)Bard (LaMDA) · Mar 21, 2023 · LegacyPaLM 2PaLM 2 · May 10, 2023 · LegacyGemini 1.0 ProGemini 1.0 Pro · Dec 6, 2023 · LegacyGemini 1.0 UltraGemini 1.0 Ultra · Feb 8, 2024 · LegacyGemini 1.5 ProGemini 1.5 Pro · Feb 15, 2024 · LegacyGemini 1.5 FlashGemini 1.5 Flash · May 14, 2024 · LegacyGemini 2.0 FlashGemini 2.0 Flash · Dec 11, 2024 · LegacyGemini 2.0 Pro ExpGemini 2.0 Pro Exp · Feb 5, 2025 · LegacyGemini 2.5 ProGemini 2.5 Pro · Mar 25, 2025 · AvailableGemini 2.5 FlashGemini 2.5 Flash · Jun 17, 2025 · Available2.5 Flash-Lite2.5 Flash-Lite · Jul 22, 2025 · AvailableGemini 3 ProGemini 3 Pro · Nov 18, 2025 · LegacyGemini 3 FlashGemini 3 Flash · Dec 17, 2025 · AvailableGemini 3.1 ProGemini 3.1 Pro · Feb 19, 2026 · Current3.1 Flash-Lite3.1 Flash-Lite · Mar 3, 2026 · AvailableGemini 3.5 FlashGemini 3.5 Flash · May 19, 2026 · Current
Lineage as text (15 edges) ↓

Every edge in the Gemini tree above, in chronological order. Each line shows: destination model, edge type, source model, and the provider documentation that establishes the relationship (where one was disclosed).

FlagshipSpecialized2023202420252026Post-training — Grok 1 → Grok 1.5 xAI documented Grok 1.5 as a successor that extended Grok 1's context and reasoning, broadly within the same generation.Post-trNew base — Grok 1.5 → Grok 2 xAI introduced Grok 2 as a new base.New baseNew base — Grok 2 → Grok 3 xAI introduced Grok 3 trained on the Memphis Colossus cluster as a new base.New baseNew base — Grok 3 → Grok 4New baseNew base — Grok 4 → grok-code-fast-1 xAI documented grok-code-fast-1 (August 28, 2025) as a from-scratch architecture optimized for agentic coding rather than a fine-tune of Grok 4 — per the x.ai/news/grok-code-fast-1 announcement.New baseDistilled from — Grok 4 → Grok 4 Fast xAI positioned Grok 4 Fast (September 19, 2025) as a faster, cheaper Grok 4 with a 2M-token context window — per the x.ai/news/grok-4-fast announcement.DistillPost-training — Grok 4 → Grok 4.1Post-trDistilled from — Grok 4.1 → Grok 4.1 Fast xAI positioned Grok 4.1 Fast as a smaller, faster sibling shipped two days after 4.1.DistillSuccession — Grok 4.1 → Grok 4.20Succession — Grok 4.20 → Grok 4.3Succession — grok-code-fast-1 → Grok Build 0.1 xAI documented Grok Build 0.1 (early-access launch May 19, 2026; the Grok Build CLI / TUI launched in beta five days earlier on May 14, 2026) as the successor to grok-code-fast-1, purpose-built for agentic coding workflows — per the x.ai/news/grok-build-0-1 announcement and the May 15, 2026 deprecation of grok-code-fast-1.Succession — Grok Build 0.1 → Composer 2.5 xAI shipped Composer 2.5 (June 1, 2026) inside the Grok Build /model menu as a fast agentic-coding sibling to Grok Build 0.1; the original launch coverage identified Composer 2.5 as built on the open-source Kimi K2.5 checkpoint (Moonshot AI) and post-trained with roughly 25× more synthetic agentic tasks than Composer 2, so the line of descent is external — xAI does not disclose Composer 2.5 as a fine-tune, post-training, or distillation of any prior xAI model. The x.ai/news/composer-2-5 page itself has since been pared back; the Kimi K2.5 attribution is preserved on /ai/grok/versions/#grok-composer-2-5 with the original launch citation.Grok 1Grok 1 · Nov 4, 2023 · LegacyGrok 1.5Grok 1.5 · Mar 28, 2024 · LegacyGrok 2Grok 2 · Aug 13, 2024 · LegacyGrok 3Grok 3 · Feb 17, 2025 · AvailableGrok 4Grok 4 · Jul 9, 2025 · Availablegrok-code-fast-1grok-code-fast-1 · Aug 28, 2025 · AvailableGrok 4 FastGrok 4 Fast · Sep 19, 2025 · AvailableGrok 4.1Grok 4.1 · Nov 17, 2025 · LegacyGrok 4.1 FastGrok 4.1 Fast · Nov 19, 2025 · AvailableGrok 4.20Grok 4.20 · Mar 10, 2026 · AvailableGrok 4.3Grok 4.3 · Apr 17, 2026 · CurrentGrok Build 0.1Grok Build 0.1 · May 19, 2026 · CurrentComposer 2.5Composer 2.5 · Jun 1, 2026 · Current
Lineage as text (12 edges) ↓

Every edge in the Grok tree above, in chronological order. Each line shows: destination model, edge type, source model, and the provider documentation that establishes the relationship (where one was disclosed).

  • Grok 1.5 (Mar 28, 2024)post-training of Grok 1 — xAI documented Grok 1.5 as a successor that extended Grok 1's context and reasoning, broadly within the same generation.
  • Grok 2 (Aug 13, 2024)new base of Grok 1.5 — xAI introduced Grok 2 as a new base.
  • Grok 3 (Feb 17, 2025)new base of Grok 2 — xAI introduced Grok 3 trained on the Memphis Colossus cluster as a new base.
  • Grok 4 (Jul 9, 2025)new base of Grok 3
  • grok-code-fast-1 (Aug 28, 2025)new base of Grok 4 — xAI documented grok-code-fast-1 (August 28, 2025) as a from-scratch architecture optimized for agentic coding rather than a fine-tune of Grok 4 — per the x.ai/news/grok-code-fast-1 announcement.
  • Grok 4 Fast (Sep 19, 2025)distilled from of Grok 4 — xAI positioned Grok 4 Fast (September 19, 2025) as a faster, cheaper Grok 4 with a 2M-token context window — per the x.ai/news/grok-4-fast announcement.
  • Grok 4.1 (Nov 17, 2025)post-training of Grok 4
  • Grok 4.1 Fast (Nov 19, 2025)distilled from of Grok 4.1 — xAI positioned Grok 4.1 Fast as a smaller, faster sibling shipped two days after 4.1.
  • Grok 4.20 (Mar 10, 2026)succession of Grok 4.1
  • Grok 4.3 (Apr 17, 2026)succession of Grok 4.20
  • Grok Build 0.1 (May 19, 2026)succession of grok-code-fast-1 — xAI documented Grok Build 0.1 (early-access launch May 19, 2026; the Grok Build CLI / TUI launched in beta five days earlier on May 14, 2026) as the successor to grok-code-fast-1, purpose-built for agentic coding workflows — per the x.ai/news/grok-build-0-1 announcement and the May 15, 2026 deprecation of grok-code-fast-1.
  • Composer 2.5 (Jun 1, 2026)succession of Grok Build 0.1 — xAI shipped Composer 2.5 (June 1, 2026) inside the Grok Build /model menu as a fast agentic-coding sibling to Grok Build 0.1; the original launch coverage identified Composer 2.5 as built on the open-source Kimi K2.5 checkpoint (Moonshot AI) and post-trained with roughly 25× more synthetic agentic tasks than Composer 2, so the line of descent is external — xAI does not disclose Composer 2.5 as a fine-tune, post-training, or distillation of any prior xAI model. The x.ai/news/composer-2-5 page itself has since been pared back; the Kimi K2.5 attribution is preserved on /ai/grok/versions/#grok-composer-2-5 with the original launch citation.

Meta · Llama

Versions page →
FlagshipSpecializedClosed2023202420252026New base — LLaMA 1 → Llama 2 Meta's Llama 2 paper documents it as a new pretraining run with substantially larger data and the open license shift.New baseFine-tune — Llama 2 → Code Llama Meta's Code Llama paper documents it as a fine-tune of Llama 2 on code data.Fine-tFine-tune — Code Llama → Code Llama 70B Meta's Code Llama 70B release (January 29, 2024) is described as a continued-pretrained / fine-tuned 70B variant of the Code Llama line, sourced from the Llama 2 70B base — per the ai.meta.com Code Llama 70B announcement.Fine-tNew base — Llama 2 → Llama 3 Meta's Llama 3 release notes document it as a new pretraining run with updated tokenizer.New basePost-training — Llama 3 → Llama 3.1 Meta's Llama 3.1 release added the 405B size and the long-context post-training; documented as same generation.Post-trFine-tune — Llama 3.1 → Llama Guard 3 Meta's Llama Guard 3 family (July 23, 2024, alongside Llama 3.1) is a safeguard fine-tune of the Llama 3.1 base model line — per the Llama 3.1 release notes documenting Guard 3 8B / 1B / 11B-Vision as classifier fine-tunes.Fine-tPost-training — Llama 3.1 → Llama 3.2 Meta's Llama 3.2 release added vision and edge-sized variants.Post-trPost-training — Llama 3.2 → Llama 3.3 70BPost-trNew base — Llama 3.3 70B → Llama 4 Scout Meta's Llama 4 release introduced a new mixture-of-experts architecture as a new base.New baseNew base — Llama 3.3 70B → Llama 4 MaverickNew baseFine-tune — Llama 4 Maverick → Llama Guard 4 Meta's Llama Guard 4 12B (April 29, 2025) is a multimodal safeguard fine-tune released alongside the Llama 4 family — per the Llama 4 family release notes documenting Guard 4 as a Llama-4-line classifier.Fine-tNew base — Llama 4 Maverick → Muse Spark Meta described Muse Spark as the closed-weights successor to the Llama line, trained at Meta Superintelligence Labs.New baseLLaMA 1LLaMA 1 · Mar 3, 2023 · LegacyLlama 2Llama 2 · Jul 18, 2023 · LegacyCode LlamaCode Llama · Aug 24, 2023 · LegacyCode Llama 70BCode Llama 70B · Jan 29, 2024 · AvailableLlama 3Llama 3 · Apr 18, 2024 · LegacyLlama 3.1Llama 3.1 · Jul 23, 2024 · AvailableLlama Guard 3Llama Guard 3 · Jul 23, 2024 · AvailableLlama 3.2Llama 3.2 · Sep 25, 2024 · AvailableLlama 3.3 70BLlama 3.3 70B · Dec 6, 2024 · AvailableLlama 4 ScoutLlama 4 Scout · Apr 5, 2025 · AvailableLlama 4 MaverickLlama 4 Maverick · Apr 5, 2025 · CurrentLlama Guard 4Llama Guard 4 · Apr 29, 2025 · AvailableMuse SparkMuse Spark · Apr 8, 2026 · Current
Lineage as text (12 edges) ↓

Every edge in the Llama tree above, in chronological order. Each line shows: destination model, edge type, source model, and the provider documentation that establishes the relationship (where one was disclosed).

  • Llama 2 (Jul 18, 2023)new base of LLaMA 1 — Meta's Llama 2 paper documents it as a new pretraining run with substantially larger data and the open license shift.
  • Code Llama (Aug 24, 2023)fine-tune of Llama 2 — Meta's Code Llama paper documents it as a fine-tune of Llama 2 on code data.
  • Code Llama 70B (Jan 29, 2024)fine-tune of Code Llama — Meta's Code Llama 70B release (January 29, 2024) is described as a continued-pretrained / fine-tuned 70B variant of the Code Llama line, sourced from the Llama 2 70B base — per the ai.meta.com Code Llama 70B announcement.
  • Llama 3 (Apr 18, 2024)new base of Llama 2 — Meta's Llama 3 release notes document it as a new pretraining run with updated tokenizer.
  • Llama 3.1 (Jul 23, 2024)post-training of Llama 3 — Meta's Llama 3.1 release added the 405B size and the long-context post-training; documented as same generation.
  • Llama Guard 3 (Jul 23, 2024)fine-tune of Llama 3.1 — Meta's Llama Guard 3 family (July 23, 2024, alongside Llama 3.1) is a safeguard fine-tune of the Llama 3.1 base model line — per the Llama 3.1 release notes documenting Guard 3 8B / 1B / 11B-Vision as classifier fine-tunes.
  • Llama 3.2 (Sep 25, 2024)post-training of Llama 3.1 — Meta's Llama 3.2 release added vision and edge-sized variants.
  • Llama 3.3 70B (Dec 6, 2024)post-training of Llama 3.2
  • Llama 4 Scout (Apr 5, 2025)new base of Llama 3.3 70B — Meta's Llama 4 release introduced a new mixture-of-experts architecture as a new base.
  • Llama 4 Maverick (Apr 5, 2025)new base of Llama 3.3 70B
  • Llama Guard 4 (Apr 29, 2025)fine-tune of Llama 4 Maverick — Meta's Llama Guard 4 12B (April 29, 2025) is a multimodal safeguard fine-tune released alongside the Llama 4 family — per the Llama 4 family release notes documenting Guard 4 as a Llama-4-line classifier.
  • Muse Spark (Apr 8, 2026)new base of Llama 4 Maverick — Meta described Muse Spark as the closed-weights successor to the Llama line, trained at Meta Superintelligence Labs.

DeepSeek · DeepSeek

Versions page →
FlagshipReasoningCode2023202420252026New base — DeepSeek-LLM → DeepSeek-V2 DeepSeek-V2 paper introduced the DeepSeekMoE architecture and Multi-head Latent Attention as new base.New baseFine-tune — DeepSeek-LLM → DeepSeek-Coder DeepSeek-Coder shipped alongside the base LLM as a code-specialized fine-tune.Fine-tNew base — DeepSeek-V2 → DeepSeek-V3 DeepSeek-V3 paper introduced the 671B-parameter MoE as a new base.New basePost-training — DeepSeek-V3 → DeepSeek-R1 DeepSeek-R1 paper documents R1 as RL post-training over the V3 base — RL-only emergent chain-of-thought.Post-trPost-training — DeepSeek-V3 → DeepSeek-V3-0324 DeepSeek-V3-0324 release (March 24, 2025) is documented as an MIT-relicensed update to the V3 base with improved reasoning / coding / tool use — per the HuggingFace card at deepseek-ai/DeepSeek-V3-0324.Post-trPost-training — DeepSeek-R1 → DeepSeek-R1-0528 DeepSeek-R1-0528 (May 28, 2025) is documented as an R1 update with the same 671B / 37B-active MoE architecture, improved reasoning depth and tool use — per the HuggingFace card at deepseek-ai/DeepSeek-R1-0528.Post-trPost-training — DeepSeek-V3-0324 → DeepSeek-V3.1 DeepSeek-V3.1 (August 21, 2025) merged the V-series and R-series into a hybrid Thinking / Non-Thinking architecture — per the api-docs.deepseek.com/news/news250821 release note (the generic api-docs.deepseek.com thinking-mode guide was subsequently rewritten around V4-pro).Post-trLines merged — DeepSeek-R1-0528 → DeepSeek-V3.1 DeepSeek documented V3.1's hybrid Thinking / Non-Thinking architecture as the architectural convergence of the V-series and the standalone R-series — no further standalone R-series releases shipped after R1-0528.MergedNew base — DeepSeek-V3.1 → DeepSeek-V3.2-Exp DeepSeek-V3.2-Exp (September 29, 2025) introduced DeepSeek Sparse Attention (DSA) as an explicitly experimental release branched off V3.1 — per the github.com/deepseek-ai/DeepSeek-V3.2-Exp release notes.New basePost-training — DeepSeek-V3.2-Exp → DeepSeek-V3.2 DeepSeek-V3.2 stable (December 1, 2025) productionized the DSA recipe from V3.2-Exp into the 685B-parameter MoE flagship — per the arXiv 2512.02556 technical paper.Post-trNew base — DeepSeek-V3.2 → DeepSeek-V4-Pro DeepSeek-V4 introduced a new MoE base with Thinking and Non-Thinking modes.New baseDistilled from — DeepSeek-V4-Pro → DeepSeek-V4-Flash DeepSeek positioned V4-Flash as the smaller sibling shipped alongside V4-Pro.DistillDeepSeek-LLMDeepSeek-LLM · Nov 2, 2023 · LegacyDeepSeek-CoderDeepSeek-Coder · Nov 2, 2023 · LegacyDeepSeek-V2DeepSeek-V2 · May 6, 2024 · LegacyDeepSeek-V3DeepSeek-V3 · Dec 26, 2024 · LegacyDeepSeek-R1DeepSeek-R1 · Jan 20, 2025 · LegacyDeepSeek-V3-0324DeepSeek-V3-0324 · Mar 24, 2025 · LegacyDeepSeek-R1-0528DeepSeek-R1-0528 · May 28, 2025 · AvailableDeepSeek-V3.1DeepSeek-V3.1 · Aug 21, 2025 · LegacyDeepSeek-V3.2-ExpDeepSeek-V3.2-Exp · Sep 29, 2025 · LegacyDeepSeek-V3.2DeepSeek-V3.2 · Dec 1, 2025 · AvailableDeepSeek-V4-ProDeepSeek-V4-Pro · Apr 24, 2026 · CurrentDeepSeek-V4-FlashDeepSeek-V4-Flash · Apr 24, 2026 · Current
Lineage as text (12 edges) ↓

Every edge in the DeepSeek tree above, in chronological order. Each line shows: destination model, edge type, source model, and the provider documentation that establishes the relationship (where one was disclosed).

  • DeepSeek-Coder (Nov 2, 2023)fine-tune of DeepSeek-LLM — DeepSeek-Coder shipped alongside the base LLM as a code-specialized fine-tune.
  • DeepSeek-V2 (May 6, 2024)new base of DeepSeek-LLM — DeepSeek-V2 paper introduced the DeepSeekMoE architecture and Multi-head Latent Attention as new base.
  • DeepSeek-V3 (Dec 26, 2024)new base of DeepSeek-V2 — DeepSeek-V3 paper introduced the 671B-parameter MoE as a new base.
  • DeepSeek-R1 (Jan 20, 2025)post-training of DeepSeek-V3 — DeepSeek-R1 paper documents R1 as RL post-training over the V3 base — RL-only emergent chain-of-thought.
  • DeepSeek-V3-0324 (Mar 24, 2025)post-training of DeepSeek-V3 — DeepSeek-V3-0324 release (March 24, 2025) is documented as an MIT-relicensed update to the V3 base with improved reasoning / coding / tool use — per the HuggingFace card at deepseek-ai/DeepSeek-V3-0324.
  • DeepSeek-R1-0528 (May 28, 2025)post-training of DeepSeek-R1 — DeepSeek-R1-0528 (May 28, 2025) is documented as an R1 update with the same 671B / 37B-active MoE architecture, improved reasoning depth and tool use — per the HuggingFace card at deepseek-ai/DeepSeek-R1-0528.
  • DeepSeek-V3.1 (Aug 21, 2025)lines merged of DeepSeek-R1-0528 — DeepSeek documented V3.1's hybrid Thinking / Non-Thinking architecture as the architectural convergence of the V-series and the standalone R-series — no further standalone R-series releases shipped after R1-0528.
  • DeepSeek-V3.1 (Aug 21, 2025)post-training of DeepSeek-V3-0324 — DeepSeek-V3.1 (August 21, 2025) merged the V-series and R-series into a hybrid Thinking / Non-Thinking architecture — per the api-docs.deepseek.com/news/news250821 release note (the generic api-docs.deepseek.com thinking-mode guide was subsequently rewritten around V4-pro).
  • DeepSeek-V3.2-Exp (Sep 29, 2025)new base of DeepSeek-V3.1 — DeepSeek-V3.2-Exp (September 29, 2025) introduced DeepSeek Sparse Attention (DSA) as an explicitly experimental release branched off V3.1 — per the github.com/deepseek-ai/DeepSeek-V3.2-Exp release notes.
  • DeepSeek-V3.2 (Dec 1, 2025)post-training of DeepSeek-V3.2-Exp — DeepSeek-V3.2 stable (December 1, 2025) productionized the DSA recipe from V3.2-Exp into the 685B-parameter MoE flagship — per the arXiv 2512.02556 technical paper.
  • DeepSeek-V4-Pro (Apr 24, 2026)new base of DeepSeek-V3.2 — DeepSeek-V4 introduced a new MoE base with Thinking and Non-Thinking modes.
  • DeepSeek-V4-Flash (Apr 24, 2026)distilled from of DeepSeek-V4-Pro — DeepSeek positioned V4-Flash as the smaller sibling shipped alongside V4-Pro.

Mistral · Mistral

Versions page →
OpenProprietarySpecialized2023202420252026New base — Mistral 7B → Mixtral 8x7B Mixtral 8x7B introduced the sparse mixture-of-experts architecture as a new base, not a Mistral 7B fine-tune.New baseNew base — Mixtral 8x7B → Mistral Large Mistral Large was released as a proprietary flagship under the Mistral Research / commercial license, not a Mixtral fine-tune.New baseNew base — Mistral Large → Mistral Large 2New baseSuccession — Mistral Large 2 → Codestral 25.01 Codestral 25.01 (January 13, 2025) is documented as the next-generation Codestral coding flagship — per mistral.ai/news/codestral-2501. Mistral does not disclose Codestral 25.01 as a fine-tune, post-training, or distillation of Mistral Large 2 (the Codestral line is its own track since the original Codestral 22B, May 2024).New base — Mistral Large 2 → Mistral Small 3 Mistral Small 3 marked the December 2025 'Mistral 3' family relaunch under Apache 2.0 with new bases.New baseSuccession — Mistral Small 3 → Mistral Medium 3 Mistral Medium 3 (May 7, 2025) is the mid-tier proprietary flagship — per mistral.ai/news/mistral-medium-3.Fine-tune — Mistral Small 3 → Devstral Small Devstral Small (May 21, 2025) is documented as a 24B coding-agent fine-tune of Mistral Small 3.1 — per mistral.ai/news/devstral.Fine-tFine-tune — Mistral Small 3 → Magistral Magistral Small (June 10, 2025) is fine-tuned for multi-step reasoning with traceable chain-of-thought, building on the Mistral Small 3 lineage — per mistral.ai/news/magistral (arXiv 2506.10910).Fine-tSuccession — Codestral 25.01 → Codestral 25.08 Codestral 25.08 (August 2025) is the next Codestral flagship — per mistral.ai/news/codestral-25-08.Succession — Mistral Medium 3 → Mistral Large 3Succession — Mistral Small 3 → Ministral 3 Ministral 3 (December 2, 2025) is the small-end open-weights line in the Mistral 3 family relaunch — per the mistral.ai/news/mistral-3 announcement.Succession — Devstral Small → Devstral 2 Devstral 2 (December 10, 2025) is documented as the next-generation Devstral coding-agent line — per mistral.ai/news/devstral-2-vibe-cli.Succession — Mistral Large 3 → Mistral OCR 3 Mistral OCR 3 (December 17, 2025) ships alongside the Mistral 3 family on la Plateforme as the upgraded structured-document model; Mistral's announcement positions OCR 3 as a major upgrade over Mistral OCR 2 (74% win rate) and does not document any lineage from Mistral Large 3 — per mistral.ai/news/mistral-ocr-3. The OCR-3 ↔ Mistral 3 family edge here is a co-shipped successor relationship, not a disclosed base-model claim.Succession — Mistral Large 3 → Mistral Small 4New base — Mistral Small 4 → Mistral Med 3.5 Mistral Medium 3.5 (April 28, 2026) is documented as a 128B dense first 'flagship merged model' (chat / reasoning / coding / vision in one weight set) — per the model card at docs.mistral.ai/models/model-cards/mistral-medium-3-5-26-04.New baseMistral 7BMistral 7B · Sep 27, 2023 · LegacyMixtral 8x7BMixtral 8x7B · Dec 11, 2023 · LegacyMistral LargeMistral Large · Feb 26, 2024 · LegacyMistral Large 2Mistral Large 2 · Jul 24, 2024 · LegacyCodestral 25.01Codestral 25.01 · Jan 13, 2025 · LegacyMistral Small 3Mistral Small 3 · Jan 30, 2025 · LegacyMistral Medium 3Mistral Medium 3 · May 7, 2025 · AvailableDevstral SmallDevstral Small · May 21, 2025 · LegacyMagistralMagistral · Jun 10, 2025 · AvailableCodestral 25.08Codestral 25.08 · Jul 30, 2025 · CurrentMistral Large 3Mistral Large 3 · Dec 2, 2025 · CurrentMinistral 3Ministral 3 · Dec 2, 2025 · AvailableDevstral 2Devstral 2 · Dec 9, 2025 · AvailableMistral OCR 3Mistral OCR 3 · Dec 17, 2025 · CurrentMistral Small 4Mistral Small 4 · Mar 16, 2026 · CurrentMistral Med 3.5Mistral Med 3.5 · Apr 28, 2026 · Current
Lineage as text (15 edges) ↓

Every edge in the Mistral tree above, in chronological order. Each line shows: destination model, edge type, source model, and the provider documentation that establishes the relationship (where one was disclosed).

  • Mixtral 8x7B (Dec 11, 2023)new base of Mistral 7B — Mixtral 8x7B introduced the sparse mixture-of-experts architecture as a new base, not a Mistral 7B fine-tune.
  • Mistral Large (Feb 26, 2024)new base of Mixtral 8x7B — Mistral Large was released as a proprietary flagship under the Mistral Research / commercial license, not a Mixtral fine-tune.
  • Mistral Large 2 (Jul 24, 2024)new base of Mistral Large
  • Codestral 25.01 (Jan 13, 2025)succession of Mistral Large 2 — Codestral 25.01 (January 13, 2025) is documented as the next-generation Codestral coding flagship — per mistral.ai/news/codestral-2501. Mistral does not disclose Codestral 25.01 as a fine-tune, post-training, or distillation of Mistral Large 2 (the Codestral line is its own track since the original Codestral 22B, May 2024).
  • Mistral Small 3 (Jan 30, 2025)new base of Mistral Large 2 — Mistral Small 3 marked the December 2025 'Mistral 3' family relaunch under Apache 2.0 with new bases.
  • Mistral Medium 3 (May 7, 2025)succession of Mistral Small 3 — Mistral Medium 3 (May 7, 2025) is the mid-tier proprietary flagship — per mistral.ai/news/mistral-medium-3.
  • Devstral Small (May 21, 2025)fine-tune of Mistral Small 3 — Devstral Small (May 21, 2025) is documented as a 24B coding-agent fine-tune of Mistral Small 3.1 — per mistral.ai/news/devstral.
  • Magistral (Jun 10, 2025)fine-tune of Mistral Small 3 — Magistral Small (June 10, 2025) is fine-tuned for multi-step reasoning with traceable chain-of-thought, building on the Mistral Small 3 lineage — per mistral.ai/news/magistral (arXiv 2506.10910).
  • Codestral 25.08 (Jul 30, 2025)succession of Codestral 25.01 — Codestral 25.08 (August 2025) is the next Codestral flagship — per mistral.ai/news/codestral-25-08.
  • Mistral Large 3 (Dec 2, 2025)succession of Mistral Medium 3
  • Ministral 3 (Dec 2, 2025)succession of Mistral Small 3 — Ministral 3 (December 2, 2025) is the small-end open-weights line in the Mistral 3 family relaunch — per the mistral.ai/news/mistral-3 announcement.
  • Devstral 2 (Dec 9, 2025)succession of Devstral Small — Devstral 2 (December 10, 2025) is documented as the next-generation Devstral coding-agent line — per mistral.ai/news/devstral-2-vibe-cli.
  • Mistral OCR 3 (Dec 17, 2025)succession of Mistral Large 3 — Mistral OCR 3 (December 17, 2025) ships alongside the Mistral 3 family on la Plateforme as the upgraded structured-document model; Mistral's announcement positions OCR 3 as a major upgrade over Mistral OCR 2 (74% win rate) and does not document any lineage from Mistral Large 3 — per mistral.ai/news/mistral-ocr-3. The OCR-3 ↔ Mistral 3 family edge here is a co-shipped successor relationship, not a disclosed base-model claim.
  • Mistral Small 4 (Mar 16, 2026)succession of Mistral Large 3
  • Mistral Med 3.5 (Apr 28, 2026)new base of Mistral Small 4 — Mistral Medium 3.5 (April 28, 2026) is documented as a 128B dense first 'flagship merged model' (chat / reasoning / coding / vision in one weight set) — per the model card at docs.mistral.ai/models/model-cards/mistral-medium-3-5-26-04.

Alibaba · Qwen

Versions page →
FlagshipReasoningProprietary2023202420252026Succession — Qwen-7B → Qwen-14BNew base — Qwen-14B → Qwen2 family Qwen2 marked the June 2024 architecture refresh with the Apache 2.0 licensing turn for most variants.New basePost-training — Qwen2 family → Qwen2.5 familyPost-trFine-tune — Qwen2.5 family → QwQ-32B Preview QwQ-32B-Preview (November 28, 2024) is the first Qwen reasoning model, a 32B dense Apache 2.0 fine-tune positioned against o1-preview — per qwenlm.github.io/blog/qwq-32b.Fine-tSuccession — Qwen2.5 family → Qwen2.5-Max Qwen2.5-Max (January 29, 2025) is the first proprietary closed-weights Qwen-Max release — per qwenlm.github.io/blog/qwen2.5-max.Post-training — QwQ-32B Preview → QwQ-32B QwQ-32B (March 5, 2025) is the production successor to QwQ-32B-Preview, trained with RL for chain-of-thought reasoning — per qwenlm.github.io/blog/qwq-32b.Post-trNew base — Qwen2.5 family → Qwen3 family Qwen3 introduced the hybrid Thinking / Non-Thinking architecture, absorbing the standalone QwQ reasoning track.New baseLines merged — QwQ-32B → Qwen3 family Qwen3 absorbed the QwQ standalone-reasoning track into the hybrid Thinking / Non-Thinking architecture — no further standalone QwQ rows are expected per qwen3 release post.MergedFine-tune — Qwen3 family → Qwen3-Coder Qwen3-Coder (July 22, 2025) is the open-weights agentic-coding flagship built on the Qwen3 lineage, 480B/35B-active MoE — per qwenlm.github.io/blog/qwen3-coder.Fine-tNew base — Qwen3 family → Qwen3-Next 80B Qwen3-Next-80B-A3B (September 11, 2025) introduced a novel ultra-sparse MoE architecture (hybrid Gated DeltaNet + Gated Attention) as a new base — per the Alibaba Cloud Community blog post.New baseSuccession — Qwen3 family → Qwen3-Max Qwen3-Max is the largest Qwen3-generation proprietary flagship.Fine-tune — Qwen3 family → Qwen3-VL Qwen3-VL family (September 23, 2025) is the vision-language fine-tune of Qwen3, mirroring the hybrid-mode recipe at the vision-language layer — per arXiv 2511.21631.Fine-tFine-tune — Qwen3-Coder → Qwen3-Coder-Next Qwen3-Coder-Next (February 4, 2026) is built on Qwen3-Next-80B-A3B-Base, the ultra-sparse coding-agent line — per qwen.ai/blog?id=qwen3-coder-next.Fine-tSuccession — Qwen3-Next 80B → Qwen3.5 + Plus Qwen3.5 (February 16, 2026) carries forward the hybrid Gated DeltaNet + sparse MoE architecture established in Qwen3-Next — per the qwen3.5 release.Succession — Qwen3-Max → Qwen 3.6-Max Qwen 3.6-Max-Preview (April 2, 2026) is the next-generation proprietary closed-weights flagship, replacing Qwen3-Max in the Max product slot.Succession — Qwen3.5 + Plus → Qwen3.6-35B-A3B Qwen3.6-35B-A3B (April 16, 2026) is the first open-weights Qwen3.6 release, a 35B-total/3B-active MoE — per HuggingFace Qwen/Qwen3.6-35B-A3B.New base — Qwen3.6-35B-A3B → Qwen3.6-27B Qwen3.6-27B (April 22, 2026) introduced the Gated DeltaNet + self-attention hybrid as a new dense base with Thinking Preservation — per qwen.ai/blog?id=qwen3.6-27b.New baseSuccession — Qwen 3.6-Max → Qwen3.7-Max Qwen3.7-Max (May 20, 2026) is the closed-weights proprietary reasoning-agent flagship succeeding the 3.6-Max preview — per the Alibaba Cloud Summit announcement.Succession — Qwen3.7-Max → Qwen3.7-Plus Qwen3.7-Plus (May 31, 2026) is the multimodal vision + language sibling to the text-only Qwen3.7-Max, together forming the Qwen3.7 generation announced at the May 20, 2026 Alibaba Cloud Summit. Alibaba's announcement describes Plus as retaining Max's coding / tool-use / reasoning strengths while adding image and video understanding, but does not disclose it as a fine-tune, post-training, or distillation of Qwen3.7-Max — per qwen.ai/blog?id=qwen3.7-plus and the Alibaba Cloud Bailian / Model Studio Recommended models page.Qwen-7BQwen-7B · Aug 3, 2023 · LegacyQwen-14BQwen-14B · Sep 25, 2023 · LegacyQwen2 familyQwen2 family · Jun 7, 2024 · LegacyQwen2.5 familyQwen2.5 family · Sep 19, 2024 · LegacyQwQ-32B PreviewQwQ-32B Preview · Nov 28, 2024 · LegacyQwen2.5-MaxQwen2.5-Max · Jan 29, 2025 · LegacyQwQ-32BQwQ-32B · Mar 5, 2025 · AvailableQwen3 familyQwen3 family · Apr 28, 2025 · LegacyQwen3-CoderQwen3-Coder · Jul 22, 2025 · AvailableQwen3-Next 80BQwen3-Next 80B · Sep 11, 2025 · AvailableQwen3-MaxQwen3-Max · Sep 15, 2025 · LegacyQwen3-VLQwen3-VL · Sep 23, 2025 · AvailableQwen3-Coder-NextQwen3-Coder-Next · Feb 4, 2026 · AvailableQwen3.5 + PlusQwen3.5 + Plus · Feb 16, 2026 · AvailableQwen 3.6-MaxQwen 3.6-Max · Apr 2, 2026 · LegacyQwen3.6-35B-A3BQwen3.6-35B-A3B · Apr 16, 2026 · CurrentQwen3.6-27BQwen3.6-27B · Apr 22, 2026 · CurrentQwen3.7-MaxQwen3.7-Max · May 20, 2026 · CurrentQwen3.7-PlusQwen3.7-Plus · May 31, 2026 · Current
Lineage as text (19 edges) ↓

Every edge in the Qwen tree above, in chronological order. Each line shows: destination model, edge type, source model, and the provider documentation that establishes the relationship (where one was disclosed).

  • Qwen-14B (Sep 25, 2023)succession of Qwen-7B
  • Qwen2 family (Jun 7, 2024)new base of Qwen-14B — Qwen2 marked the June 2024 architecture refresh with the Apache 2.0 licensing turn for most variants.
  • Qwen2.5 family (Sep 19, 2024)post-training of Qwen2 family
  • QwQ-32B Preview (Nov 28, 2024)fine-tune of Qwen2.5 family — QwQ-32B-Preview (November 28, 2024) is the first Qwen reasoning model, a 32B dense Apache 2.0 fine-tune positioned against o1-preview — per qwenlm.github.io/blog/qwq-32b.
  • Qwen2.5-Max (Jan 29, 2025)succession of Qwen2.5 family — Qwen2.5-Max (January 29, 2025) is the first proprietary closed-weights Qwen-Max release — per qwenlm.github.io/blog/qwen2.5-max.
  • QwQ-32B (Mar 5, 2025)post-training of QwQ-32B Preview — QwQ-32B (March 5, 2025) is the production successor to QwQ-32B-Preview, trained with RL for chain-of-thought reasoning — per qwenlm.github.io/blog/qwq-32b.
  • Qwen3 family (Apr 28, 2025)new base of Qwen2.5 family — Qwen3 introduced the hybrid Thinking / Non-Thinking architecture, absorbing the standalone QwQ reasoning track.
  • Qwen3 family (Apr 28, 2025)lines merged of QwQ-32B — Qwen3 absorbed the QwQ standalone-reasoning track into the hybrid Thinking / Non-Thinking architecture — no further standalone QwQ rows are expected per qwen3 release post.
  • Qwen3-Coder (Jul 22, 2025)fine-tune of Qwen3 family — Qwen3-Coder (July 22, 2025) is the open-weights agentic-coding flagship built on the Qwen3 lineage, 480B/35B-active MoE — per qwenlm.github.io/blog/qwen3-coder.
  • Qwen3-Next 80B (Sep 11, 2025)new base of Qwen3 family — Qwen3-Next-80B-A3B (September 11, 2025) introduced a novel ultra-sparse MoE architecture (hybrid Gated DeltaNet + Gated Attention) as a new base — per the Alibaba Cloud Community blog post.
  • Qwen3-Max (Sep 15, 2025)succession of Qwen3 family — Qwen3-Max is the largest Qwen3-generation proprietary flagship.
  • Qwen3-VL (Sep 23, 2025)fine-tune of Qwen3 family — Qwen3-VL family (September 23, 2025) is the vision-language fine-tune of Qwen3, mirroring the hybrid-mode recipe at the vision-language layer — per arXiv 2511.21631.
  • Qwen3-Coder-Next (Feb 4, 2026)fine-tune of Qwen3-Coder — Qwen3-Coder-Next (February 4, 2026) is built on Qwen3-Next-80B-A3B-Base, the ultra-sparse coding-agent line — per qwen.ai/blog?id=qwen3-coder-next.
  • Qwen3.5 + Plus (Feb 16, 2026)succession of Qwen3-Next 80B — Qwen3.5 (February 16, 2026) carries forward the hybrid Gated DeltaNet + sparse MoE architecture established in Qwen3-Next — per the qwen3.5 release.
  • Qwen 3.6-Max (Apr 2, 2026)succession of Qwen3-Max — Qwen 3.6-Max-Preview (April 2, 2026) is the next-generation proprietary closed-weights flagship, replacing Qwen3-Max in the Max product slot.
  • Qwen3.6-35B-A3B (Apr 16, 2026)succession of Qwen3.5 + Plus — Qwen3.6-35B-A3B (April 16, 2026) is the first open-weights Qwen3.6 release, a 35B-total/3B-active MoE — per HuggingFace Qwen/Qwen3.6-35B-A3B.
  • Qwen3.6-27B (Apr 22, 2026)new base of Qwen3.6-35B-A3B — Qwen3.6-27B (April 22, 2026) introduced the Gated DeltaNet + self-attention hybrid as a new dense base with Thinking Preservation — per qwen.ai/blog?id=qwen3.6-27b.
  • Qwen3.7-Max (May 20, 2026)succession of Qwen 3.6-Max — Qwen3.7-Max (May 20, 2026) is the closed-weights proprietary reasoning-agent flagship succeeding the 3.6-Max preview — per the Alibaba Cloud Summit announcement.
  • Qwen3.7-Plus (May 31, 2026)succession of Qwen3.7-Max — Qwen3.7-Plus (May 31, 2026) is the multimodal vision + language sibling to the text-only Qwen3.7-Max, together forming the Qwen3.7 generation announced at the May 20, 2026 Alibaba Cloud Summit. Alibaba's announcement describes Plus as retaining Max's coding / tool-use / reasoning strengths while adding image and video understanding, but does not disclose it as a fine-tune, post-training, or distillation of Qwen3.7-Max — per qwen.ai/blog?id=qwen3.7-plus and the Alibaba Cloud Bailian / Model Studio Recommended models page.

About this page

Cross-family comparison page in the /ai/ section. Each per-provider tree was hand-laid-out from the per-family Claude, ChatGPT, Gemini, Grok, Llama, DeepSeek, Mistral, and Qwen Versions pages on this site, each row's lineage edge sourced to the provider's own model card, technical paper, or release announcement.

Conservative claims. Where a provider has not formally documented base-model continuity between two releases, the edge is labeled "succession" and no further claim is made. The page does not infer "X is a fine-tune of Y" from external speculation, leaderboard chatter, or press coverage. The "succession" edge is honest about what is and isn't disclosed; the per-edge tooltip and the "Lineage as text" block name the source for every more-specific edge type.

Per-provider focus, not encyclopedic. Each tree highlights the lineage-meaningful releases — the new bases, the major post-trainings, the visible fine-tunes, the line merges. Per-release minutiae (small variants, intermediate checkpoints, every refresh on every tier) live on the per-family Versions pages where they belong. The /ai/release-cadence/ and /ai/context-windows/ pages are the right place for per-release counts and metrics.

What is intentionally excluded. Open-source community fine-tunes (the Llama-derivatives ecosystem — Vicuna, WizardLM, Nous Hermes, etc.) are not on the page; they are downstream community work, not frontier-lab releases. Capability comparisons ("which line is best") are out of scope. Speculation about undisclosed base-model continuity is out of scope. Unreleased models (announced but never publicly shipped, like Llama 4 Behemoth) are not included.

Refreshed daily, aligned with the per-family Versions pages. Each refresh re-verifies every disclosed lineage source (model cards / papers / announcements move under the same URLs but their content changes), adds nodes for new flagship releases, and prunes any node that the provider has formally deprecated. See release cadence and context windows for the cross-family ship-cadence and context-budget pictures this page complements.

Last updated: June 23, 2026. 130 models · 8 providers · 125 typed edges.

Last refreshed 2026-06-23 by Callisto — moved Qwen flagship node, softened Plus edge.