AI · Live reference
The frontier AI models, right now
The frontier AI models, as of June 12, 2026: Anthropic's Claude Fable 5 (released June 9, 2026), OpenAI's GPT-5.5 (released April 23, 2026), Google's Gemini 3.5 Flash (Stable GA May 19, 2026), xAI's Grok 4.3 (released April 17, 2026), Meta's Muse Spark (released April 8, 2026), DeepSeek's DeepSeek-V4-Pro (released April 24, 2026), Mistral's Mistral Medium 3.5 (released April 28, 2026), and Alibaba's Qwen3.7-Plus (released May 31, 2026). The detail below covers each family in full — every release, every API string, every context window.
Last verified: June 12, 2026 · refreshed daily. 8 families · 3 US public · 2 US private · 2 Chinese · 1 EU.
Filter by lab status
Frontier AI model families table
Family
ChatGPT
GPT · o-series
OpenAI US private
GPT-5.5
Apr 23, 2026
What is the newest ChatGPT or GPT model?
OpenAI's flagship line, from GPT-1 (June 2018) through GPT-5.5 (April 2026). The August 2025 GPT-5 release introduced a unified router that picks between fast-response and reasoning models within a single endpoint; the 5.x cadence has been roughly six weeks per release through 5.5. The o-series reasoning models (o1, o3, o4) have been merged into the main line under GPT-5.
- Lab: OpenAI — San Francisco, California, USA. Privately held; IPO targeted Q4 2026.
- Current model: GPT-5.5, shipped April 23, 2026. API:
gpt-5.5/gpt-5.5-pro. - First model: GPT-1, June 11, 2018. Total versions: 25.
- Pages on this site:
- ChatGPT Versions — the full lineage
- OpenAI Lawsuits — NYT v. OpenAI, Authors Guild, Musk v. Altman, etc.
- OpenAI Leadership — founders, board, governance
- OpenAI on the Orgs roster
- Primary sources: OpenAI Models · OpenAI Announcements
Family
Claude
Mythos · Opus · Sonnet · Haiku
Anthropic US private
Claude Fable 5
June 9, 2026 · 3 days ago
What is the latest Claude model?
Anthropic's flagship line, from Claude 1 (March 2023) through Claude Fable 5 (June 9, 2026) — the first generally-available model in the new Mythos class, a tier Anthropic positions above Opus. The three-tier structure (Opus for the largest, Sonnet for the balanced mid-tier, Haiku for the small-and-fast tier) was introduced with Claude 3 in March 2024; the Mythos class sits above it. Constitutional AI is the safety-training framing that shipped with Claude 1.
- Lab: Anthropic — San Francisco, California, USA. Privately held; began IPO prep in late 2025.
- Current model: Claude Fable 5, shipped June 9, 2026 — the first generally-available Mythos-class model, a tier above Opus. API:
claude-fable-5. Opus-tier flagship remains Claude Opus 4.8 (claude-opus-4-8, May 28, 2026). - First model: Claude 1, March 14, 2023. Total versions: 22.
- Pages on this site:
- Claude Versions — the full lineage
- Anthropic Lawsuits — Bartz v. Anthropic, Concord Music Group, Reddit v. Anthropic
- Anthropic Leadership — cofounders, the Long-Term Benefit Trust, governance
- Anthropic on the Orgs roster
- Primary sources: Anthropic Models · Anthropic News
Family
DeepSeek
V-series · R-series
DeepSeek Chinese
DeepSeek-V4-Pro
Apr 24, 2026
What is the latest DeepSeek model?
Hangzhou-based open-weights frontier-LLM lab. Released DeepSeek-V3 (December 2024) and DeepSeek-R1 (January 2026) at training-cost levels far below US-frontier peers, sparking a global re-evaluation of frontier-model economics. V4-Pro is the current flagship: a 1.6-trillion-parameter Mixture-of-Experts model with 49B active parameters per token and a 1M-token context window, MIT-licensed.
- Lab: DeepSeek (Hangzhou DeepSeek Artificial Intelligence Basic Technology Research) — Hangzhou, China. Privately held subsidiary of High-Flyer Capital Management.
- Current model: DeepSeek-V4-Pro, shipped April 24, 2026. HuggingFace:
deepseek-ai/DeepSeek-V4-Pro. - First model: DeepSeek-LLM, November 27, 2023. Total versions: 20.
- Pages on this site:
- DeepSeek Versions — the full lineage
- DeepSeek on the Orgs roster
- Primary sources: DeepSeek API Docs · deepseek-ai on HuggingFace
Family
Gemini
Pro · Flash · Flash-Lite · Nano
Alphabet (Google) US public
Gemini 3.5 Flash
May 19, 2026
What's Google's most capable AI model right now?
Google's flagship line, from Bard (March 2023) and Gemini 1.0 (December 2023) through Gemini 3.5 Flash (May 2026, Stable GA). Largest integration surface of any frontier line — Search AI Overviews, Workspace (Gmail, Docs, Slides, Meet, Drive), Pixel + Android (on-device Nano), Chrome Built-in AI. The 2.5 line (March 2025) was the first reasoning-as-default frontier model; the 3.x line moved Google to a one-launch-everywhere release pattern.
- Lab: Google DeepMind, inside Alphabet (NASDAQ: GOOGL) — Mountain View, California, USA.
- Current model: Gemini 3.5 Flash, Stable GA May 19, 2026. API:
gemini-3.5-flash. The first Flash tier to outperform the prior generation's Pro flagship on hard coding and agentic benchmarks. The Pro slot is still held by Gemini 3.1 Pro Preview (gemini-3.1-pro-preview, February 19, 2026). - First model: Bard (LaMDA-based), March 21, 2023. Total versions: 26.
- Pages on this site:
- Gemini Versions — the full lineage
- Alphabet on the Orgs roster
- Primary sources: Gemini API Docs · Google DeepMind Blog
Family
Grok
Chat · Heavy · Multi-agent
xAI US public
Grok 4.3
Apr 17, 2026
What is the latest Grok model?
xAI's flagship line, from Grok 1 (November 2023) through Grok 4.3 (April 17, 2026; GA on docs as of May 15, 2026). Native video input and in-chat generation of PDFs / slides / spreadsheets at 4.3; multi-agent collaboration introduced with Grok 4.20 (March 10, 2026) and 16-agent Heavy mode at 1M-token context. The agentic-coding specialist Grok Build 0.1 (grok-build-0.1, May 14, 2026) ships as a Specialized SKU alongside, joined by Composer 2.5 (June 1, 2026 — a Kimi K2.5-based fast agentic-coding model selectable inside Grok Build). xAI is now part of SpaceX following the February 2026 SpaceX-xAI merger; X integration is the original distribution surface.
- Lab: xAI, inside SpaceX (Nasdaq: SPCX) — San Francisco Bay Area, California, USA.
- Current model: Grok 4.3, shipped April 17, 2026 (GA on docs as of May 15, 2026). API:
grok-4.3. Prior flagship Grok 4.20 still available as three 0309-dated SKUs (grok-4.20-multi-agent-0309,grok-4.20-0309-reasoning,grok-4.20-0309-non-reasoning). - First model: Grok 1, November 4, 2023. Total versions: 16 (including point-releases through 4.3, the May 2026 Grok Build 0.1, and the June 2026 Composer 2.5 specialized coding model).
- Pages on this site:
- Grok Versions — the full lineage
- SpaceX on the Orgs roster · xAI Financials
- Primary sources: xAI Models · xAI News
Family
Llama
Open-weights → closed
Meta US public
Muse Spark
Apr 8, 2026
What's Meta's current frontier AI model?
Meta's frontier line, from LLaMA 1 (February 2023, originally research-only) through Llama 4 (April 2025) and into the post-Llama Muse Spark line (April 2026, the first model from the new Meta Superintelligence Labs). Llama 1–4 were the canonical open-weights frontier line; Muse Spark is the company's pivot to closed-weights, API-only frontier models — the end of Meta's open-weights frontier-AI era. The family is in transition; this row tracks the active frontier line under the Meta umbrella.
- Lab: Meta Superintelligence Labs (MSL), inside Meta Platforms (NASDAQ: META) — Menlo Park, California, USA. Scale AI majority-acquired into MSL in 2025.
- Current model: Muse Spark, shipped April 8, 2026. Closed-weights, API-only — no HuggingFace release.
- First model: LLaMA 1, February 24, 2023 (research-only release; weights leaked to 4chan March 3, 2023). Total versions: 15.
- Pages on this site:
- Llama Versions — the full lineage including Muse Spark
- Meta Financials — the parent company's financials
- Meta on the Orgs roster
- Primary sources: llama.com · Meta AI Blog
Family
Mistral
Mistral · Mixtral · Magistral
Mistral AI EU
Mistral Medium 3.5
Apr 28, 2026
What is Mistral's latest model?
Paris-based open-weights AI lab, from Mistral 7B (September 2023) through Mistral Medium 3.5 (April 2026, the largest dense open-weights flagship at 128B dense / 256K context / multimodal). The Mistral / Mixtral / Magistral product structure splits between dense (Mistral), Mixture-of-Experts (Mixtral), and reasoning (Magistral). Largest version count on the roster reflects the lab's high release cadence and its tier proliferation.
- Lab: Mistral AI — Paris, France. Privately held. EU-based; the only non-US, non-Chinese lab on the roster.
- Current model: Mistral Medium 3.5, shipped April 28, 2026. HuggingFace:
mistralai/Mistral-Medium-3.5-128b/ API:mistral-medium-3-5-26-04. 128B dense, 256K context, multimodal. “First flagship merged model” (chat / reasoning / coding / vision in one). Modified MIT license. - First model: Mistral 7B, September 27, 2023. Total versions: 30.
- Pages on this site:
- Mistral Versions — the full lineage
- Mistral on the Orgs roster
- Primary sources: Mistral Docs · mistralai on HuggingFace
Family
Qwen
Qwen · Tongyi Qianwen
Alibaba Chinese
Qwen3.7-Plus
May 31, 2026
What is the latest Qwen model?
Alibaba Cloud's Tongyi Lab line, from Qwen 1 (August 2023) through Qwen3.7-Plus (May 31, 2026 — the multimodal agent flagship of the Qwen3.7 generation), joined by the text-only Qwen3.7-Max (May 20, 2026) as its sibling. The open-weights branch continues through Qwen3.6 in Apache 2.0; both 3.7-Plus and 3.7-Max are closed-weights and proprietary, accessed via Alibaba Cloud Bailian / Model Studio (DashScope) and partners. 1M-token context (256K of which reserved for chain-of-thought), native multimodal vision plus five agentic capabilities — deep reasoning, self-programming, tool invocation, verification, and autonomous iteration. Alibaba reports 3.7-Max beats Claude Opus 4.6 on Terminal-Bench 2.0 / SWE-Bench Pro / MCP-Atlas.
- Lab: Tongyi Lab inside Alibaba Cloud, Alibaba Group (NYSE: BABA) — Hangzhou, China.
- Current model: Qwen3.7-Plus, shipped May 31, 2026. Alibaba Cloud Bailian / Model Studio (DashScope) — closed-weights, proprietary. Multimodal (text, image, video input). 1M-token context with 256K reserved for chain-of-thought. $0.40 / $1.60 per million tokens. Sibling text-only flagship Qwen3.7-Max (May 20, 2026,
qwen3.7-max) remains available. - First model: Qwen 1, August 3, 2023. Total versions: 26.
- Pages on this site:
- Qwen Versions — the full lineage
- Alibaba on the Orgs roster
- Primary sources: Qwen Docs · Qwen on HuggingFace
No families match your filter or search.
Best frontier model for each task
Each pick is the lab whose current flagship leads its primary-source benchmark on the task. Capabilities shift release-to-release; this section is reviewed on every refresh. Reasoning links go to the lab's own announcement or model docs, not to third-party rankings.
Long-context summarization
1 million-token input window with full multimodal understanding across text, audio, image, video, and entire code repositories — per Google's model docs. DeepSeek-V4-Pro matches the 1M window in open weights for the self-hosting path; Anthropic's Claude Opus 4.8 carries a 1 million-token beta on Vertex and Bedrock if a closed-weights alternative is preferred.
Code generation
Anthropic's first generally-available Mythos-class model (June 9, 2026), positioned above Opus, with state-of-the-art results on tested capability benchmarks and particular strength in software engineering per the launch announcement. Opus 4.8 (May 28, 2026) remains Anthropic's recommended Opus-tier model with Dynamic Workflows and effort-control. Alibaba published a competing claim that Qwen3.7-Max beats Claude Opus 4.6 on Terminal-Bench 2.0, SWE-Bench Pro, and MCP-Atlas — framed against the older Opus 4.6.
Math and reasoning
OpenAI's August 2025 GPT-5 unified router merged the o-series reasoning track into the main line; the 5.x cadence has carried it forward through 5.5. Reasoning lineage and AIME / GPQA scores documented on the OpenAI models page. Gemini 3 also posted an ARC-AGI-2 jump (31.1% → 77.1%) per Google's release notes.
Vision and multimodal
Largest deployed multimodal surface of any frontier line: Search AI Overviews, Workspace, Pixel + Android on-device Nano, Chrome Built-in AI. Modality coverage and benchmark numbers on the Gemini model docs. Alibaba's Qwen3.7-Plus (closed-weights, May 31 2026) is the new contender — multimodal vision plus five agentic capabilities at $0.40/$1.60 per million tokens. GPT-5.5 native vision is the OpenAI alternative if you're already on that stack.
Cost per million output tokens
MIT-licensed open weights, the lowest hosted-API price tier among the eight families per the DeepSeek pricing page. Self-hosting drops the cost further when batch volume justifies the GPU spend. Closed-weights cost-tier alternative: Gemini 3.1 Flash-Lite.
Agentic tasks (tool use, computer use)
Most mature agentic ecosystem: tool use, Claude Code, the Model Context Protocol (MCP), computer-use, and Dynamic Workflows that script hundreds of parallel subagents per run — all documented on the Anthropic tool-use docs. Fable 5 ships with longer autonomous-run capability than any prior Claude. xAI's multi-agent Heavy mode (Grok 4.20, 16 agents at 2M context) is a different agentic shape.
Picks are reviewed on every refresh and rotate as flagships ship. A recommendation that names a deprecated or maintenance-tier model is itself a bug — surface to mungomash@gmail.com if you spot one.
What's coming next
A cross-family roadmap of announced-but-not-yet-shipped frontier models — OpenAI's GPT-6, Anthropic's Claude 5 generation, Google's Gemini 4, xAI's Grok 5, Meta's next post-Muse model — is in development at /ai/announcements-roadmap/. Until that page ships, the per-family Versions page for each lab carries the latest pre-announcements where the lab has made them public. Check the release cadence page for the rhythm of shipping across families.
Notable absences
Each is a name visitors might expect to see and a one-line reason for why it isn't here. None of these failed for arbitrary reasons — each fails one of the four inclusion criteria documented in the methodology below.
- BERT, T5, GPT-1, GPT-2, original LaMDA — pre-2022 lines. Predecessors but not currently-active frontier lines. Surfaced on per-family Versions pages where applicable.
- Cohere Command — a frontier line, but Cohere is Toronto-based; Canadian-headquartered labs don't currently have a chip in the lab-status filter. Add a "Canadian" chip and a Cohere row when the next refresh decides to expand the geographic taxonomy.
- IBM Granite — an enterprise-focused open-weights line; visible in the IBM stack but not openly described as a frontier-LLM line by IBM itself. Doesn't meet criterion (3).
- 01.AI Yi — research-active and open-weights, but releases haven't been frontier-class on a consistent generation-by-generation basis. Doesn't meet criterion (3).
- Cerebras — AI accelerator chips, not a frontier-LLM line. Different category.
- Adept, Inflection, Character.AI — each had frontier ambitions, each was acquired into another lab (Amazon, Microsoft, Google respectively) or pivoted away from frontier-LLM development. The family identity didn't survive the acquisition.
- Meta Galactica — retracted by Meta within days of launch in November 2022. Not currently active.
- Stability AI's language models — deprecated. Stability AI's image-gen line (SDXL) is active but is image-gen, not LLM.
- DALL-E, Imagen, SDXL, Sora, Veo, ElevenLabs — non-LLM frontier lines (image-gen, video-gen, speech). Deliberately out of scope for v1; this page is language-only. A future
/ai/multimodal-models/page may cover them. - Llama-derived community fine-tunes (Vicuna, WizardLM, Nous Hermes, etc.) — downstream of the parent family, not separate frontier lines. Llama itself is in the roster.
Recent AI Briefs
3 Model Briefs
See all Briefs →US government orders Anthropic to suspend Claude Fable 5
On June 12, 2026, the US government issued an export-control directive ordering Anthropic to suspend all access to Claude Fable 5 and the restricted Mythos 5 — for every customer, and for foreign nationals inside or outside the United States, including Anthropic’s own foreign-national employees. Anthropic says it received the letter at 5:21pm ET, disabled both models to comply, and that access to all other Claude models is unaffected, with queries defaulting to Claude Opus 4.8. The government cited national-security authorities and, per Anthropic,…
Anthropic releases Claude Fable 5, its first generally-available Mythos-class model
Anthropic released Claude Fable 5 on June 9, 2026 — the first generally-available model in a new “Mythos” class that the company positions above its Opus tier in capability. Anthropic reports state-of-the-art results on nearly all tested capability benchmarks, with particular strength in software engineering, knowledge work, vision, and scientific research, and says Fable 5 and the restricted Mythos 5 can work autonomously for longer than any prior Claude. The shape of the release is the part to watch. Fable 5 is the public,…
Anthropic releases Claude Opus 4.8 with sharper judgement and dynamic workflows
Anthropic shipped Claude Opus 4.8 on May 28, 2026 — an upgrade to Opus 4.7 across coding, agent work, reasoning, and knowledge work. The release notes frame the model as having sharper judgement, more honesty about its progress, and the ability to work independently for longer; Anthropic reports a more-than-10x reduction in overconfidence versus 4.7, and says it is the first Claude model to score 0% on uncritically reporting flawed results — a calibration posture that maps to the trust-engineering direction the frontier labs…
About this list
Inclusion criteria. Curated to exactly the model families that satisfy all four of: (1) has a Mungomash /ai/<family>/versions/ page on this site, (2) has shipped a generation-class flagship in the last 18 months, (3) is openly described as a frontier-LLM line by its lab, and (4) has a primary API surface visitors can call (first-party hosted API, OpenAI-compatible API, or open-weights with a HuggingFace release). Closed-internal models with no callable surface, pre-2022 lines, research previews, and non-LLM frontier lines (image, video, speech) are out by criterion. See "Notable absences" above for the full list of considered-but-excluded names.
Data sourcing. Each row's data — current model, current ship date, first model, total versions — is sourced from this site's per-family Versions pages, which are themselves cross-checked against each lab's primary documentation on every refresh. The "current model" is the most-recent flagship release on the family's Versions page; "total versions" is the row count on that page.
Relationship to other AI-section pages. This page is the entity-roster — one row per family. Several adjacent cross-family pages will live alongside it as horizontal slices that visualize a single specific axis (context windows, model lineage, pricing history, release cadence, training-data disclosures); each can link back to a family's row here for the entity-level context. The per-family /ai/<family>/versions/ pages remain canonical for the full lineage data.
Data freshness. The roster, every current-model name, every current-model ship date, every total-versions count, and every lab status are re-verified against the per-family Versions pages and each lab's primary documentation on every refresh of this page. If a lab has shipped a new flagship since the last refresh, the row is updated; if a privately-held lab has gone public or been acquired, the labstatus is updated. Stale roster data on a frontier-LLM page drifts within weeks — the release cadence is fast.
What's intentionally excluded. Cross-family benchmark scores (noisy, change weekly, lab-published numbers are aggressive marketing). Live API pricing (scope of /ai/pricing-history/). Context-window comparison (scope of /ai/context-windows/). Visual family-tree relationships between models (scope of /ai/model-lineage/). Editorial framing about which family is “best.”
Last updated: 2026-06-12. Looking for the per-family lineage? Each row above links to the family's /ai/<family>/versions/ page plus the lab's profile.