Release Timeline · Updated June 1, 2026
AI model release timeline
Every major frontier and open-weight model release, newest first. Dates, pricing, and the key thing that changed. Updated same-day when new models ship.
2026
May 28, 2026
Claude Opus 4.8
Frontier
Anthropic · Review →
$5/$25 per 1M tokens standard; $10/$50 Fast Mode. SWE-Bench Pro 69.2% (up from 64.3% on 4.7). Major alignment and honesty improvements. API:
claude-opus-4-8.
May 19, 2026 (GA)
Gemini 3.5 Flash
Frontier
Google · Review →
$1.50/$9.00 per 1M; free API tier. 1M token context, 64K output. 289 tok/s throughput — one of the fastest frontier models. Announced at Google I/O 2026.
April 2026
Claude Opus 4.7
Frontier
Anthropic · Review →
$5/$25 per 1M. SWE-Bench Pro 64.3%. Now superseded by 4.8 but still fully supported.
April 24, 2026
GPT-5.5
Frontier
OpenAI · Review →
$5/$30 per 1M standard; $2.50/$15 batch. 1.05M context, 128K output. Built for agentic coding and computer use. Successor to GPT-5.
April 24, 2026
DeepSeek V4-Pro & V4-Flash
Open (MIT)
DeepSeek · Review →
V4-Pro: $0.435/$0.87 per 1M; cache hit $0.003625. V4-Flash: $0.14/$0.28 — cheapest commercial API. 1M context, 384K output. MIT license. Self-hosted or API.
April 22, 2026
Qwen 3.6
Open (Apache 2.0)
Alibaba · Review →
Apache 2.0. Dense 27B and MoE 35B-A3B variants. 262K native context (~1M extended). Strong multilingual — 201 languages. API: ~$0.30/$0.90 per 1M cloud.
April 20, 2026
Kimi K2.6
Open (Modified MIT)
Moonshot AI · Review →
$0.95/$4.00 per 1M; cached $0.16. 256K context. 1T total / 32B active MoE. Good multilingual performance.
April 2026
Grok 4.3
Frontier
xAI · Review →
$1.25/$2.50 per 1M; cached $0.20. 1M context. Real-time data via X. Very cheap output tokens for agentic loops.
April 2026
Mistral Medium 3.5
Open (Modified MIT)
Mistral
$1.50/$7.50 per 1M. Public preview. 256K context. Multimodal + reasoning. Self-host on 4 GPUs.
February 2026
Gemini 3.1 Pro
Frontier
Google · Review →
$2/$12 per 1M (above 200K: $4/$18). 1M context, 64K output. Deepest reasoning in the Gemini family. Replaced Gemini 3 Pro.
2025
December 2, 2025
Mistral Large 3
Open (Apache 2.0)
Mistral · Review →
$0.50/$1.50 per 1M. Apache 2.0 — true open commercial license. 675B total / 41B active MoE. 256K context. API id: mistral-large-2512.
November 2025
Claude Haiku 4.5
Small
Anthropic · Review →
$1/$5 per 1M. 200K context. 145 tok/s. High-volume routing and classification tier.
September 2025
Claude Sonnet 4.6
Mid
Anthropic · Review →
$3/$15 per 1M. 200K context. 95 tok/s. The production daily-driver between Haiku and Opus.
August 7, 2025
GPT-5
Frontier
OpenAI · Review →
$1.25/$10 per 1M. 400K context, 128K output. The everyday OpenAI production pick at a rational price point.
April 5, 2025
Llama 4 Scout & Maverick
Open (Community)
Meta · Review →
Free under Meta's Community License. Scout: 10M token context, 17B/109B; Maverick: 1M context, 17B/400B, 128 experts. Self-hosted at no API cost.
January 2025
Phi-4
Small
Microsoft
MIT license. 14B parameter dense model. 16K context. Built for edge/local inference. Punches above its weight on MMLU and math.