Reference·June 2026

AI API price history: every verified change, on the record

Name: benchr AI API price history
Creator: benchr
License: https://creativecommons.org/licenses/by/4.0/

Providers rewrite their pricing pages and the old numbers vanish. This log keeps them: append-only, dated, and tied to official sources.

By the benchr team · Updated July 24, 2026 · Each price is a dated rate-card snapshot, not a promise that the rate will continue · View changelog

Entries loggedsince June 1, 2026

Cheapest input / 1MDeepSeek V4-Flash

Priciest output / 1MClaude Fable 5

Price spreadoutput, $0.28 to $50 per 1M

How this log works

The log follows three rules. Append-only: entries are not edited or removed after publication. Same-day verification: a number enters the log only on a day benchr confirms it on a provider pricing page or announcement, not from a search snippet or third-party table. Source attached: every entry links to the record used for verification.

The append-only log begins on June 1, 2026 and does not backfill earlier prices. Older documented changes appear below as separate context, with their own sources.

The log so far

Append-only entries from history.json: input/output per 1M tokens at time of logging
Logged	Model	Event	Input	Output
2026-07-13	GPT-5.6 Sol	Context and pricing re-verified	$5.00	$30.00
2026-07-13	Grok 4.5	Model added	$2.00	$6.00
2026-07-13	GLM-5.2	Model added	$1.40	$4.40
2026-07-13	MiniMax M3	Model added	$0.30	$1.20
2026-07-09	GPT-5.6	General availability recorded	$5.00	$30.00
2026-07-08	Gemini 3.5 Pro	Correction: June 30 entry retracted	—	—
2026-07-08	GPT-5.6	Correction: preview status clarified	—	—
2026-07-08	Claude Sonnet 5	Correction: maximum output	—	—
2026-07-03	Claude Sonnet 5	Price correction	$2.00	$10.00
2026-07-01	Claude Sonnet 5	Original launch entry; price corrected July 3	$4.00	$20.00
2026-06-30	Gemini 3.5 Pro	Erroneous entry; retracted July 8	$2.50	$15.00
2026-06-10	Claude Fable 5	New model (Jun 9 release)	$10.00	$50.00
2026-06-10	GPT-5.4	Baseline (Mar 5 release)	$2.50	$15.00
2026-06-01	Claude Opus 4.8	Baseline (May 28 release)	$5.00	$25.00
2026-06-01	GPT-5.5	Baseline (Apr 23 release)	$5.00	$30.00
2026-06-01	GPT-5	Baseline (Aug 2025 release)	$1.25	$10.00
2026-06-01	GPT-5 Mini	Baseline	$0.25	$2.00
2026-06-01	Gemini 3.5 Flash	Baseline (May 19 GA)	$1.50	$9.00
2026-06-01	Grok 4.3	Baseline (Apr release)	$1.25	$2.50
2026-06-01	DeepSeek V4-Pro	Baseline (Apr 24 release)	$0.435	$0.87
2026-06-01	DeepSeek V4-Flash	Baseline	$0.14	$0.28
2026-06-01	Kimi K2.6	Baseline (Apr 20 release)	$0.95	$4.00
2026-06-01	Mistral Large 3	Baseline (Dec 2 release)	$0.50	$1.50
2026-06-01	Llama 4 Maverick	Baseline (open weights)	$0 license	$0 license
2026-06-01	Llama 4 Scout	Baseline (open weights)	$0 license	$0 license

Most entries are baselines: the verified starting point each future change gets measured against. That's how an honest price database begins: you can't detect a change without a trustworthy "before."

Documented moves from before the log

These predate June 1, 2026, so they live outside the append-only dataset, but each is documented by the provider's own announcements and worth keeping in view:

The Opus lane fell two-thirds. Claude Opus 4 and 4.1 billed $15/$75 per million tokens through 2025. Since Opus 4.5 (November 2025), the lane bills $5/$25 — and both old models retire this summer, making the cut universal.

GPT-4o halved within months. The May 2024 launch snapshot billed $5/$15; later 2024 snapshots billed $2.50/$10. The original snapshot, still billing 2024 prices, shuts down October 23, 2026.

Reasoning got cheap. o1 launched December 2024 at $15/$60, and o1-pro hit $150/$600 in March 2025, OpenAI's priciest model ever. Today GPT-5.5 reasons better at $5/$30.

DeepSeek's June snapshot reset the recorded floor. The June 1, 2026 log captured V4-Pro at $0.435/$0.87 after a reported 75% reduction on api-docs.deepseek.com. Those figures describe that dated rate card; they can change, so the current provider page remains the source for a new purchase or budget.

And the increases

Published base rates are only one source of cost changes. Google's October 16 cutoff moves Gemini 2.5 Flash traffic to a model billing 5× more per input token. xAI re-pointed legacy Grok API slugs to Grok 4.3 billing, changing the price of unchanged integrations. Gemini's over-200K-token surcharge also raised effective long-context rates for some workloads. For that reason, this log records retirements, routing changes, and migration effects alongside rate-card updates.

What gets logged

Six event types match the dataset schema: price_change, new_model, benchmark_update, deprecation_announced, sunset, and context_change. Pricing means published list rates for input, output, and, where offered, cached input. Promotional and negotiated enterprise rates stay out. Open-weight records with no license charge are logged as $0-license entries; that is not a $0 infrastructure claim. Verified changes also appear in the site changelog and affected pricing pages.

Frequently asked

Can I use this data in my own work?

Yes: CC BY 4.0. Articles, research, dashboards, commercial tools, all fine. Attribution is a link to benchr.org. The JSON downloads include every entry's official source.

Why no entries before June 1, 2026?

Entries are only added on the day they're verified against the provider's live page. Backfilled numbers from memory or stale snippets are how price datasets rot. Older documented moves appear as sourced context above, outside the log.

Do AI prices ever go up?

Rate cards rarely rise; effective costs do. Forced migrations (Gemini 2.5 → 3.5 Flash), slug re-pointing (Grok), and long-context surcharges all raised real bills in 2026. The log tracks those events, not just list prices.

How often is this updated?

Whenever a verified event lands, typically within a day or two of a provider announcement. The deprecations RSS feed and main feed carry the alerts.

Changelog

July 24, 2026 — Reframed the DeepSeek V4-Pro reduction as a June 1 rate-card snapshot and removed the unsupported claim that the price would continue indefinitely.
June 12, 2026 — Page published. Log contains 14 entries (June 1 baselines plus the June 10 Fable 5 and GPT-5.4 additions). Pre-log context moves sourced to provider announcements.

Sources

benchr history.json — the append-only log (per-entry sources inside)
benchr deprecations.json — lifecycle events with official source URLs
Provider pricing pages: platform.claude.com · openai.com/api/pricing · ai.google.dev/gemini-api/docs/pricing · api-docs.deepseek.com · x.ai/api (all re-verified June 12, 2026)
Anthropic Claude Opus 4.5 announcement (Opus-lane repricing, Nov 2025) — anthropic.com/news/claude-opus-4-5

AI API price history: every verified change, on the record

How this log works

The log so far

Documented moves from before the log

And the increases

What gets logged

Frequently asked

Changelog

Sources

Deprecations hub.

Cost calculator.

Pricing comparison 2026.