AI API price history: every verified change, on the record

Providers rewrite their pricing pages and the old numbers vanish. This log keeps them: append-only, dated, and tied to official sources.

By the benchr team · · Every logged figure carries its official source and verification date · View changelog

Entries loggedsince June 1, 2026
Cheapest input / 1MDeepSeek V4-Flash
Priciest output / 1MClaude Fable 5
Price spreadoutput, $0.28 to $50 per 1M

How this log works

Three rules, enforced since the first entry. Append-only: entries never get edited or removed, even when they age badly. That's what makes it a record. Same-day verification: a number enters the log only on a day benchr confirmed it on the provider's own pricing page or announcement, never from a search snippet or a third-party table. Source attached: every entry names where the number came from, so you can re-check it yourself.

That discipline has a cost: the log opens June 1, 2026, and refuses to backfill. A price archive you can trust beats a longer one you can't. Documented older moves appear below as context, with their own sources, clearly fenced off from the log.

The log so far

Append-only entries from history.json: input/output per 1M tokens at time of logging
LoggedModelEventInputOutput
2026-06-10Claude Fable 5New model (Jun 9 release)$10.00$50.00
2026-06-10GPT-5.4Baseline (Mar 5 release)$2.50$15.00
2026-06-01Claude Opus 4.8Baseline (May 28 release)$5.00$25.00
2026-06-01GPT-5.5Baseline (Apr 23 release)$5.00$30.00
2026-06-01GPT-5Baseline (Aug 2025 release)$1.25$10.00
2026-06-01GPT-5 MiniBaseline$0.25$2.00
2026-06-01Gemini 3.5 FlashBaseline (May 19 GA)$1.50$9.00
2026-06-01Grok 4.3Baseline (Apr release)$1.25$2.50
2026-06-01DeepSeek V4-ProBaseline (Apr 24 release)$0.435$0.87
2026-06-01DeepSeek V4-FlashBaseline$0.14$0.28
2026-06-01Kimi K2.6Baseline (Apr 20 release)$0.95$4.00
2026-06-01Mistral Large 3Baseline (Dec 2 release)$0.50$1.50
2026-06-01Llama 4 Maverick / ScoutBaseline (open weights)$0 license$0 license

Most entries are baselines: the verified starting point each future change gets measured against. That's how an honest price database begins: you can't detect a change without a trustworthy "before."

Documented moves from before the log

These predate June 1, 2026, so they live outside the append-only dataset, but each is documented by the provider's own announcements and worth keeping in view:

The Opus lane fell two-thirds. Claude Opus 4 and 4.1 billed $15/$75 per million tokens through 2025. Since Opus 4.5 (November 2025), the lane bills $5/$25 — and both old models retire this summer, making the cut universal.

GPT-4o halved within months. The May 2024 launch snapshot billed $5/$15; later 2024 snapshots billed $2.50/$10. The original snapshot, still billing 2024 prices, shuts down October 23, 2026.

Reasoning got cheap. o1 launched December 2024 at $15/$60, and o1-pro hit $150/$600 in March 2025, OpenAI's priciest model ever. Today GPT-5.5 reasons better at $5/$30.

DeepSeek reset the floor twice. V4's April 2026 launch pricing undercut every closed model, and in June 2026 DeepSeek made its 75% V4-Pro reduction permanent at $0.435/$0.87, confirmed on api-docs.deepseek.com after third-party tables wrongly reported a rise.

And the increases

Headline rates rarely rise. Costs do — through routing and forced migration. Google's October 16 cutoff moves Gemini 2.5 Flash traffic onto a model billing 5× more per input token. xAI re-pointed legacy Grok API slugs to Grok 4.3 billing, repricing calls nobody touched. Gemini's over-200K-token surcharge doubled effective long-context rates for some workloads. A price database that only read rate cards would miss all three, which is why this log records sunsets and migrations alongside price changes.

What gets logged

Six event types, matching the dataset's schema: price_change, new_model, benchmark_update, deprecation_announced, sunset, and context_change. Pricing means published list rates for input, output, and (where offered) cached input; promotional and negotiated enterprise rates stay out. Free open-weight models are logged as $0-license entries so self-hosting comparisons stay honest about what "free" means. When a change lands, it also hits the site changelog and the affected pricing pages the same day.

Frequently asked

Can I use this data in my own work?

Yes: CC BY 4.0. Articles, research, dashboards, commercial tools, all fine. Attribution is a link to benchr.org. The JSON downloads include every entry's official source.

Why no entries before June 1, 2026?

Entries are only added on the day they're verified against the provider's live page. Backfilled numbers from memory or stale snippets are how price datasets rot. Older documented moves appear as sourced context above, outside the log.

Do AI prices ever go up?

Rate cards rarely rise; effective costs do. Forced migrations (Gemini 2.5 → 3.5 Flash), slug re-pointing (Grok), and long-context surcharges all raised real bills in 2026. The log tracks those events, not just list prices.

How often is this updated?

Whenever a verified event lands, typically within a day or two of a provider announcement. The deprecations RSS feed and main feed carry the alerts.

Changelog

  • — Page published. Log contains 14 entries (June 1 baselines plus the June 10 Fable 5 and GPT-5.4 additions). Pre-log context moves sourced to provider announcements.

Sources

  • benchr history.json — the append-only log (per-entry sources inside)
  • benchr deprecations.json — lifecycle events with official source URLs
  • Provider pricing pages: platform.claude.com · openai.com/api/pricing · ai.google.dev/gemini-api/docs/pricing · api-docs.deepseek.com · x.ai/api (all re-verified June 12, 2026)
  • Anthropic Claude Opus 4.5 announcement (Opus-lane repricing, Nov 2025) — anthropic.com/news/claude-opus-4-5