Deprecation·June 2026

Gemini 2.5 Pro and Flash retire October 16, 2026

Google's recorded replacements for Gemini 2.5 Pro and Flash have different prices. Calculate the effect on your workload before the October date.

By the benchr team · Updated July 24, 2026 · Verified against Google's Gemini API deprecation, model, and pricing docs, July 24, 2026 · View changelog

Models and dates in scope

The 2.5 generation carried Google's API story through most of 2025; 2.5 Pro was the first Gemini many teams took seriously for reasoning, and 2.5 Flash became the default cheap multimodal pick. Both got their end date when Google's deprecation table put October 16, 2026 next to each ID. The 2.0 Flash family already went dark June 1, and Gemini 3 Pro Preview lasted just 16 weeks before its March 9 shutdown. By November, nothing before the Gemini 3 line will answer an API call.

One Google-specific wrinkle: the company publishes these as earliest-possible dates and promises advance confirmation of the real one. Don't read that as a reprieve. Plan for October 16; treat anything later as a bonus.

What migration does to the bill

Official migration paths: list price per 1M tokens, verified June 12, 2026
Path	Input, today	Input, after	Output, today	Output, after
2.5 Pro → 3.1 Pro	$1.25	$2.00	$10.00	$12.00
2.5 Flash → 3.6 Flash	$0.30	$1.50	$2.50	$7.50

Spelled out: the Pro path costs 60% more on input and 20% more on output, and the long-context surcharge above 200K tokens rises in step ($2.50/$15 becomes $4/$18). The Flash path is the painful one. A pipeline pushing 500M input and 50M output tokens a month pays $275 today on 2.5 Flash — and $1,125 on 3.6 Flash. Same traffic, about 4.1× the invoice.

To be fair to the current Flash replacement: Gemini 3.6 Flash is a stable GA model, and its $7.50 output rate is lower than 3.5 Flash's $9. Google now lists it as the recommended replacement for Gemini 2.5 Flash. That makes it the documented migration target, not an automatic winner for every workload; test quality and token use on your own traffic. The point is that "recommended replacement" and "equivalent cost" stopped being the same thing, and Google's docs won't do that math for you.

Migration checks before cutover

Model IDs: swap to gemini-3.1-pro-preview or gemini-3.6-flash. Note the 3.1 Pro replacement still carries the Preview label — Google retired a preview into a preview once already this year, so pin your tests to current behavior rather than assuming long stability.

Thinking budgets: the Gemini 3 line reasons by default and bills those tokens as output. Output-heavy traffic that was priced on 2.5-era token counts will drift. Watch the first invoices, not just the rate card.

Vertex AI users: Google Cloud's deprecation calendar is managed separately from the Gemini API's. The dates usually rhyme; they aren't guaranteed identical. Check the Vertex model table for your project's region.

Re-quote before you re-platform. Ten minutes in the cost calculator with your real token volumes settles whether the official path, a cheaper provider, or a split (3.6 Flash for hard calls, a budget model for bulk) wins. The rankings show where every current model sits on price and capability.

Frequently asked

When exactly do 2.5 Pro and 2.5 Flash stop working?

Google's deprecation table lists October 16, 2026 for both, framed as the earliest possible date, with the exact one confirmed in advance. Plan for October 16.

Will the migration raise my bill?

On list price, yes. The Pro path adds 60% on input; the Flash path to Gemini 3.6 Flash multiplies input by 5 and output by 3. Whether better quality and lower token use offset that depends on your workload. Run your own volumes.

What already shut down this year?

Gemini 3 Pro Preview on March 9, 2026 (Vertex listed it discontinued by March 26) and the Gemini 2.0 Flash family on June 1, 2026.

Changelog

July 24, 2026 — Updated Google's current Gemini 2.5 Flash replacement to Gemini 3.6 Flash and recalculated the migration examples at its $1.50/$7.50 standard price.
June 12, 2026 — Published. Shutdown dates from ai.google.dev/gemini-api/docs/deprecations; pricing from ai.google.dev/gemini-api/docs/pricing; both verified today.

Sources

Gemini API deprecations — ai.google.dev/gemini-api/docs/deprecations (verified July 24, 2026)
Gemini API pricing — ai.google.dev/gemini-api/docs/pricing (verified July 24, 2026)
Gemini 3.6 Flash model page — ai.google.dev/gemini-api/docs/models/gemini-3.6-flash (verified July 24, 2026)
benchr deprecations dataset · model-figures.json

Gemini 2.5 Pro and Flash retire October 16, 2026

Models and dates in scope

What migration does to the bill

Migration checks before cutover

Frequently asked

Changelog

Sources

Gemini 3.6 Flash, launched.

Cheapest LLM APIs, ranked.

All deprecations.