GPT-5.4 API pricing: the middle child with a case

Double GPT-5's rate, half of GPT-5.5's. Three months after launch, the in-between tier is quietly the value pick for context-heavy professional work.

By the benchr team · · Figures verified against official sources, June 10, 2026 · View changelog

Input / 1MOpenAI · March 2026
Output / 1MOpenAI
Cached input / 1M90% discount
OSWorld-Verifiedcomputer use, official

Pricing breakdown

gpt-5.4 — official OpenAI pricing, June 10, 2026
TierRate / 1M tokens
Standard input$2.50
Standard output$15.00
Cached input$0.25
Long-context (input >272K)surcharge applies
Context windowup to 1,000,000 tokens
Max output128,000 tokens

The model id is gpt-5.4. The family launched March 5, 2026 as GPT-5.4 Thinking and GPT-5.4 Pro, with mini and nano following on March 17. Mini is the free-tier ChatGPT model; nano is API-only. GPT-5.5 took the flagship crown seven weeks later, which is exactly why this page exists: superseded models get price-stable, and price-stable mid-tiers are where the value hides.

Where it sits: the OpenAI ladder

OpenAI's current text-model tiers, official rates
ModelInput / 1MOutput / 1MContext
GPT-5 Mini$0.25$2.00400K
GPT-5$1.25$10.00400K
GPT-5.4$2.50$15.00up to 1M
GPT-5.5$5.00$30.001.05M

Read the ladder by what each doubling buys. GPT-5 to GPT-5.4: the context ceiling jumps from 400K to 1M, computer use arrives, and the finance work gets a tuned model. That's a real capability step. GPT-5.4 to GPT-5.5: the benchmark ceiling rises (84.0% SWE-bench Verified, officially published) and you pay double again for it. Teams that need the context but not the absolute ceiling are the ones the middle tier is for. The OpenAI pricing guide walks the whole lineup.

The 272K surcharge, again

Like GPT-5.5, GPT-5.4's advertised million-token context comes with a pricing cliff: inputs beyond the 272K-token standard window bill at higher rates per OpenAI's pricing page. The failure mode is a long-running session that quietly crosses the line and reprices the whole input. The defenses are the same ones the 272K-cliff explainer covers for 5.5: track session token counts, summarize-and-restart before the line, or chunk via retrieval. If your sessions never approach 272K, the cliff is irrelevant and the headline rate is what you pay.

Caching at $0.25: the quiet bargain

GPT-5.4's cached input at $0.25/1M makes it one of the cheapest frontier-tier models for prompt-heavy agents: cheaper per cached token than GPT-5's standard input, and a fifth of Sonnet 4.6's cached rate. A workflow with an 80K-token stable prefix called 5,000 times a day bills roughly $100/day at the cached rate against $1,000 uncached. For document-heavy professional work, the model's home turf, that discount compounds fast. Run your mix in the cost calculator.

The Excel angle

GPT-5.4 launched alongside ChatGPT for Excel, tuned on real finance workflows — modeling, scenario analysis, long-form research. OpenAI reported its internal investment-banking benchmark jumping from 43.7% with GPT-5 to 87.3% with GPT-5.4 Thinking, and OSWorld-Verified computer use at 75% against a 72.4% human baseline. Those numbers are OpenAI's own, but they explain the model's positioning: it was built to be the professional-work tier, not the everything tier. Spreadsheet-heavy teams should read the spreadsheets roundup before picking.

Use-case fit

Best for: Financial modeling and document analysis; computer-use pipelines that don't need GPT-5.5's ceiling; context-heavy work in the 400K–1M range where GPT-5 can't follow; prompt-heavy agents that exploit GPT-5.4's $0.25 cached rate.

Skip if: You want OpenAI's strongest — that's GPT-5.5 at double the price; your tasks fit in 400K context and short prompts — GPT-5 at half price covers it; you need cheap volume — GPT-5 Mini costs a tenth.

Frequently asked

How much does GPT-5.4 cost per million tokens?

GPT-5.4 costs $2.50/1M input and $15/1M output, with cached input at $0.25/1M. Inputs beyond the 272K standard window carry a long-context surcharge per OpenAI's pricing page. Released March 5, 2026; still fully available after GPT-5.5 took the flagship slot.

Is GPT-5.4 worth it over GPT-5 or GPT-5.5?

It's the cost-per-quality middle. Double GPT-5's price buys context up to 1M (vs 400K), built-in computer use at 75% OSWorld-Verified, and finance tuning — at exactly half of GPT-5.5's $5/$30. Pick it when you need the context or computer use but not the flagship ceiling.

What is the GPT-5.4 long-context surcharge?

Higher rates apply once input exceeds the 272K-token standard window — the same cliff structure as GPT-5.5. If sessions regularly cross 272K, model costs at the surcharge rate or chunk inputs to stay under the line.

Changelog

  • — Published. Pricing and context verified on OpenAI's pricing page and release notes; OSWorld and finance-benchmark figures attributed to OpenAI's launch material. Note: benchr's index briefly removed GPT-5.4 on June 1 as "unverified" — that was an error, corrected June 10 with full re-verification.

Sources

  • OpenAI API pricing — openai.com/api/pricing (verified June 10, 2026)
  • OpenAI, "Introducing GPT-5.4 mini and nano" — openai.com/index/introducing-gpt-5-4-mini-and-nano (verified June 10, 2026)
  • OpenAI model release notes — help.openai.com (verified June 10, 2026)
  • benchr model-figures.json — verified June 10, 2026