Pricing Guide · Meta

Llama 4 Scout API Pricing

Detailed token costs, context window limits, benchmark performance, and caching structures for Llama 4 Scout.

Last reviewed: June 6, 2026 · Sourced from official docs

Llama 4 Scout Cost Breakdown

Dimension                       Value / Cost
Input Tokens Cost / 1M          Self-hosted
Output Tokens Cost / 1M         Self-hosted
Cached Input Tokens Cost / 1M   N/A
Max Context Window              10,000,000 tokens
SWE-bench Verified Score        56.0%
GPQA Diamond Score              57.2%
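
Because the token costs are listed as self-hosted, the effective price per 1M tokens depends on your own hardware and throughput rather than on a published rate. The sketch below shows one rough way to estimate it. The function name and both inputs (hourly hardware cost and sustained tokens per second) are illustrative assumptions, not figures from this guide.

```python
def cost_per_million_tokens(hourly_hardware_cost: float, tokens_per_second: float) -> float:
    """Estimate the self-hosted cost of processing 1M tokens.

    Both arguments are values you measure or budget yourself (assumptions,
    not published prices): what the hardware costs per hour, and the
    sustained throughput it achieves in tokens per second.
    """
    if hourly_hardware_cost < 0:
        raise ValueError("hourly_hardware_cost must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")

    tokens_per_hour = tokens_per_second * 3600
    # Cost of one token, scaled up to one million tokens.
    return hourly_hardware_cost / tokens_per_hour * 1_000_000


if __name__ == "__main__":
    # Hypothetical inputs for illustration only.
    estimate = cost_per_million_tokens(hourly_hardware_cost=36.0, tokens_per_second=1000.0)
    print(f"Estimated cost per 1M tokens: {estimate:.2f}")
```

The estimate ignores idle time, batching effects, and any difference between input and output throughput, so treat it as a starting point rather than a quote.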

Best suited for:

  • Ultra-long-context tasks (10M token window)
  • Fast self-hosted inference
  • Free multimodal at scale
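
For ultra-long-context work, it can help to check a request against the 10,000,000-token window before sending it. The following is a minimal sketch. It assumes the window covers prompt and generated tokens together and that you already have a token count from your own tokenizer. The function name and parameters are hypothetical.

```python
MAX_CONTEXT_TOKENS = 10_000_000  # context window listed in the table above


def fits_in_context(prompt_tokens: int, reserved_output_tokens: int = 0) -> bool:
    """Return True if the prompt plus reserved output fits in the window.

    Assumption: input and output tokens share the same context window.
    """
    if prompt_tokens < 0 or reserved_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + reserved_output_tokens <= MAX_CONTEXT_TOKENS


if __name__ == "__main__":
    # Hypothetical token counts for illustration only.
    print(fits_in_context(prompt_tokens=9_500_000, reserved_output_tokens=4_096))
```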
