Pricing Guide ยท Microsoft
Phi-4 API Pricing
Detailed token costs, context window limits, benchmark performance, and caching structures for Phi-4.
Phi-4 Cost Breakdown
| Dimension | Value / Cost |
|---|---|
| Input Tokens Cost / 1M | Self-hosted |
| Output Tokens Cost / 1M | Self-hosted |
| Cached Input Tokens Cost / 1M | โ |
| Max Context Window | 16,000 tokens |
| SWE-bench Verified Score | 30.0% |
| GPQA Diamond Score | 56.1% |
Best suited for:
- Local inference on consumer hardware \n
- Edge deployment \n
- Reasoning at tiny scale