LLM Token Pricing
(Pricing data as of December 2025)
Token pricing data from major LLM providers, grouped into Frontier, Mid, and OSS tiers, each split into High and Value sub-tiers. This page presents both median pricing by tier and model-level detail: input price, output price, and context window.
Median Token Prices by Tier
Detailed Model Pricing
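| Tier | Provider | Model | Input price (USD per 1M tokens) | Output price (USD per 1M tokens) | Context window (tokens) |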
|---|---|---|---|---|---|
| Frontier High | Anthropic | Claude Opus 4.5 | $5.00 | $25.00 | 200,000 |
| Frontier High | Together.AI | Cogito v2 preview – 405B | $3.50 | $3.50 | 32,000 |
| Frontier High | Together.AI | Cogito v2 preview – 671B MoE | $1.25 | $1.25 | 128,000 |
| Frontier High | Together.AI | DeepSeek-R1 | $3.00 | $7.00 | 128,000 |
| Frontier High | Fireworks.AI | DeepSeek-R1 | $1.35 | $5.40 | 128,000 |
| Frontier High | Google | Gemini 3 Pro | $2.00 | $12.00 | 1,000,000 |
| Frontier High | OpenAI | GPT-5 pro | $15.00 | $120.00 | 400,000 |
| Frontier High | OpenAI | GPT-5.1 | $1.25 | $10.00 | 400,000 |
| Frontier High | Fireworks.AI | Llama 3.1 405B Instruct Turbo | $3.00 | $3.00 | 128,000 |
| Frontier High | Together.AI | Llama 3.1 405B Instruct Turbo | $3.50 | $3.50 | 128,000 |
| Frontier High | Fireworks.AI | Qwen 3 Coder 480B | $0.45 | $1.80 | 256,000 |
| Frontier High | Together.AI | Qwen3-Coder 480B A35B Instruct | $2.00 | $2.00 | 256,000 |
| Frontier Value | Anthropic | Claude Haiku 4.5 | $1.00 | $5.00 | 200,000 |
| Frontier Value | Anthropic | Claude Sonnet 4.5 | $3.00 | $15.00 | 200,000 |
| Frontier Value | Together.AI | Cogito v2 preview – 109B MoE | $0.18 | $0.59 | 32,000 |
| Frontier Value | Together.AI | DeepSeek-V3 | $1.25 | $1.25 | 128,000 |
| Frontier Value | Fireworks.AI | DeepSeek-V3 | $0.56 | $1.68 | 128,000 |
| Frontier Value | OpenAI | GPT-5 mini | $0.25 | $2.00 | 400,000 |
| Frontier Value | OpenAI | GPT-5 nano | $0.05 | $0.40 | 400,000 |
| Frontier Value | Fireworks.AI | Mixtral 8x22B | $1.20 | $1.20 | 32,000 |
| Frontier Value | Together.AI | Qwen3 235B A22B Thinking 2507 FP8 | $0.65 | $3.00 | 256,000 |
| Mid High | Fireworks.AI | >16B parameters | $0.90 | $0.90 | — |
| Mid High | Together.AI | Cogito v2 preview – 70B | $0.88 | $0.88 | 32,000 |
| Mid High | Together.AI | DeepSeek R1 Distilled Llama 70B | $2.00 | $2.00 | 128,000 |
| Mid High | Google | Gemini 2.5 Flash | $0.30 | $2.50 | 1,000,000 |
| Mid High | Fireworks.AI | GLM-4.6 | $0.55 | $2.19 | 128,000 |
| Mid High | Together.AI | GLM-4.6 | $0.60 | $2.20 | 256,000 |
| Mid High | Together.AI | Kimi K2 Instruct | $1.00 | $3.00 | 128,000 |
| Mid High | Together.AI | Kimi K2 Thinking | $1.20 | $4.00 | 256,000 |
| Mid High | Fireworks.AI | Kimi K2 Thinking | $0.60 | $2.50 | 8,000 |
| Mid High | Together.AI | Llama 3.1 70B Instruct Turbo | $0.88 | $0.88 | 128,000 |
| Mid High | Together.AI | Llama 3.3 70B Instruct-Turbo | $0.88 | $0.88 | 128,000 |
| Mid High | Fireworks.AI | Llama 4 Maverick | $0.22 | $0.88 | 1,000,000 |
| Mid High | Together.AI | Llama 4 Maverick | $0.27 | $0.85 | 1,000,000 |
| Mid High | Together.AI | Qwen 2.5 72B | $1.20 | $1.20 | 32,000 |
| Mid High | Together.AI | Qwen 3 235B | $0.15 | $1.50 | 128,000 |
| Mid High | Fireworks.AI | Qwen 3 235B | $0.22 | $0.88 | 128,000 |
| Mid High | Together.AI | Qwen 3 Next 80B | $0.15 | $1.50 | 256,000 |
| Mid High | Together.AI | Qwen3 Next 80B A3B Instruct | $0.15 | $1.50 | 262,000 |
| Mid High | Together.AI | Qwen3 Next 80B A3B Thinking | $0.15 | $1.50 | 256,000 |
| Mid High | Together.AI | Typhoon 2 70B Instruct | $0.88 | $0.88 | 128,000 |
| Mid Value | Fireworks.AI | 4B - 16B parameters | $0.20 | $0.20 | — |
| Mid Value | Together.AI | DeepSeek R1 Distilled Qwen 14B | $0.18 | $0.18 | 128,000 |
| Mid Value | Together.AI | Llama 3.1 8B Instruct Turbo | $0.18 | $0.18 | 128,000 |
| Mid Value | Fireworks.AI | Llama 4 Scout | $0.15 | $0.60 | 1,000,000 |
| Mid Value | Together.AI | Llama 4 Scout | $0.18 | $0.59 | 1,000,000 |
| Mid Value | Together.AI | Mistral Small 3 | $0.80 | $0.80 | 32,000 |
| Mid Value | Together.AI | Mixtral 8x7B | $0.60 | $0.60 | 32,000 |
| Mid Value | Fireworks.AI | Mixtral 8x7B | $0.50 | $0.50 | 32,000 |
| Mid Value | Together.AI | Qwen2.5 7B Instruct Turbo | $0.30 | $0.30 | 128,000 |
| OSS High | Together.AI | GLM-4.5 Air | $0.20 | $1.10 | 128,000 |
| OSS High | Together.AI | Llama 3 70B Instruct Reference | $0.88 | $0.88 | 8,000 |
| OSS High | Together.AI | Llama 3 8B Instruct | $0.10 | $0.10 | 8,000 |
| OSS High | Together.AI | Mistral 7B Instruct | $0.20 | $0.20 | 32,000 |
| OSS High | Fireworks.AI | Qwen 3 30B | $0.15 | $0.60 | 128,000 |
| OSS High | Together.AI | Qwen QwQ-32B | $1.20 | $1.20 | 128,000 |
| OSS High | Together.AI | Qwen2.5 Coder 32B Instruct | $0.80 | $0.80 | 32,000 |
| OSS Value | Fireworks.AI | <4B parameters | $0.10 | $0.10 | — |
| OSS Value | Together.AI | gemma-3n-E4B-it | $0.02 | $0.04 | 32,000 |
| OSS Value | Together.AI | Llama 3.2 3B Instruct Turbo | $0.06 | $0.06 | 128,000 |
| OSS Value | Together.AI | OpenAI gpt-oss-20b | $0.05 | $0.20 | 128,000 |
| OSS Value | Fireworks.AI | OpenAI gpt-oss-20b | $0.07 | $0.30 | 128,000 |
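
To show how the input and output prices in the table combine into a per-request cost, here is a minimal sketch. It assumes the listed prices are in USD per 1,000,000 tokens. The function name `request_cost` and the token counts are hypothetical and chosen only for illustration.

```python
TOKENS_PER_PRICE_UNIT = 1_000_000  # assumption: prices are quoted per 1M tokens


def request_cost(input_tokens, output_tokens, input_price, output_price):
    """Return the list-price cost in USD of one request.

    input_price and output_price are the table values (USD per 1M tokens).
    Cached-input pricing and volume discounts are ignored, as in the table.
    """
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_PRICE_UNIT


# Claude Sonnet 4.5 row: $3.00 input, $15.00 output.
# Hypothetical request: 10,000 input tokens and 2,000 output tokens.
cost = request_cost(10_000, 2_000, 3.00, 15.00)
print(f"${cost:.4f}")  # prints "$0.0600"
```

Because output tokens are often priced several times higher than input tokens, the output share can dominate the total even when the prompt is much longer than the response, as in this example where both halves contribute $0.03.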
1) Pricing data as of December 2025. Input prices, output prices, and maximum context windows were taken from official provider pricing pages. Cached input token prices were excluded because providers report them inconsistently. Models without public pricing or context data were removed.
2) Models shown are the representative models for each pricing category. Tiers reflect list pricing, model family, expected performance class, and provider positioning. Tier medians are computed across these representative models. For a detailed explanation of the tier classifications, see the Model Tier Framework note.
3) This dataset reflects public list prices only; enterprise and volume discounts are not included.
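
Note 2 states that tier medians are computed across the representative models. The sketch below shows one way such medians could be derived, using the OSS High and OSS Value rows copied from the table. Whether the page's own medians are calculated exactly this way (for example, whether the generic Fireworks.AI size-band rows are included) is an assumption, and the name `tier_medians` is hypothetical.

```python
from collections import defaultdict
from statistics import median

# (tier, provider, model, input_price, output_price) copied from the table.
# Only the two OSS tiers are included to keep the example short.
ROWS = [
    ("OSS High", "Together.AI", "GLM-4.5 Air", 0.20, 1.10),
    ("OSS High", "Together.AI", "Llama 3 70B Instruct Reference", 0.88, 0.88),
    ("OSS High", "Together.AI", "Llama 3 8B Instruct", 0.10, 0.10),
    ("OSS High", "Together.AI", "Mistral 7B Instruct", 0.20, 0.20),
    ("OSS High", "Fireworks.AI", "Qwen 3 30B", 0.15, 0.60),
    ("OSS High", "Together.AI", "Qwen QwQ-32B", 1.20, 1.20),
    ("OSS High", "Together.AI", "Qwen2.5 Coder 32B Instruct", 0.80, 0.80),
    ("OSS Value", "Fireworks.AI", "<4B parameters", 0.10, 0.10),
    ("OSS Value", "Together.AI", "gemma-3n-E4B-it", 0.02, 0.04),
    ("OSS Value", "Together.AI", "Llama 3.2 3B Instruct Turbo", 0.06, 0.06),
    ("OSS Value", "Together.AI", "OpenAI gpt-oss-20b", 0.05, 0.20),
    ("OSS Value", "Fireworks.AI", "OpenAI gpt-oss-20b", 0.07, 0.30),
]


def tier_medians(rows):
    """Return {tier: (median_input_price, median_output_price)}."""
    inputs = defaultdict(list)
    outputs = defaultdict(list)
    for tier, _provider, _model, input_price, output_price in rows:
        inputs[tier].append(input_price)
        outputs[tier].append(output_price)
    return {tier: (median(inputs[tier]), median(outputs[tier])) for tier in inputs}


MEDIANS = tier_medians(ROWS)
for tier, (input_median, output_median) in MEDIANS.items():
    print(f"{tier}: input ${input_median:.2f}, output ${output_median:.2f}")
```

The median is used rather than the mean so that a single expensive model in a tier does not pull the summary figure upward.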