Llama 4 Maverick
MetaFlagshipOpen-weight 17B active params (400B total MoE). Top-tier open model for complex tasks.
Pricing
Input (per 1M tokens)
$0.27
Output (per 1M tokens)
$0.85
Cost Examples
| Scenario | Cost |
|---|---|
| 100 chat messages (500 in / 200 out each) | $0.0305 |
| 1K API calls (2K in / 500 out each) | $0.9650 |
| 10K short requests (200 in / 100 out each) | $1.3900 |
| 1M tokens input + 100K output | $0.3550 |
Capabilities
Context Window
1.0M
Max Output
33K
Speed
60 tok/s
Released
2025-04-05
YesVision
YesTool Use
YesStreaming
YesJSON Mode
API Details
Model ID
meta-llama/Llama-4-Maverick-17B-128E-InstructAPI Endpoint
https://api.together.xyz/v1/chat/completionsLast Updated2026-04-12
Other Meta Models
Open Cost Calculatorto compare Llama 4 Maverick costs with other models