Llama 4 Maverick

MetaFlagship

Open-weight 17B active params (400B total MoE). Top-tier open model for complex tasks.

Pricing

Input (per 1M tokens)
$0.27
Output (per 1M tokens)
$0.85

Cost Examples

ScenarioCost
100 chat messages (500 in / 200 out each)$0.0305
1K API calls (2K in / 500 out each)$0.9650
10K short requests (200 in / 100 out each)$1.3900
1M tokens input + 100K output$0.3550

Capabilities

Context Window
1.0M
Max Output
33K
Speed
60 tok/s
Released
2025-04-05
YesVision
YesTool Use
YesStreaming
YesJSON Mode

API Details

Model IDmeta-llama/Llama-4-Maverick-17B-128E-Instruct
API Endpointhttps://api.together.xyz/v1/chat/completions
Last Updated2026-04-12

Other Meta Models

Open Cost Calculatorto compare Llama 4 Maverick costs with other models