Llama 4 Scout
MetaStandardOpen-weight 17B active params (109B total MoE). Strong multilingual and long-context performance.
Pricing
Input (per 1M tokens)
$0.18
Output (per 1M tokens)
$0.35
Cost Examples
| Scenario | Cost |
|---|---|
| 100 chat messages (500 in / 200 out each) | $0.0160 |
| 1K API calls (2K in / 500 out each) | $0.5350 |
| 10K short requests (200 in / 100 out each) | $0.7100 |
| 1M tokens input + 100K output | $0.2150 |
Capabilities
Context Window
512K
Max Output
33K
Speed
100 tok/s
Released
2025-04-05
YesVision
YesTool Use
YesStreaming
YesJSON Mode
API Details
Model ID
meta-llama/Llama-4-Scout-17B-16E-InstructAPI Endpoint
https://api.together.xyz/v1/chat/completionsLast Updated2026-04-12
Other Meta Models
Open Cost Calculatorto compare Llama 4 Scout costs with other models