Meta: Llama 4 Maverick
Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...
Anyone in the Space can @-mention Meta: Llama 4 Maverick with the team's shared context — pooled credits, one chat, one memory.
Starter is free forever — 1 Space, 100 credits/month, 1 MCP. No card.
Specifications
- Provider
- meta-llama
- Category
- llm
- Context length
- 1,048,576 tokens
- Max output
- 16,384 tokens
- Modalities
- text, image
- License
- proprietary
- Released
- 2025-04-05
Pricing
- Input
- $0.15/Mtok
- Output
- $0.60/Mtok
- Model ID
meta-llama/llama-4-maverick
Per-token prices show what the model costs upstream. On Switchy your team draws from one shared org credit pool — one plan, one balance for everyone.
Team cost calculator
5 seats · 80 msgs/day
Switchy meters this against your org's shared credit pool — one plan, one balance for everyone.
Providers
| Provider | Context | Input | Output | P50 latency | Throughput | 30d uptime |
|---|---|---|---|---|---|---|
| meta-llama | 1049k | $0.15/Mtok | $0.60/Mtok | — | — | — |
Performance
Benchmarks
Works well with
Top MCPs
Compatibility data comes from first-party telemetry; once we have enough co-usage signal, top MCPs for this model will appear here.