# NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

Provider: nvidia  
Category: llm  
Model ID: `nvidia/llama-3.3-nemotron-super-49b-v1.5`

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning and chat model derived from Meta’s Llama-3.3-70B-Instruct, with a 128K-token context window. It is post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

## Specs

- Context length: 131072 tokens
- Max output: unknown
- Modalities: text
- Released: 2025-10-10
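
The 131072-token figure is 128 × 1024, which matches the "128K" in the description. A minimal sketch of a pre-flight budget check follows. It assumes the context window is shared between the prompt and the generated output, which this listing does not confirm (max output is listed as unknown); `fits_in_context` and its parameters are hypothetical names used only for illustration.

```python
# Context length from the specs above, in tokens (128 * 1024).
CONTEXT_LENGTH = 131_072


def fits_in_context(prompt_tokens: int, reserved_output_tokens: int) -> bool:
    """Return True if the prompt plus the reserved output budget fits.

    Assumption: prompt and output share one context window.
    """
    return prompt_tokens + reserved_output_tokens <= CONTEXT_LENGTH


# Hypothetical request sizes, chosen only for illustration.
print(fits_in_context(120_000, 8_000))
print(fits_in_context(130_000, 2_000))
```

Token counts depend on the model's tokenizer, so counts produced by a different tokenizer are only approximate.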

## Pricing

- Input: $0.10 per million tokens
- Output: $0.40 per million tokens
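
As a rough illustration of how these rates translate into per-request cost, the sketch below applies the two listed prices to a token count. The function name `estimate_cost_usd` and the example request sizes are hypothetical, and actual billing may differ (for example, in rounding or in how reasoning tokens are counted).

```python
# Prices from the listing above, in USD per million tokens.
INPUT_PRICE_PER_M = 0.10
OUTPUT_PRICE_PER_M = 0.40


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD from its token counts."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical request: 20,000 prompt tokens and 2,000 completion tokens.
cost = estimate_cost_usd(20_000, 2_000)
print(f"${cost:.4f}")
```

Output tokens cost four times as much as input tokens at these rates, so long generations dominate the bill more quickly than long prompts do.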

## Providers

- **nvidia** — ctx 131072, input $0.10/M, output $0.40/M

---
Last verified: 2026-04-23T23:46:29.618Z  
Canonical URL: https://switchy.build/models/llama-3-3-nemotron-super-49b-v1-5