# Qwen: Qwen3 VL 32B Instruct

Provider: qwen  
Category: llm  
Model ID: `qwen/qwen3-vl-32b-instruct`

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

## Specs

- Context length: 131072 tokens
- Max output: 32768 tokens
- Modalities: text, image
- Released: 2025-10-23

## Pricing

- Input: $0.10 per million tokens
- Output: $0.42 per million tokens

## Providers

- **qwen** — ctx 131072, input $0.10/M, output $0.42/M

---
Last verified: 2026-04-23T23:46:29.618Z  
Canonical URL: https://switchy.build/models/qwen3-vl-32b-instruct