# Qwen: Qwen3 VL 8B Instruct

Provider: qwen  
Category: llm  
Model ID: `qwen/qwen3-vl-8b-instruct`

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

## Specs

- Context length: 131072 tokens
- Max output: 32768 tokens
- Modalities: image, text
- Released: 2025-10-14

## Pricing

- Input: $0.08 per million tokens
- Output: $0.50 per million tokens

## Providers

- **qwen** — ctx 131072, input $0.08/M, output $0.50/M

---
Last verified: 2026-04-23T23:46:29.618Z  
Canonical URL: https://switchy.build/models/qwen3-vl-8b-instruct