Best AI models for vision analysis
Multimodal models that read screenshots, diagrams, charts, and document layouts.
Vision quality is uneven across models: some excel at chart reading, others at OCR-heavy document layouts, others at UI screenshots. Most flagship LLMs now ship vision, and the quality differences show up mainly under stress (small fonts, dense diagrams, mixed languages).
Switchy's picks
- 1. Anthropic: Claude Sonnet 4.5
  Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with...
  anthropic, 1000K context, $3.00/Mtok in
- 2. OpenAI: GPT-4.1
  GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and...
  openai, 1048K context, $2.00/Mtok in
- 3. Anthropic: Claude Opus 4.5
  Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive performance across real-world coding and...
  anthropic, 200K context, $5.00/Mtok in
- 4. Google: Gemini 2.5 Pro Preview 06-05
  Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...
  google, 1049K context, $1.25/Mtok in
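To compare the picks on price, the listed input rates ($/Mtok in) can be turned into a rough per-request estimate. The sketch below is illustrative only: the function names and the 50,000-token request size are hypothetical, output-token pricing is not shown in the listing and is ignored, and how each provider converts an image into input tokens is not covered here, so the token count has to come from your own measurements.

```python
# Input prices in USD per million input tokens, copied from the listing above.
INPUT_PRICE_PER_MTOK = {
    "Claude Sonnet 4.5": 3.00,
    "GPT-4.1": 2.00,
    "Claude Opus 4.5": 5.00,
    "Gemini 2.5 Pro Preview 06-05": 1.25,
}


def input_cost_usd(model: str, input_tokens: int) -> float:
    """Estimated input-side cost in USD for a given number of input tokens."""
    return INPUT_PRICE_PER_MTOK[model] * input_tokens / 1_000_000


def rank_by_input_cost(input_tokens: int) -> list[tuple[str, float]]:
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = [(model, input_cost_usd(model, input_tokens))
             for model in INPUT_PRICE_PER_MTOK]
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical request size: 50,000 input tokens (text plus image tokens).
    for model, cost in rank_by_input_cost(50_000):
        print(f"{model}: ${cost:.4f}")
```

Input price is only one axis; a cheaper model that misreads small fonts or dense diagrams may cost more overall once retries are counted.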
Other LLM models
- AI21: Jamba Large 1.7 (ai21)
- AionLabs: Aion-1.0 (aion-labs)
- AionLabs: Aion-1.0-Mini (aion-labs)
- AionLabs: Aion-2.0 (aion-labs)
- AionLabs: Aion-RP 1.0 (8B) (aion-labs)
- AlfredPros: CodeLLaMa 7B Instruct Solidity (alfredpros)
- AllenAI: Olmo 3 32B Think (allenai)
- AllenAI: Olmo 3.1 32B Instruct (allenai)
- Amazon: Nova 2 Lite (amazon)
- Amazon: Nova Lite 1.0 (amazon)
- Amazon: Nova Micro 1.0 (amazon)
- Amazon: Nova Premier 1.0 (amazon)