Every AI model, in one place.
Pricing, benchmarks, provider latency, and how teams actually use each one.
- Newx-aixAI: Grok 4.3
Grok 4.3 is a reasoning model from xAI. It accepts text and image inputs with text output, and is suited for agentic workflows, instruction-following tasks, and applications requiring high factual...
Language1000k ctx$1.25/M - NewIibm-graniteIBM: Granite 4.1 8B
Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks...
Language131k ctx$0.05/M - NewopenrouterOwl Alpha
Owl Alpha is a high-performance foundation model designed for agentic workloads. Natively supports tool use, and long-context tasks, with strong performance in code generation, automated workflows, and complex instruction execution....
Language1049k ctx$0.00/M - NewnvidiaNVIDIA: Nemotron 3 Nano Omni (free)
NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and...
Language256k ctx$0.00/M - NewPpoolsidePoolside: Laguna XS.2 (free)
Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai), their efficient coding agent series. It combines tool calling and reasoning capabilities with a compact footprint, offering...
Language131k ctx$0.00/M - NewPpoolsidePoolside: Laguna M.1 (free)
Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai), optimized for complex software engineering tasks. Designed for agentic coding workflows, it supports tool calling and reasoning, with a 128K...
Language131k ctx$0.00/M - New~~anthropicAnthropic Claude Haiku Latest
This model always redirects to the latest model in the Anthropic Claude Haiku family.
Language200k ctx$1.00/M - New~~openaiOpenAI GPT Mini Latest
This model always redirects to the latest model in the OpenAI GPT Mini family.
Language400k ctx$0.75/M - New~~googleGoogle Gemini Pro Latest
This model always redirects to the latest model in the Google Gemini Pro family.
Language1049k ctx$2.00/M - New~~moonshotaiMoonshotAI Kimi Latest
This model always redirects to the latest model in the MoonshotAI Kimi family.
Language262k ctx$0.74/M - New~~googleGoogle Gemini Flash Latest
This model always redirects to the latest model in the Google Gemini Flash family.
Language1049k ctx$0.50/M - New~~anthropicAnthropic Claude Sonnet Latest
This model always redirects to the latest model in the Anthropic Claude Sonnet family.
Language1000k ctx$3.00/M - New~~openaiOpenAI GPT Latest
This model always redirects to the latest model in the OpenAI GPT family.
Language1050k ctx$5.00/M - NewqwenQwen: Qwen3.5 Plus 2026-04-20
Qwen3.5 Plus (April 2026) is a large-scale multimodal language model from Alibaba. It accepts text, image, and video input and produces text output, with a 1M token context window. This...
Language1000k ctx$0.40/M - NewqwenQwen: Qwen3.6 Flash
Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M token context window. Tiered pricing kicks in...
Language1000k ctx$0.25/M - NewqwenQwen: Qwen3.6 35B A3B
Qwen3.6-35B-A3B is an open-weight multimodal model from Alibaba Cloud with 35 billion total parameters and 3 billion active parameters per token. It uses a hybrid sparse mixture-of-experts architecture combining Gated...
Language262k ctx$0.16/M - NewqwenQwen: Qwen3.6 Max Preview
Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse mixture-of-experts architecture with approximately 1 trillion total parameters. It is optimized for agentic coding, tool use, and...
Language262k ctx$1.04/M - NewqwenQwen: Qwen3.6 27B
Qwen3.6 27B is a dense 27-billion-parameter language model from the Qwen Team at Alibaba, released in April 2026. It features hybrid multimodal capabilities — accepting text, image, and video inputs...
Language256k ctx$0.33/M - NewopenaiOpenAI: GPT-5.5 Pro
GPT-5.5 Pro is OpenAI’s high-capability model optimized for deep reasoning and accuracy on complex, high-stakes workloads. It features a 1M+ token context window (922K input, 128K output) with support for...
Language1050k ctx$30.00/M - NewopenaiOpenAI: GPT-5.5
GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. It features a 1M+ token...
Language1050k ctx$5.00/M - NewdeepseekDeepSeek: DeepSeek V4 Pro
DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding,...
Language1049k ctx$0.43/M - NewdeepseekDeepSeek: DeepSeek V4 Flash
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...
Language1049k ctx$0.14/M - NewIinclusionaiinclusionAI: Ling-2.6-1T (free)
Ling-2.6-1T is an instant (instruct) model from inclusionAI and the company’s trillion-parameter flagship, designed for real-world agents that require fast execution and high efficiency at scale. It uses a “fast...
Language262k ctx$0.00/M - NewTtencentTencent: Hy3 preview (free)
Hy3 preview is a high-efficiency Mixture-of-Experts model from Tencent designed for agentic workflows and production use. It supports configurable reasoning levels across disabled, low, and high modes, allowing it to...
Language262k ctx$0.00/M - NewXxiaomiXiaomi: MiMo-V2.5-Pro
MiMo-V2.5-Pro is Xiaomi’s flagship model, delivering strong performance in general agentic capabilities, complex software engineering, and long-horizon tasks, with top rankings on benchmarks such as ClawEval, GDPVal, and SWE-bench Pro....
Language1049k ctx$1.00/M - NewXxiaomiXiaomi: MiMo-V2.5
MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding...
Language1049k ctx$0.40/M - NewIinclusionaiinclusionAI: Ling-2.6-flash
Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....
Language262k ctx$0.08/M - New~~anthropicAnthropic: Claude Opus Latest
This model always redirects to the latest model in the Claude Opus family.
Language1000k ctx$5.00/M - NewopenrouterPareto Code Router
The Pareto Router is a way to have OpenRouter always pick a strong coding model for your needs without committing to a specific one. You express a single `min_coding_score` preference...
Language200k ctxFree tier - NewBbaiduBaidu: Qianfan-OCR-Fast (free)
Qianfan-OCR-Fast is a domain-specific multimodal large model purpose-built for OCR. By leveraging specialized OCR training data while preserving versatile multimodal intelligence, it provides a powerful performance upgrade over Qianfan-OCR.
Language66k ctx$0.00/M - NewmoonshotaiMoonshotAI: Kimi K2.6
Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and...
Language262k ctx$0.74/M - NewanthropicAnthropic: Claude Opus 4.7
Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on...
Language1000k ctx$5.00/M - NewanthropicAnthropic: Claude Opus 4.6 (Fast)
Fast-mode variant of [Opus 4.6](/anthropic/claude-opus-4.6) - identical capabilities with higher output speed at premium 6x pricing. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode
Language1000k ctx$30.00/M - Newz-aiZ.ai: GLM 5.1
GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...
Language203k ctx$1.05/M - NewgoogleGoogle: Gemma 4 26B A4B
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...
Language262k ctx$0.06/M - NewgoogleGoogle: Gemma 4 26B A4B (free)
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...
Language262k ctx$0.00/M - NewgoogleGoogle: Gemma 4 31B
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...
Language262k ctx$0.13/M - NewgoogleGoogle: Gemma 4 31B (free)
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...
Language262k ctx$0.00/M - NewqwenQwen: Qwen3.6 Plus
Qwen 3.6 Plus builds on a hybrid architecture that combines efficient linear attention with sparse mixture-of-experts routing, enabling strong scalability and high-performance inference. Compared to the 3.5 series, it delivers...
Language1000k ctx$0.33/M - Newz-aiZ.ai: GLM 5V Turbo
GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...
Language203k ctx$1.20/M - Newarcee-aiArcee AI: Trinity Large Thinking
Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. Launch video: https://youtu.be/Gc82AXLa0Rg?si=4RLn6WBz33qT--B7
Language262k ctx$0.22/M - Newx-aixAI: Grok 4.20 Multi-Agent
Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information...
Language2000k ctx$2.00/M - Newx-aixAI: Grok 4.20
Grok 4.20 is xAI's newest flagship model with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherance, delivering consistently...
Language2000k ctx$1.25/M - NewKkwaipilotKwaipilot: KAT-Coder-Pro V2
KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT’s KAT-Coder series, designed for complex enterprise-grade software engineering and SaaS integration. It builds on the agentic coding strengths of earlier versions,...
Language256k ctx$0.30/M - NewRrekaaiReka Edge
Reka Edge is an extremely efficient 7B multimodal vision-language model that accepts image/video+text inputs and generates text outputs. This model is optimized specifically to deliver industry-leading performance in image understanding,...
Language16k ctx$0.10/M - NewXxiaomiXiaomi: MiMo-V2-Omni
MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...
Language262k ctx$0.40/M - NewXxiaomiXiaomi: MiMo-V2-Pro
MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1T total parameters and a 1M context length, deeply optimized for agentic scenarios. It is highly adaptable to general agent frameworks like...
Language1049k ctx$1.00/M - NewMminimaxMiniMax: MiniMax M2.7
MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement. Built to actively participate in its own evolution, M2.7 integrates advanced agentic capabilities through multi-agent...
Language197k ctx$0.30/M