Every AI model, in one place.
Pricing, benchmarks, provider latency, and how teams actually use each one.
- NnousresearchNous: Hermes 3 405B Instruct (free)
Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...
Language131k ctx$0.00/M - Ssao10kSao10K: Llama 3 8B Lunaris
Lunaris 8B is a versatile generalist and roleplaying model based on Llama 3. It's a strategic merge of multiple models, designed to balance creativity with improved logic and general knowledge....
Language8k ctx$0.04/M - openaiOpenAI: GPT-4o (2024-08-06)
The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respone_format. Read more [here](https://openai.com/index/introducing-structured-outputs-in-the-api/). GPT-4o ("o" for "omni") is...
Language128k ctx$2.50/M - meta-llamaMeta: Llama 3.1 70B Instruct
Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high quality dialogue usecases. It has demonstrated strong...
Language131k ctx$0.40/M - meta-llamaMeta: Llama 3.1 8B Instruct
Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...
Language16k ctx$0.02/M - mistralaiMistral: Mistral Nemo
A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese,...
Language131k ctx$0.02/M - openaiOpenAI: GPT-4o-mini
GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...
Language128k ctx$0.15/M - openaiOpenAI: GPT-4o-mini (2024-07-18)
GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...
Language128k ctx$0.15/M - googleGoogle: Gemma 2 27B
Gemma 2 27B by Google is an open model built from the same research and technology used to create the [Gemini models](/models?q=gemini). Gemma models are well-suited for a variety of...
Language8k ctx$0.65/M - Ssao10kSao10k: Llama 3 Euryale 70B v2.1
Euryale 70B v2.1 is a model focused on creative roleplay from [Sao10k](https://ko-fi.com/sao10k). - Better prompt adherence. - Better anatomy / spatial awareness. - Adapts much better to unique and custom...
Language8k ctx$1.48/M - NnousresearchNousResearch: Hermes 2 Pro - Llama-3 8B
Hermes 2 Pro is an upgraded, retrained version of Nous Hermes 2, consisting of an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced...
Language8k ctx$0.14/M - openaiOpenAI: GPT-4o
GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...
Language128k ctx$2.50/M - openaiOpenAI: GPT-4o (2024-05-13)
GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...
Language128k ctx$5.00/M - meta-llamaMeta: Llama 3 70B Instruct
Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 70B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong...
Language8k ctx$0.51/M - meta-llamaMeta: Llama 3 8B Instruct
Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong...
Language8k ctx$0.03/M - mistralaiMistral: Mixtral 8x22B Instruct
Mistral's official instruct fine-tuned version of [Mixtral 8x22B](/models/mistralai/mixtral-8x22b). It uses 39B active parameters out of 141B, offering unparalleled cost efficiency for its size. Its strengths include: - strong math, coding,...
Language66k ctx$2.00/M - microsoftWizardLM-2 8x22B
WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to leading proprietary models, and it consistently outperforms all existing state-of-the-art opensource models. It is...
Language66k ctx$0.62/M - openaiOpenAI: GPT-4 Turbo
The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to December 2023.
Language128k ctx$10.00/M - anthropicAnthropic: Claude 3 Haiku
Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. See the launch announcement and benchmark results [here](https://www.anthropic.com/news/claude-3-haiku) #multimodal
Language200k ctx$0.25/M - mistralaiMistral Large
This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/)....
Language128k ctx$2.00/M - openaiOpenAI: GPT-3.5 Turbo (older v0613)
GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.
Language4k ctx$1.00/M - openaiOpenAI: GPT-4 Turbo Preview
The preview GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Dec 2023. **Note:** heavily rate limited by OpenAI while...
Language128k ctx$10.00/M - mistralaiMistral: Mixtral 8x7B Instruct
Mixtral 8x7B Instruct is a pretrained generative Sparse Mixture of Experts, by Mistral AI, for chat and instruction use. Incorporates 8 experts (feed-forward networks) for a total of 47 billion...
Language33k ctx$0.54/M - AalpindaleGoliath 120B
A large LLM created by combining two fine-tuned Llama 70B models into one 120B model. Combines Xwin and Euryale. Credits to - [@chargoddard](https://huggingface.co/chargoddard) for developing the framework used to merge...
Language6k ctx$3.75/M - openrouterAuto Router
"Your prompt will be processed by a meta-model and routed to one of dozens of models (see below), optimizing for the best possible output. To see which model was used,...
Image2000k ctxFree tier - openaiOpenAI: GPT-4 Turbo (older v1106)
The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to April 2023.
Language128k ctx$10.00/M - mistralaiMistral: Mistral 7B Instruct v0.1
A 7.3B parameter model that outperforms Llama 2 13B on all benchmarks, with optimizations for speed and context length.
Language3k ctx$0.11/M - openaiOpenAI: GPT-3.5 Turbo Instruct
This model is a variant of GPT-3.5 Turbo tuned for instructional prompts and omitting chat-related optimizations. Training data: up to Sep 2021.
Language4k ctx$1.50/M - openaiOpenAI: GPT-3.5 Turbo 16k
This model offers four times the context length of gpt-3.5-turbo, allowing it to support approximately 20 pages of text in a single request at a higher cost. Training data: up...
Language16k ctx$3.00/M - MmancerMancer: Weaver (alpha)
An attempt to recreate Claude-style verbosity, but don't expect the same level of coherence or memory. Meant for use in roleplay/narrative situations.
Language8k ctx$0.75/M - Uundi95ReMM SLERP 13B
A recreation trial of the original MythoMax-L2-B13 but with updated models. #merge
Language6k ctx$0.45/M - GgrypheMythoMax 13B
One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. #merge
Language4k ctx$0.06/M - openaiOpenAI: GPT-3.5 Turbo
GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.
Language16k ctx$0.50/M - openaiOpenAI: GPT-4
OpenAI's flagship model, GPT-4 is a large-scale multimodal language model capable of solving difficult problems with greater accuracy than previous models due to its broader general knowledge and advanced reasoning...
Language8k ctx$30.00/M - openaiOpenAI: GPT-4 (older v0314)
GPT-4-0314 is the first version of GPT-4 released, with a context length of 8,192 tokens, and was supported until June 14. Training data: up to Sep 2021.
Language8k ctx$30.00/M - openaiDALL-E 2
Previous generation with good performance
Image4k ctx$18.00/M - openaiDALL-E 3
Most advanced image generation model
Image4k ctx$40.00/M - googleGemini 2.5 Flash
Fast, affordable Google model with thinking
Language1049k ctx$0.15/M - mistralaiMistral Large 2
Flagship Mistral model with strong multilingual
Language128k ctx$2.00/M - BbytedanceSeedance 1.5 Pro
Simultaneous video and audio generation with multi-language lip-sync and cinematic camera control.
Video0$0.00/M - BbytedanceSeedance 2.0
First unified audio-video joint generation model with phoneme-level lip-sync in 8+ languages.
Video0$0.00/M - BbytedanceSeedance 2.0 Fast
Speed-optimized Seedance variant prioritizing fast generation and lower cost.
Video0$0.00/M - openaiSora 2 Pro
Production-quality video with physics-accurate motion, synchronized audio, and world-state persistence across shots.
Video0$0.00/M - SstabilityaiStable Diffusion 3
Open-source image generation with great quality
Image1k ctx$6.00/M - openaiTTS-1 (Fast)
Fast text-to-speech with natural voice
Voice4k ctx$0.01/M - openaiTTS-1 HD (High Quality)
High-quality text-to-speech output
Voice4k ctx$0.03/M - googleVeo 3.1
State-of-the-art video generation built for maximum visual fidelity in final production cuts.
Video0$0.00/M - alibabaWan 2.6
Unified video generation system supporting 10+ visual creation capabilities.
Video0$0.00/M