Best AI models for cost-sensitive tasks

Cheap-and-fast models for high-volume work where 80% quality at 5% cost is the right trade.

Triage, classification, lightweight transforms, and high-fanout jobs (one prompt, many rows) all reward cheap models with predictable latency. Reserve the flagship LLMs for the few prompts where quality actually moves the metric.
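The routing rule above can be sketched as a minimal dispatcher. This is an illustrative sketch only: the task labels are assumptions, the model identifiers are drawn from the list below, and the flagship identifier is a placeholder, not a real API call.

```python
# Minimal sketch of cost-based model routing.
# Model IDs below are illustrative; check your provider's catalog for real ones.
CHEAP_MODEL = "openai/gpt-4o-mini"            # $0.15/Mtok input (from the list below)
FLAGSHIP_MODEL = "anthropic/claude-sonnet-4"  # placeholder flagship, assumption

# High-volume, low-stakes task types that tolerate "80% quality at 5% cost".
CHEAP_TASKS = {"triage", "classification", "transform", "fanout"}

def pick_model(task_type: str) -> str:
    """Route bulk work to the cheap model; reserve the flagship for everything else."""
    return CHEAP_MODEL if task_type in CHEAP_TASKS else FLAGSHIP_MODEL
```

In practice the dispatch key might be a classifier score or a per-route config rather than a hand-written set, but the shape is the same: default cheap, escalate rarely.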

Switchy's picks

  1. Anthropic: Claude Haiku 4.5

    Claude Haiku 4.5 is Anthropic’s fastest and most efficient model, delivering near-frontier intelligence at a fraction of the cost and latency of larger Claude models. Matching Claude Sonnet 4’s performance...

    Anthropic · 200K context · $1.00/Mtok input
  2. Google: Gemini 2.5 Flash

    A fast, affordable Google model with built-in thinking (reasoning) support.

    Google · 1049K context · $0.15/Mtok input
  3. OpenAI: GPT-4o mini

    GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

    OpenAI · 128K context · $0.15/Mtok input
  4. OpenAI: GPT-4.1 Mini

    GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard...

    OpenAI · 1048K context · $0.40/Mtok input
