LLMperplexity

Perplexity: Sonar Deep Research

Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthesis, and reasoning across complex topics. It autonomously searches, reads, and evaluates sources, refining its approach as it gathers...

Anyone in the Space can @-mention Perplexity: Sonar Deep Research with the team's shared context - pooled credits, one chat, one memory.

All models

Starter is free forever - 1 Space, 100 credits/month, 1 MCP. No card.

Verdict

Sonar Deep Research is Perplexity's specialized model for multi-step research tasks that require synthesizing information across dozens of sources. It excels at producing comprehensive reports with inline citations, trading speed and cost for thoroughness. At $8/Mtok output, it's 4-8× more expensive than general-purpose models, so reserve it for tasks where breadth of research justifies the premium. Reach for this when you need a research assistant that reads widely before answering, not a fast chatbot.

Best for

Multi-source research reports with citations
Competitive analysis across industry sources
Literature reviews spanning academic papers
Market research requiring diverse perspectives
Fact-checking claims against multiple sources

Strengths

Sonar Deep Research performs multi-step web searches before generating responses, pulling from dozens of sources to build comprehensive answers. The model returns inline citations for every claim, making it trivial to verify statements or dive deeper into source material. Its 128K context window accommodates large research briefs, and the architecture prioritizes recall over speed — it will spend 30-60 seconds researching before responding, which suits investigative tasks where thoroughness matters more than latency.

Trade-offs

Output costs are steep at $8/Mtok — roughly 4× Claude Sonnet 4 and 8× GPT-4o — making this model expensive for conversational use or iterative drafting. Response latency runs 30-90 seconds as the model conducts searches, which breaks interactive workflows. The 128K context is modest compared to Gemini 2.0 Flash's 1M window, limiting its ability to synthesize book-length documents. No public benchmarks exist yet, so performance on standard reasoning tasks remains unquantified relative to peers.

Specifications

Provider: perplexity
Category: llm
Context length: 128,000 tokens
Max output: —
Modalities: text
License: proprietary
Released: 2025-03-07

Pricing

Input: $2.00/Mtok
Output: $8.00/Mtok
Model ID: perplexity/sonar-deep-research

Per-token prices show what the model costs upstream. On Switchy your team draws from one shared org credit pool - one plan, one balance for everyone.

Team cost calculator

Seats5 peopleMessages / seat / day80Avg turn size2 ktokOutput share30 %

Estimated monthly spend

$66.88

17.6M tokens / month
5 seats · 80 msgs/day

Switchy meters this against your org's shared credit pool - one plan, one balance for everyone.

Providers

Provider	Context	Input	Output	P50 latency	Throughput	30d uptime
perplexity	128k	$2.00/Mtok	$8.00/Mtok	—	—	—

Performance

Performance snapshots are collected daily. Check back after the next ingestion run.

Benchmarks

Public benchmark scores are not available yet for this model. Check back after the next ingestion run.

Works well with

Top MCPs

Compatibility data comes from first-party telemetry; once we have enough co-usage signal, top MCPs for this model will appear here.

How Switchy teams use it

Not enough Spaces have used this model yet to share anonymised team stats. We wait for at least 50 distinct Spaces per week before publishing any aggregate.

Starter prompts

Competitive Landscape Report

Research the competitive landscape for [product category]. Identify the top 5 players, their pricing strategies, key differentiators, and recent funding or product launches. Cite all sources.

Open in a Space →

Technical Feasibility Assessment

Assess the feasibility of using [technology/approach] for [use case]. Survey recent academic papers, production implementations, and known limitations. Provide citations for all claims.

Open in a Space →

Regulatory Compliance Summary

Summarize [regulation type] requirements for [industry] across the US, EU, and UK. Highlight key differences and recent updates. Cite official sources.

Open in a Space →

Historical Context Deep Dive

Trace the development of [concept/event] from [start year] to present. Include key milestones, influential figures, and shifts in understanding. Cite historical sources.

Open in a Space →

Product Feature Comparison

Compare [feature category] across [Product A], [Product B], and [Product C]. Include pricing, limitations, and user feedback. Cite product docs and reviews.

Open in a Space →

Example outputs

Illustrative - representative of the model's voice and quality, not literal recordings.

Prompt

Research the current state of sodium-ion battery technology for grid storage. I need a comprehensive overview of commercial viability, key players, and technical limitations compared to lithium-ion.

Output

Sodium-ion batteries have reached commercial deployment as of 2023, with CATL and BYD shipping grid-scale installations in China. The technology offers 70-80% of lithium-ion's energy density (140-160 Wh/kg vs 200-250 Wh/kg) but eliminates cobalt dependency and performs better in extreme cold. Cost parity exists at the cell level ($80-90/kWh), though system integration remains 15-20% more expensive due to lower voltage. Faradion (acquired by Reliance) and Natron Energy lead Western development, targeting stationary storage rather than EVs. The main technical barrier is cycle life: current sodium-ion chemistries deliver 3,000-5,000 cycles versus 6,000-8,000 for LFP lithium-ion.

Notes

This example highlights Sonar Deep Research's strength in synthesizing technical information across multiple sources with specific numbers and company names. The 128K context window allows processing extensive research papers and reports in a single pass. However, without benchmark data, we can't verify citation accuracy or hallucination rates—critical for research tasks.

Prompt

I'm evaluating whether to build our SaaS product on Supabase or Firebase. Compare their real-time capabilities, pricing at 50K MAU, and vendor lock-in risks. Include recent community sentiment.

Output

Supabase's real-time engine uses PostgreSQL's logical replication, giving you row-level subscriptions with SQL filtering—Firebase's Firestore offers document listeners but no server-side filtering before transmission. At 50K MAU, Supabase estimates $200-400/month (database + auth + storage), while Firebase typically runs $300-600 depending on read patterns. Lock-in differs: Supabase runs on standard Postgres, so migration paths exist to RDS or self-hosted; Firebase's Firestore structure requires application rewrites to exit. Recent Reddit and HN threads (Q4 2024) show Supabase users praising the SQL flexibility but reporting occasional connection pool exhaustion under traffic spikes—Firebase users cite predictable scaling but frustration with query limitations.

Notes

Demonstrates the model's ability to synthesize technical comparisons with pricing data and community signals. The research depth suits technical decision-making. The $2 input cost means a thorough research query could run $0.20-0.40, which is reasonable for high-stakes decisions but expensive for casual exploration.

Prompt

What are the regulatory and technical barriers to deploying autonomous delivery robots in European cities? Focus on Germany, France, and the UK with specific examples from 2023-2024.

Output

Germany's StVG amendment (June 2023) permits sidewalk robots under 25 km/h in designated zones, but municipalities retain veto power—Berlin approved trials in Mitte, while Munich blocked them citing pedestrian density. France's February 2024 decree limits robots to 6 km/h in pedestrian areas and requires human remote supervision within 100m, effectively preventing scalability. The UK's regulatory sandbox allows broader testing, with Starship Technologies operating 100+ robots in Milton Keynes and Northampton, though the Highway Code still classifies them as 'invalid carriages.' Technical barriers center on winter operation (snow/ice detection remains unreliable) and theft prevention—Serve Robotics reported 12% of Paris trial units damaged or stolen versus 2% in controlled UK environments.

Notes

Showcases Sonar Deep Research's strength in regulatory analysis with specific dates, legislation names, and real-world deployment data. The model appears designed for this multi-source synthesis work. The $8/Mtok output cost means a 500-word research summary like this costs roughly $0.004—negligible, making the input cost the primary consideration.

Use-case deep-dives

Weekly competitive intelligence reports

When Sonar Deep Research beats hiring a junior analyst for market tracking

A 12-person B2B SaaS startup needs weekly competitor feature updates and pricing changes but can't justify a full-time analyst. Sonar Deep Research is the right call here. The model searches live web sources and synthesizes findings into structured reports—exactly what you'd brief an intern to do, but at $2/$8 per Mtok instead of $4k/month salary. The 128k context window handles multi-company comparisons in one pass. You'll spend roughly $0.40-$1.20 per report depending on output length. The threshold: if you need daily updates or sub-hour latency, this won't work—Sonar's research mode adds 30-90 seconds per query. But for weekly or bi-weekly cadences where thoroughness beats speed, route these prompts to Sonar and keep your team focused on product.

Due diligence document synthesis

Why legal teams use Sonar for M&A research phases, not contract drafting

A 4-attorney firm handling mid-market acquisitions needs to pull regulatory filings, news archives, and public records on target companies before the letter of intent. Sonar Deep Research is built for this: it retrieves and cross-references sources that a paralegal would manually compile over 6-8 hours, then summarizes findings with citations in 3-5 minutes. The $8/Mtok output cost is high compared to Gemini 1.5 Flash ($0.30) or GPT-4o Mini ($0.60), but you're paying for the research layer—not just inference. The model won't draft the diligence memo or negotiate terms; it front-loads the discovery phase. If your workflow is 80% research and 20% writing, Sonar pays off. If it's inverted—mostly drafting with light fact-checking—use a cheaper model with web search as a fallback tool.

Customer support knowledge base audits

When Sonar Deep Research finds gaps in your help docs faster than your support lead

A 25-person e-commerce platform runs a Zendesk help center with 200+ articles, but ticket volume suggests users aren't finding answers. The support lead wants to audit coverage: which common questions have no article, which articles contradict each other, which are outdated. Sonar Deep Research handles this in one 128k-token pass—it crawls your public docs, cross-references ticket themes (you paste summaries), and flags gaps with citations. You'll pay roughly $1.50-$3.00 for a full audit depending on corpus size. Compare that to 12-16 hours of manual work. The catch: Sonar won't rewrite the articles or generate new ones—it's a diagnostic tool. If you need the audit plus the rewrites, pair Sonar for discovery with Claude 3.5 Sonnet for drafting. For quarterly or bi-annual audits, this is the fastest path to a prioritized fix list.

Frequently asked

Is Perplexity Sonar Deep Research good for research tasks?

Yes, it's purpose-built for deep research workflows. The 128K context window lets you feed in multiple papers or long documents at once. It's designed to synthesize information across sources rather than just answer quick queries. If you need citation-heavy analysis or literature reviews, this is the model to use.

Is Perplexity Sonar Deep Research cheaper than GPT-4?

Input is cheaper at $2/Mtok versus GPT-4's typical $10-30/Mtok, but output costs $8/Mtok which sits in the mid-range. For research tasks where you're feeding in large documents and getting back detailed summaries, the input savings matter more. You'll spend less than GPT-4 Turbo on most research workflows.

Can it handle 128K tokens without degrading quality?

The 128K window is advertised, but without public benchmarks we can't verify retrieval accuracy across the full context. Most models show quality drops past 64K tokens. Test it on your actual documents before trusting it with critical research at maximum capacity. Start with 50-80K token loads and validate outputs.

How does this compare to standard Perplexity Sonar models?

Deep Research is optimized for multi-step reasoning and synthesis versus Sonar's fast-answer design. You're trading speed for depth. Standard Sonar models return results in seconds; Deep Research takes longer but produces more thorough analysis. Use standard Sonar for quick lookups, Deep Research for actual research projects.

Should I use this for customer-facing chat applications?

No. It's too slow and expensive for real-time chat. The $8/Mtok output cost adds up fast in conversational loops, and users expect sub-second responses. Use GPT-4o-mini or Claude Haiku for chat. Reserve Deep Research for backend analysis tasks where latency doesn't matter and thoroughness does.