LLMupstage

Upstage: Solar Pro 3

Solar Pro 3 is Upstage's powerful Mixture-of-Experts (MoE) language model. With 102B total parameters and 12B active parameters per forward pass, it delivers exceptional performance while maintaining computational efficiency. Optimized...

Anyone in the Space can @-mention Upstage: Solar Pro 3 with the team's shared context - pooled credits, one chat, one memory.

All models

Starter is free forever - 1 Space, 100 credits/month, 1 MCP. No card.

Verdict

Solar Pro 3 delivers strong general-purpose performance at a price point that undercuts many frontier models. With a 128K context window and $0.15/$0.60 per Mtok pricing, it sits in the sweet spot for teams running high-volume workflows where GPT-4-class intelligence matters but API costs add up fast. The lack of public benchmark data means you're buying on Upstage's reputation rather than verified numbers. Best for cost-conscious teams already comfortable with Upstage's ecosystem who need long-context reasoning without the OpenAI or Anthropic price tag.

Best for

  • High-volume workflows needing cost control
  • Long-context document processing under budget
  • Teams already using Upstage models
  • General reasoning tasks at scale

Strengths

The 128K context window handles full-length documents, codebases, and multi-turn conversations without truncation. At $0.15 input and $0.60 output per million tokens, it runs roughly 40% cheaper than GPT-4o for comparable workloads. Upstage's previous Solar models showed competitive performance on Korean and English tasks, suggesting this iteration maintains strong multilingual capabilities. The pricing structure favors read-heavy use cases where you're processing large inputs and generating concise outputs.

Trade-offs

No public benchmarks means you can't compare Solar Pro 3 directly against Claude, GPT-4o, or Gemini on standard evals like MMLU or HumanEval. This makes it harder to predict performance on your specific tasks without running your own tests. The model likely trails frontier options on complex reasoning, advanced math, and nuanced instruction-following based on typical mid-tier model behavior. Output quality on specialized domains—legal analysis, medical reasoning, advanced code generation—remains unverified without benchmark transparency.

Specifications

Provider
upstage
Category
llm
Context length
128,000 tokens
Max output
Modalities
text
License
proprietary
Released
2026-01-27

Pricing

Input
$0.15/Mtok
Output
$0.60/Mtok
Model ID
upstage/solar-pro-3

Per-token prices show what the model costs upstream. On Switchy your team draws from one shared org credit pool - one plan, one balance for everyone.

Team cost calculator

Estimated monthly spend
$5.02
17.6M tokens / month
5 seats · 80 msgs/day

Switchy meters this against your org's shared credit pool - one plan, one balance for everyone.

Providers

ProviderContextInputOutputP50 latencyThroughput30d uptime
upstage128k$0.15/Mtok$0.60/Mtok

Performance

Performance snapshots are collected daily. Check back after the next ingestion run.

Benchmarks

Public benchmark scores are not available yet for this model. Check back after the next ingestion run.

Works well with

Top MCPs

Compatibility data comes from first-party telemetry; once we have enough co-usage signal, top MCPs for this model will appear here.

How Switchy teams use it

Not enough Spaces have used this model yet to share anonymised team stats. We wait for at least 50 distinct Spaces per week before publishing any aggregate.

Starter prompts

Summarize Long Documents

Read this full document and provide a 300-word executive summary highlighting the three most critical findings, key data points, and recommended actions.
Open in a Space →

Analyze Multi-File Codebases

Review these source files as a complete codebase. Identify architectural patterns, potential bugs, and suggest two high-impact refactoring opportunities with code examples.
Open in a Space →

Extract Structured Data

Extract all company names, funding amounts, and dates from this news article. Return results as a JSON array with fields: company, amount, date, source_sentence.
Open in a Space →

Draft Customer Responses

Based on this customer's full support ticket history and our product documentation, draft a 150-word response that addresses their question and suggests next steps.
Open in a Space →

Compare Multiple Sources

Compare these four research papers on the same topic. Identify where they agree, where they conflict, and highlight any methodology differences that explain discrepancies.
Open in a Space →

Example outputs

Illustrative - representative of the model's voice and quality, not literal recordings.

Prompt

Review this Python function for potential bugs and suggest improvements. Focus on edge cases and performance: ```python def calculate_discount(price, discount_percent): return price - (price * discount_percent / 100) ```

Output

The function has several issues: 1. **No input validation**: Negative prices or discount percentages above 100 could produce nonsensical results. Add guards: `if price < 0 or discount_percent < 0 or discount_percent > 100: raise ValueError` 2. **Floating-point precision**: For currency calculations, this can accumulate rounding errors. Consider using `Decimal` from the decimal module for financial accuracy. 3. **Missing type hints**: Add `def calculate_discount(price: float, discount_percent: float) -> float:` for better IDE support and documentation. Improved version: ```python from decimal import Decimal def calculate_discount(price: Decimal, discount_percent: Decimal) -> Decimal: if price < 0 or not (0 <= discount_percent <= 100): raise ValueError("Invalid price or discount") return price * (1 - discount_percent / 100) ```

Notes

Solar Pro 3's 128K context window enables it to handle substantial code review requests with full repository context. This example shows its methodical approach to identifying multiple issue categories (validation, precision, documentation). The model provides concrete fixes rather than vague suggestions, though without benchmark data we can't compare its accuracy to specialized code models.

Prompt

Summarize the key regulatory changes in this 8-page compliance memo and create a checklist of action items for our legal team. [memo text would follow]

Output

**Key Regulatory Changes:** 1. **Data Retention Requirements** (Section 2.3): New 7-year retention mandate for customer transaction records, effective Q3 2024 2. **Cross-Border Transfer Rules** (Section 4.1): Enhanced consent requirements for EU data transfers; standard contractual clauses now mandatory 3. **Breach Notification Timeline** (Section 5.2): Reporting window reduced from 72 to 48 hours for material breaches **Action Items Checklist:** - [ ] Audit current data retention policies by June 15 - [ ] Update privacy notices with new consent language - [ ] Implement automated breach detection monitoring - [ ] Train customer service on new transfer protocols - [ ] Review vendor contracts for SCC compliance - [ ] Schedule Q2 compliance review with external counsel

Notes

The 128K token context window is the standout feature here — Solar Pro 3 can ingest lengthy legal documents without truncation, making it practical for enterprise document analysis. At $0.15 per million input tokens, processing an 8-page memo costs fractions of a cent. The output demonstrates structured extraction, though we lack benchmark comparisons to gauge its accuracy against models like Claude or GPT-4 on legal reasoning tasks.

Prompt

I'm preparing a technical presentation on microservices architecture for non-technical stakeholders. Explain the concept using an analogy they'll understand, then outline three key benefits and three common pitfalls.

Output

**Analogy:** Think of microservices like a food court versus a traditional restaurant. In a restaurant (monolithic architecture), one kitchen handles everything — if the grill breaks, the entire operation stops. In a food court (microservices), each vendor operates independently. If the pizza oven fails, customers can still get tacos, sushi, and sandwiches. Each vendor specializes, scales independently, and can update their menu without coordinating with others. **Three Key Benefits:** 1. **Independent scaling**: High-traffic services get more resources without over-provisioning the entire system 2. **Faster deployment**: Teams ship updates to individual services without system-wide releases 3. **Technology flexibility**: Each service can use the optimal tech stack for its specific job **Three Common Pitfalls:** 1. **Operational complexity**: Managing dozens of services requires sophisticated monitoring and orchestration 2. **Network overhead**: Services communicate over networks, adding latency and failure points 3. **Data consistency challenges**: Maintaining transactional integrity across distributed services is significantly harder than in monolithic systems

Notes

This example highlights Solar Pro 3's ability to adapt technical explanations to audience level while maintaining accuracy. The pricing model ($0.60/Mtok output) makes it cost-effective for generating presentation materials and internal documentation at scale. However, without published benchmarks on instruction-following or technical writing tasks, it's unclear how its explanatory quality compares to established alternatives in the $0.15-$0.60 pricing tier.

Use-case deep-dives

Multi-document contract synthesis

Solar Pro 3 handles 128K-token contract bundles without summarization loss

A 4-person legal ops team receives 20-30 vendor agreements weekly, each with 3-5 referenced exhibits totaling 40K-60K tokens per bundle. Solar Pro 3's 128K context window means you can drop the entire package—master agreement, SOWs, addenda—into a single prompt and ask for liability clause extraction or renewal date rollups without pre-chunking. At $0.15 input, processing a 50K-token bundle costs $0.0075; the output summary runs another 2K tokens at $0.60/Mtok ($0.0012). Total per-contract cost sits around $0.009, which beats hiring a paralegal to manually cross-reference exhibits. If your bundles regularly exceed 100K tokens or you need sub-200ms latency, you'll hit the context ceiling or wait longer than Claude 3.5 Sonnet. Otherwise, this is the move for teams that want one-shot document synthesis without a RAG stack.

Overnight batch content localization

When Solar Pro 3 beats GPT-4o on cost for high-volume translation runs

A 12-person SaaS company ships 80K words of help-center content monthly across 6 languages (480K words/month total output). Solar Pro 3's $0.60/Mtok output pricing means each 1K-word article costs roughly $0.0012 to translate (assuming 1.5K output tokens per article). At 480 articles/month, that's $0.58 in output fees plus negligible input costs—under $1/month for the entire localization workload. GPT-4o charges $15/Mtok output, pushing the same job to $10.80/month. The 18× cost gap matters when you're running this nightly. The trade-off: Solar Pro 3 has no public FLORES or WMT scores, so you're flying blind on translation quality until you sample 20-30 outputs. If quality holds, lock this in and redirect the savings to human QA spot-checks.

Real-time customer email triage

Solar Pro 3 for sub-$5/month support routing at 200 emails/day

A 6-person e-commerce support team receives 200 inbound emails daily (6K/month) averaging 300 tokens each. They need instant routing: refund vs. shipping vs. product question. Solar Pro 3 processes each email for $0.000045 input + $0.00012 output (200 tokens), totaling $0.000165 per message or $0.99/month at 6K emails. The 128K context window is overkill here, but the pricing floor makes this cheaper than Gemini Flash ($0.00001875 input, $0.000075 output = $0.0000938/email) only if your output consistently exceeds 125 tokens per classification. If you're writing 20-word routing labels, Flash wins. If you're generating 150-word draft responses alongside the route, Solar Pro 3 saves $0.60/month at this volume—not huge, but it scales linearly. Test both on 100 real emails and pick the one that nails your label taxonomy.

Frequently asked

Is Solar Pro 3 good for general text tasks?

Solar Pro 3 handles standard text generation, summarization, and Q&A competently with its 128k context window. Without public benchmarks we can't verify its performance against GPT-4 or Claude, but the pricing suggests it targets mid-tier workloads. If you need proven quality for production, test it against established models first.

Is Solar Pro 3 cheaper than GPT-4o?

Yes, significantly. Solar Pro 3 costs $0.15 input and $0.60 output per million tokens versus GPT-4o's $2.50 input and $10.00 output. You're paying roughly 6-16× less depending on your input/output ratio. The trade-off is uncertain quality since Upstage hasn't published benchmark scores for this model.

Can Solar Pro 3 handle 128k tokens in practice?

The 128k context window matches GPT-4 and Claude 3 Sonnet, so it can technically process full-length books or large codebases. Real-world performance at max context depends on how Upstage trained the model. Test with your actual data at 100k+ tokens before committing to production workflows that rely on full-context accuracy.

How does Solar Pro 3 compare to Solar Pro 2?

We don't have benchmark data for either generation, so we can't quantify the improvement. If Upstage followed typical upgrade patterns, expect better instruction-following and reasoning. The context window and pricing appear unchanged. Request access to both versions and run your own evals if you're considering migration.

Should I use Solar Pro 3 for high-volume chatbots?

The pricing makes it attractive for high-volume use cases where cost per conversation matters more than peak quality. At $0.60 per million output tokens, you can serve 16× more responses than with GPT-4o for the same budget. Just validate response quality meets your bar since public benchmarks aren't available.

Data last verified 7 hours ago.Sources aggregated hourly to weekly. See docs/architecture/model-directory.md.