LLMwriter

Writer: Palmyra X5

Palmyra X5 is Writer's most advanced model, purpose-built for building and scaling AI agents across the enterprise. It delivers industry-leading speed and efficiency on context windows up to 1 million...

Anyone in the Space can @-mention Writer: Palmyra X5 with the team's shared context - pooled credits, one chat, one memory.

All models

Starter is free forever - 1 Space, 100 credits/month, 1 MCP. No card.

Verdict

Palmyra X5 delivers a massive 1M+ token context window at $0.60/$6.00 per Mtok, making it the go-to choice for processing entire codebases, legal document sets, or multi-hour transcripts in a single pass. The pricing undercuts most frontier models on input while keeping output costs reasonable. Without public benchmarks, you're trading proven performance data for context capacity and cost efficiency. Reach for this when your task is bottlenecked by context length, not reasoning complexity.

Best for

Processing entire codebases in one context
Multi-document legal analysis and synthesis
Long-form transcript summarization
Cost-sensitive high-volume text processing
Research literature review across dozens of papers

Strengths

The 1.04M token context window handles workloads that force other models into chunking strategies — ingest a full repository, compare 50 contracts, or analyze a day's worth of meeting transcripts without losing cross-reference accuracy. Input pricing at $0.60/Mtok runs 40-80% cheaper than Claude or GPT-4 class models, making it viable for batch processing tasks that would otherwise blow budgets. Output at $6.00/Mtok stays competitive when you need concise summaries from massive inputs.

Trade-offs

No public benchmark scores means you can't compare reasoning quality, code generation accuracy, or instruction-following against Sonnet, GPT-4o, or Gemini before committing. Early-stage models from smaller vendors sometimes lag on nuanced tasks like multi-step logic, creative writing, or handling ambiguous instructions. The lack of multimodal support limits use cases to pure text — no vision, no audio. If your task needs proven performance on MMLU, HumanEval, or MT-Bench, you'll want established alternatives with track records.

Specifications

Provider: writer
Category: llm
Context length: 1,040,000 tokens
Max output: 8,192 tokens
Modalities: text
License: proprietary
Released: 2026-01-21

Pricing

Input: $0.60/Mtok
Output: $6.00/Mtok
Model ID: writer/palmyra-x5

Per-token prices show what the model costs upstream. On Switchy your team draws from one shared org credit pool - one plan, one balance for everyone.

Team cost calculator

Seats5 peopleMessages / seat / day80Avg turn size2 ktokOutput share30 %

Estimated monthly spend

$39.07

17.6M tokens / month
5 seats · 80 msgs/day

Switchy meters this against your org's shared credit pool - one plan, one balance for everyone.

Providers

Provider	Context	Input	Output	P50 latency	Throughput	30d uptime
writer	1040k	$0.60/Mtok	$6.00/Mtok	—	—	—

Performance

Performance snapshots are collected daily. Check back after the next ingestion run.

Benchmarks

Public benchmark scores are not available yet for this model. Check back after the next ingestion run.

Works well with

Top MCPs

Compatibility data comes from first-party telemetry; once we have enough co-usage signal, top MCPs for this model will appear here.

How Switchy teams use it

Not enough Spaces have used this model yet to share anonymised team stats. We wait for at least 50 distinct Spaces per week before publishing any aggregate.

Starter prompts

Codebase Architecture Summary

I'm pasting our complete application codebase below. Generate an architecture summary covering: (1) main components and their responsibilities, (2) data flow between modules, (3) external dependencies, and (4) three highest-priority refactoring opportunities. Focus on structural insights, not line-by-line review.

Open in a Space →

Multi-Contract Comparison

Below are 30 vendor contracts from the past two years. Create a comparison table showing: (1) payment terms, (2) liability caps, (3) termination clauses, and (4) any non-standard provisions. Then list the three contracts with the most favorable terms and explain why.

Open in a Space →

Research Literature Synthesis

I'm providing 25 recent papers on [topic]. Synthesize the findings into: (1) three points of broad consensus, (2) two major contradictions or debates, (3) three under-researched areas, and (4) the most-cited methodological approach. Cite paper titles when referencing specific claims.

Open in a Space →

Full-Day Meeting Analysis

Below are transcripts from today's six back-to-back meetings. Extract: (1) all decisions made with owner names, (2) action items with deadlines, (3) open questions that need follow-up, and (4) recurring themes across conversations. Organize by meeting, then provide a consolidated priority list.

Open in a Space →

Customer Feedback Clustering

I'm pasting 500 customer support tickets from Q4. Cluster them into thematic categories, then for each category provide: (1) percentage of total volume, (2) average sentiment score, (3) most common specific complaint, and (4) one actionable product change to address it.

Open in a Space →

Example outputs

Illustrative - representative of the model's voice and quality, not literal recordings.

Prompt

Review this product requirements doc for a new checkout flow and flag any ambiguities or missing edge cases before we start implementation.

Output

The PRD clearly defines the happy path but leaves three critical gaps. First, the payment retry logic doesn't specify timeout behavior when the gateway is unresponsive — should we fail immediately or queue for async processing? Second, the inventory reservation section assumes atomic stock checks, but doesn't address race conditions when multiple users checkout simultaneously. Third, the error messaging requirements say 'user-friendly' without defining tone or specificity level for PCI-related failures. I'd also recommend explicit handling for partial refunds in the returns section, since the current spec only covers full-order cancellations.

Notes

This example highlights Palmyra X5's strength in structured document analysis across its massive 1M+ token context window — it can ingest entire PRDs, codebases, or legal documents in one pass. The output demonstrates practical business reasoning rather than generic advice. Trade-off: at $6/Mtok output pricing, verbose responses on large documents add up quickly compared to models with cheaper generation costs.

Prompt

Generate a Python function that takes a list of transaction records and returns summary statistics grouped by merchant category, handling missing data and currency conversion.

Output

```python from collections import defaultdict from decimal import Decimal def summarize_transactions(records, exchange_rates=None): """Group transactions by merchant category with stats. Args: records: List of dicts with 'amount', 'currency', 'category' exchange_rates: Optional dict for USD conversion rates """ if exchange_rates is None: exchange_rates = {} stats = defaultdict(lambda: {'count': 0, 'total_usd': Decimal('0')}) for record in records: category = record.get('category', 'uncategorized') amount = Decimal(str(record.get('amount', 0))) currency = record.get('currency', 'USD') rate = exchange_rates.get(currency, Decimal('1')) stats[category]['count'] += 1 stats[category]['total_usd'] += amount * rate return dict(stats) ```

Notes

Palmyra X5 produces working code with sensible defaults and defensive programming patterns — note the explicit Decimal handling and graceful missing-data fallbacks. The function signature and docstring show enterprise-grade habits. However, the model doesn't automatically suggest optimizations like vectorization or database-level aggregation that a specialized code model might flag for production scale.

Prompt

Write a 200-word executive summary of our Q3 performance for the board deck, emphasizing the EMEA expansion results and the shift in customer acquisition cost trends.

Output

Q3 marked a strategic inflection point for our EMEA operations, with revenue growing 47% year-over-year to €8.2M — now representing 22% of total company revenue versus 14% in Q2. The Frankfurt and London hubs reached profitability two quarters ahead of plan, driven by enterprise contract wins in financial services (€3.1M in new ARR). Customer acquisition cost declined 31% quarter-over-quarter to €1,840 per logo, reflecting improved channel partner performance and organic inbound growth from our September product launch. However, this efficiency gain came with a trade-off: average contract value in EMEA remains 18% below our North American benchmark, suggesting we're capturing mid-market accounts faster than enterprise deals. Net revenue retention held at 118%, consistent with Q2, indicating healthy expansion motion within the existing base. Looking forward, the pipeline suggests Q4 EMEA bookings will exceed €12M if current close rates hold, positioning the region to hit our 25% revenue mix target by year-end.

Notes

This example demonstrates Palmyra X5's ability to synthesize quantitative data into narrative form with appropriate executive framing — balancing optimism with honest trade-off acknowledgment. The 1M+ token context means it could generate this from raw spreadsheets and meeting notes in a single prompt. The writing is direct and metric-dense, though some users may prefer more varied sentence structure for presentation decks.

Use-case deep-dives

Enterprise document synthesis

When Palmyra X5 handles multi-document policy review at scale

A 12-person compliance team at a mid-market fintech needs to cross-reference regulatory filings against internal policies every quarter—typically 400-600 pages per review cycle. Palmyra X5's 1.04M token context window means you can load entire policy manuals, state filings, and audit trails in a single prompt without chunking or retrieval overhead. At $0.60 per million input tokens, a 500K-token review costs $0.30 in input fees, making it viable for recurring quarterly work where GPT-4 Turbo's smaller window would force multiple calls and higher coordination costs. The trade-off: output runs $6.00 per Mtok, so if your team generates 50K+ token summaries per review, Claude 3.5 Sonnet becomes cheaper on the output side. Use Palmyra X5 when input volume dominates and you need everything in one context.

Legal contract comparison

Why Palmyra X5 wins for side-by-side vendor agreement analysis

A 4-person procurement team at a SaaS company reviews 20-30 vendor contracts per month, each 40-80 pages, looking for liability caps, indemnification clauses, and renewal terms that deviate from their standard template. Palmyra X5 lets you load the master template plus three vendor drafts (200K tokens total) in one call, then ask for a clause-by-clause diff without managing a vector database or multi-step retrieval flow. The $0.60 input rate means a 200K-token comparison costs $0.12, and the model returns structured tables or JSON at $6.00/Mtok output—reasonable when your output is a 5K-token summary table. If you're doing this daily at higher volume, the output cost adds up; at 100+ contracts/month, consider a fine-tuned smaller model. For 20-30/month with complex cross-referencing, Palmyra X5 is the cleanest path.

Research literature review

When Palmyra X5 accelerates academic meta-analysis for small labs

A 3-person biotech research team needs to synthesize findings from 40 recent papers on CRISPR delivery mechanisms—each paper is 8-12K tokens of full text. Palmyra X5's context window fits all 40 papers (400K tokens) in one prompt, letting the team ask cross-study questions like 'which delivery vectors showed >70% efficiency in vivo?' without pre-processing or embeddings infrastructure. At $0.60 per million input tokens, loading 400K tokens costs $0.24, and a 10K-token synthesis runs $0.06 in output fees—trivial for a quarterly literature review. The caveat: without public benchmarks, you're trusting Writer's internal evals for scientific accuracy. If citation precision is mission-critical, validate outputs against a known-good model on a 10-paper subset before committing to the full workflow. For exploratory synthesis where speed matters more than perfection, Palmyra X5 delivers.

Frequently asked

Is Palmyra X5 good for long document analysis?

Yes. With a 1.04M token context window, Palmyra X5 handles entire codebases, legal contracts, or research papers in a single prompt. That's roughly 780,000 words — enough for most enterprise document workflows without chunking or retrieval systems. The massive context makes it ideal for summarization, Q&A, and cross-referencing tasks where you need the full document in memory.

Is Palmyra X5 cheaper than GPT-4o or Claude Sonnet?

Input costs are competitive at $0.60/Mtok (vs GPT-4o's $2.50 and Sonnet 4's $3.00), but output is expensive at $6.00/Mtok — double GPT-4o and triple Sonnet 4. For read-heavy tasks like document analysis or search, Palmyra X5 wins on price. For generation-heavy work like writing or coding, you'll pay more per response than alternatives.

Can Palmyra X5 handle the full 1M+ context reliably?

Writer claims 1.04M tokens, but without public benchmarks like RULER or needle-in-haystack tests, real-world performance is unverified. Most models degrade past 200-300K tokens despite advertised limits. Test your specific use case at scale before committing — especially for tasks requiring precise retrieval from the middle of long contexts.

How does Palmyra X5 compare to GPT-4 Turbo for enterprise use?

Palmyra X5 offers 4x the context window of GPT-4 Turbo (1.04M vs 128K) and cheaper input costs, making it better for document-heavy workflows. However, GPT-4 Turbo has proven benchmarks across reasoning, coding, and instruction-following. Without public eval scores, Palmyra X5 is a gamble unless Writer's private benchmarks align with your specific domain.

Should I use Palmyra X5 for customer-facing chatbots?

Only if you need the massive context window for session history or knowledge retrieval. The $6.00/Mtok output cost makes conversational AI expensive — a 500-token response costs 0.3 cents, adding up fast at scale. For standard chat without long-context needs, GPT-4o mini ($0.60 output) or Sonnet 4 ($15 output) offer better economics and proven performance.