LLMgryphe

MythoMax 13B

One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. #merge

Anyone in the Space can @-mention MythoMax 13B with the team's shared context - pooled credits, one chat, one memory.

All models

Starter is free forever - 1 Space, 100 credits/month, 1 MCP. No card.

Verdict

MythoMax 13B is a creative writing specialist built by merging two storytelling-focused models. It excels at generating fiction, dialogue, and roleplay scenarios with consistent character voices and narrative flow. The 4K context window limits long-form work, and lack of public benchmarks means you're flying blind on reasoning tasks. Reach for this when you need imaginative prose on a tight budget and can work within short contexts.

Best for

  • Creative fiction and storytelling
  • Character-driven dialogue generation
  • Roleplay scenario development
  • Budget-conscious narrative tasks

Strengths

MythoMax merges MythoLogic and Huginn models to prioritize narrative coherence and character consistency. At $0.06/Mtok both ways, it undercuts GPT-4 class models by 500x while maintaining strong creative output. The 13B parameter count keeps inference fast without sacrificing the nuanced prose that larger creative models deliver. Users report it handles multi-character scenes and maintains distinct voices better than base Llama models at this size.

Trade-offs

The 4096-token window chokes on anything beyond short stories or single scenes — you cannot feed it novel chapters or maintain long conversation threads. No public benchmark data exists for reasoning, math, or code tasks, so performance outside creative writing is unverified. The proprietary license restricts commercial deployment options compared to open alternatives like Llama 2 13B. Expect weaker factual accuracy than instruction-tuned models.

Specifications

Provider
gryphe
Category
llm
Context length
4,096 tokens
Max output
4,096 tokens
Modalities
text
License
proprietary
Released
2023-07-02

Pricing

Input
$0.06/Mtok
Output
$0.06/Mtok
Model ID
gryphe/mythomax-l2-13b

Per-token prices show what the model costs upstream. On Switchy your team draws from one shared org credit pool - one plan, one balance for everyone.

Team cost calculator

Estimated monthly spend
$1.06
17.6M tokens / month
5 seats · 80 msgs/day

Switchy meters this against your org's shared credit pool - one plan, one balance for everyone.

Providers

ProviderContextInputOutputP50 latencyThroughput30d uptime
gryphe4k$0.06/Mtok$0.06/Mtok

Performance

Performance snapshots are collected daily. Check back after the next ingestion run.

Benchmarks

Public benchmark scores are not available yet for this model. Check back after the next ingestion run.

Works well with

Top MCPs

Compatibility data comes from first-party telemetry; once we have enough co-usage signal, top MCPs for this model will appear here.

How Switchy teams use it

Not enough Spaces have used this model yet to share anonymised team stats. We wait for at least 50 distinct Spaces per week before publishing any aggregate.

Starter prompts

Character Dialogue Scene

Write a tense dialogue between a skeptical detective and a charming suspect in a dimly lit interrogation room. Show their personalities through word choice and rhythm, not exposition.
Open in a Space →

Fantasy World Building

Describe a bustling marketplace in a desert city where magic and technology coexist. Focus on sights, sounds, and the tension between tradition and progress.
Open in a Space →

Short Story Opening

Write the opening 300 words of a science fiction story where a ship's AI begins questioning its programming. Establish mood and stakes immediately.
Open in a Space →

Character Backstory

Create a detailed backstory for a retired thief who now runs a bookshop. Include a formative event that explains why they left their old life.
Open in a Space →

Roleplay Response

You are a cunning merchant in a medieval fantasy setting. A customer asks about a mysterious amulet in your shop. Respond in character, hinting at secrets without revealing too much.
Open in a Space →

Example outputs

Illustrative - representative of the model's voice and quality, not literal recordings.

Prompt

Write a short fantasy scene where a merchant tries to sell a cursed amulet to an unsuspecting traveler. Make it atmospheric and include subtle foreshadowing.

Output

The merchant's stall hunched between two crumbling buildings like a secret waiting to be told. Candles guttered in brass holders, casting shadows that seemed to move independently of their flames. "This amulet," the merchant whispered, his fingers tracing the tarnished silver, "belonged to the Duchess of Ashenmoor. See how it catches the light?" The traveler leaned closer. The pendant did catch light—but seemed to absorb it too, creating a darkness at its center that pulsed faintly. "Beautiful craftsmanship," the merchant continued, though his eyes never quite met the traveler's. "The duchess wore it every day until... well, until she didn't need it anymore."

Notes

MythoMax excels at creative fiction with rich atmospheric detail and character voice. This example shows its strength in genre writing—particularly fantasy and roleplay scenarios—where it maintains consistent tone and weaves in narrative elements naturally. The 4096-token context means longer stories require careful chapter breaks.

Prompt

I'm running a D&D campaign and need three unique tavern NPCs with distinct personalities, motivations, and a secret each. Make them memorable but not clichéd.

Output

**Merra Copperwhisper** - The tavern's cook, a halfling who hums constantly while working. She's genuinely kind but obsessively collects rumors, writing them in a cipher only she understands. Secret: She's mapping every criminal network in the city, planning to sell the information to the highest bidder when her daughter's medical debts come due. **Thorne** - A scarred human bouncer who speaks in single sentences. Respected for fairness, feared for follow-through. Secret: He's illiterate and terrified someone will discover it, especially the bard he's falling for. **"Lucky" Vex** - An elderly gnome who claims to have survived six dragon encounters. Loud, drinks too much, tips poorly. Secret: Every story is true, and he's dying from a curse that only activates when he's genuinely happy.

Notes

This showcases MythoMax's origin as a roleplay-focused merge. It generates distinctive character voices and hooks that invite player interaction. The model understands narrative structure for tabletop gaming contexts. However, at 13B parameters, it sometimes relies on familiar archetypes when pushed for volume—quality drops after generating 5-6 NPCs in one session.

Prompt

Rewrite this customer service email to sound more empathetic while keeping the same information: 'Your refund request has been denied because you exceeded the 30-day return window. Policy is on our website.'

Output

Thank you for reaching out about your recent purchase. I understand how frustrating it is when something doesn't work out as expected. I've reviewed your account and can see that your purchase was made 37 days ago. Our return policy allows refunds within 30 days of purchase, which means your request falls just outside that window. I realize this isn't the answer you were hoping for. While I'm unable to process a refund in this case, I'd like to explore other options that might help. Would you be open to discussing store credit or exchanging the item for something else? I'm here to help find a solution that works for you.

Notes

MythoMax handles tone-shifting tasks competently, adding warmth and structure to blunt messages. It understands emotional context and can reframe information diplomatically. The output is natural and professional. That said, this isn't the model's primary strength—dedicated instruction-tuned models will produce more consistent business writing with fewer runs needed to get the right formality level.

Use-case deep-dives

Creative fiction brainstorming

When MythoMax 13B handles collaborative story development for small writing teams

A 4-person indie game studio needs to generate NPC dialogue and quest branches without burning budget on frontier models. MythoMax 13B at $0.06/Mtok each direction makes it viable to iterate 50+ dialogue trees per sprint—you're looking at roughly $3-5/month even with heavy use. The 4096-token window covers most single-scene exchanges but forces you to chunk longer narrative arcs, which actually helps maintain coherent character voices across branches. This model was fine-tuned specifically for creative and roleplay scenarios, so it handles tone shifts and character consistency better than general-purpose alternatives at this price point. If your team is drafting more than 200k tokens/day or needs 8k+ context for novel-length continuity, you'll hit the window ceiling fast and should budget up to a larger context model.

Internal chatbot prototyping

MythoMax 13B as the fast-iteration layer for conversational UI testing

A 3-person SaaS team is prototyping a customer onboarding bot and needs to test 15-20 conversation flows per week before committing to a production model. At $0.06/Mtok symmetrical pricing, MythoMax 13B lets you run hundreds of test conversations for under $2, making it a low-friction sandbox for prompt engineering and flow logic. The 4096-token context handles typical onboarding exchanges (usually 8-12 turns) without truncation, and the model's conversational fine-tuning means responses feel natural enough to validate UX assumptions. You're trading benchmark performance for iteration speed—this isn't the model you'd deploy to production for a fintech app, but it's exactly right for the "does this flow make sense" phase. Once you've locked the conversation structure, migrate to a benchmark-proven model for the live deployment.

Tabletop RPG session assistance

Why MythoMax 13B works for real-time dungeon master support at the table

A weekly D&D group uses Switchy during sessions to generate NPC reactions, improvised plot hooks, and environmental descriptions on the fly. MythoMax 13B's creative fine-tuning and $0.06/Mtok pricing mean the DM can fire off 40-60 prompts per 4-hour session (roughly $0.50-1.00 total cost) without worrying about budget. The 4096-token window holds the current scene context plus 2-3 recent player actions, which is enough for coherent in-the-moment responses but not long campaign continuity—you'll need to manually feed in character backstories each session. The model's roleplay training makes it better at maintaining fantasy tone and NPC personality than general-purpose alternatives, and sub-second response times keep the game moving. If you're running a podcast-quality actual-play show where narrative consistency across 30+ episodes matters, you need a larger context window and likely a reasoning-focused model instead.

Frequently asked

Is MythoMax 13B good for creative writing and roleplay?

Yes. MythoMax 13B was specifically tuned for creative writing, storytelling, and character roleplay scenarios. It's a merge of MythoLogic and Huginn models optimized for narrative coherence and character consistency. The 4096-token context window handles most short-form fiction and multi-turn conversations without issue.

Is MythoMax 13B cheaper than GPT-3.5 Turbo?

Yes, significantly. At $0.06 per Mtok for both input and output, MythoMax costs about 25x less than GPT-3.5 Turbo's $1.50 output pricing. For high-volume creative applications where you need consistent character voices without bleeding-edge reasoning, the cost difference adds up fast.

Can MythoMax 13B handle technical documentation or code?

No, use a code-specialized model instead. MythoMax is a creative-writing merge with no particular coding benchmarks. For technical tasks, you'll get better results from CodeLlama 13B or Phind-CodeLlama at similar parameter counts, or switch to a frontier model if accuracy matters.

How does MythoMax 13B compare to newer Llama 3 models?

MythoMax is a Llama 2-based merge from 2023, so it lacks the instruction-following improvements and broader knowledge of Llama 3. If you need factual accuracy or complex reasoning, use Llama 3 8B. Stick with MythoMax only if you specifically want its creative-writing tuning and can tolerate older training data.

Should I use MythoMax 13B for a chatbot with long conversations?

Only if conversations stay under 4096 tokens total. The context window is small by 2024 standards—about 3000 words including both history and response. For customer support or extended dialogues, you'll hit the limit quickly. Consider GPT-4o-mini or Mistral models with 32k+ windows instead.

Data last verified 8 hours ago.Sources aggregated hourly to weekly. See docs/architecture/model-directory.md.