LLMpoolside

Poolside: Laguna XS.2 (free)

Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai), their efficient coding agent series. It combines tool calling and reasoning capabilities with a compact footprint, offering...

Anyone in the Space can @-mention Poolside: Laguna XS.2 (free) with the team's shared context - pooled credits, one chat, one memory.

All models

Starter is free forever - 1 Space, 100 credits/month, 1 MCP. No card.

Verdict

Laguna XS.2 is Poolside's free-tier code model with a 262K context window, making it viable for reading entire codebases without cost. Performance benchmarks aren't public yet, so expect this to lag behind established code models like Claude or GPT-4o on complex reasoning tasks. Reach for it when you need zero-cost code exploration across large repositories or when prototyping workflows before committing to paid tiers.

Best for

Zero-cost codebase exploration
Large repository context analysis
Prototyping code workflows before scaling
Learning prompts for code tasks

Strengths

The 262K context window handles entire medium-sized codebases in a single prompt, and the zero-dollar pricing removes friction for experimentation. This makes it practical for students, open-source maintainers, or teams validating use cases before budget allocation. The model is purpose-built for code, so it understands syntax and structure better than general-purpose free alternatives.

Trade-offs

Without public benchmarks, you're flying blind on accuracy relative to Claude Sonnet or GPT-4o. Expect weaker performance on multi-step reasoning, refactoring across files, or generating production-ready code from vague requirements. The proprietary license means no self-hosting or fine-tuning. Free tiers often come with rate limits or deprioritized inference, though specifics aren't documented here.

Specifications

Provider: poolside
Category: llm
Context length: 262,144 tokens
Max output: 32,768 tokens
Modalities: text
License: proprietary
Released: 2026-04-28

Pricing

Input: $0.00/Mtok
Output: $0.00/Mtok
Model ID: poolside/laguna-xs.2:free

Per-token prices show what the model costs upstream. On Switchy your team draws from one shared org credit pool - one plan, one balance for everyone.

Team cost calculator

Seats5 peopleMessages / seat / day80Avg turn size2 ktokOutput share30 %

Estimated monthly spend

Freeno token cost

17.6M tokens / month
5 seats · 80 msgs/day

Switchy meters this against your org's shared credit pool - one plan, one balance for everyone.

Providers

Provider	Context	Input	Output	P50 latency	Throughput	30d uptime
poolside	262k	$0.00/Mtok	$0.00/Mtok	—	—	—

Performance

Performance snapshots are collected daily. Check back after the next ingestion run.

Benchmarks

Public benchmark scores are not available yet for this model. Check back after the next ingestion run.

Works well with

Top MCPs

Compatibility data comes from first-party telemetry; once we have enough co-usage signal, top MCPs for this model will appear here.

How Switchy teams use it

Not enough Spaces have used this model yet to share anonymised team stats. We wait for at least 50 distinct Spaces per week before publishing any aggregate.

Starter prompts

Map Repository Structure

Analyze this repository and describe its architecture: list the main modules, their responsibilities, and how data flows between them. Focus on the top-level structure, not implementation details.

Open in a Space →

Find Cross-File Dependencies

Trace all dependencies for the `processPayment` function across these files. Show which modules call it, what data it requires, and any side effects it triggers.

Open in a Space →

Explain Legacy Code Block

Explain what this code block does, why it might be structured this way, and any potential issues or improvements you'd suggest. Assume I'm unfamiliar with this part of the codebase.

Open in a Space →

Generate Test Cases from Code

Write unit tests for this function. Cover the happy path, edge cases, and error conditions. Use the testing framework already present in the repo.

Open in a Space →

Refactor for Readability

Refactor this code to improve readability and maintainability. Keep the same functionality but simplify logic, improve naming, and add comments where helpful.

Open in a Space →

Example outputs

Illustrative - representative of the model's voice and quality, not literal recordings.

Prompt

Review this Python function for potential bugs. It's supposed to merge two sorted lists but users report incorrect output when one list is empty.

Output

The function likely fails to handle the edge case where one input list is empty from the start. When `list_a` or `list_b` is `[]`, the while loop condition `while i < len(list_a) and j < len(list_b)` immediately evaluates false, skipping the merge logic entirely. The remaining elements from the non-empty list never get appended. Add these lines after your main loop: `result.extend(list_a[i:])` and `result.extend(list_b[j:])` to capture any leftover elements from either list.

Notes

Laguna XS.2 identifies the root cause quickly and provides a concrete fix. The 131k token context window means you can paste entire modules for review. As a free tier model, response latency may vary during peak hours, and complex architectural questions might need more guidance than single-function debugging.

Prompt

Generate a SQL query to find customers who made purchases in both Q1 and Q2 2024 but not Q3, joining orders and customers tables.

Output

```sql SELECT DISTINCT c.customer_id, c.name FROM customers c INNER JOIN orders o ON c.customer_id = o.customer_id WHERE o.order_date BETWEEN '2024-01-01' AND '2024-06-30' AND c.customer_id NOT IN ( SELECT customer_id FROM orders WHERE order_date BETWEEN '2024-07-01' AND '2024-09-30' ) GROUP BY c.customer_id, c.name HAVING COUNT(DISTINCT QUARTER(o.order_date)) = 2; ``` This finds customers with orders in exactly two quarters (Q1 and Q2) while excluding anyone who ordered in Q3.

Notes

The model produces syntactically correct SQL with appropriate join logic and subquery structure. The HAVING clause correctly enforces the two-quarter requirement. However, the query assumes MySQL's QUARTER() function—dialect-specific syntax isn't always flagged. For production queries involving large tables, you'd want to discuss indexing strategy separately.

Prompt

Explain how React's useEffect cleanup function works and when I should return one. Give a practical example with event listeners.

Output

The cleanup function in `useEffect` runs before the component unmounts or before the effect re-runs (if dependencies change). Return a cleanup function whenever your effect creates a subscription, timer, or listener that needs manual teardown. Example: ```javascript useEffect(() => { const handleResize = () => setWidth(window.innerWidth); window.addEventListener('resize', handleResize); return () => window.removeEventListener('resize', handleResize); }, []); ``` Without the cleanup, each re-render would add another listener, causing memory leaks. The returned function removes the listener when the component unmounts or before adding a new one.

Notes

Laguna XS.2 explains the concept clearly with a realistic code sample that demonstrates both the problem and solution. The empty dependency array detail shows attention to common pitfalls. Being a free model, it's well-suited for learning scenarios and documentation tasks, though advanced framework patterns (Suspense, Server Components) may require follow-up prompts for depth.

Use-case deep-dives

Prototype code generation testing

When free 131K context beats paid models for early-stage dev

A 4-person startup building an MVP needs to test code generation workflows without burning runway on API costs. Laguna XS.2 gives you 131,072 tokens of context at $0.00/Mtok—enough to feed entire module codebases and iterate on architectural decisions without watching the meter. The lack of public benchmarks means you're flying blind on accuracy versus GPT-4 or Claude, but for throwaway prototypes where you're validating ideas rather than shipping production code, free context at this scale changes the math. Run 500 generation attempts in a weekend to find the right prompt structure, then graduate to a paid model once you know what works. If you need guaranteed correctness or you're past the experimentation phase, switch to a benchmarked alternative.

Student project documentation review

Free long-context analysis for academic teams with zero budget

A university capstone team of 8 students needs to review 40-page research papers and generate structured summaries for their thesis work. Laguna XS.2's 131K token window handles full academic papers in one shot, and the zero-cost pricing means the team can process hundreds of documents without grant funding or credit card limits. You're trading benchmark transparency for accessibility—there's no HumanEval score or MMLU number to validate quality, so expect to manually verify outputs against source material. For literature reviews, citation extraction, and rough-draft summarization where a human is always in the loop, this model removes the financial barrier. Once the project secures funding or moves to publication-quality work, migrate to a model with published accuracy metrics.

Internal chatbot cost exploration

Testing conversational AI economics before committing to paid tiers

A 12-person operations team wants to build a Slack bot that answers FAQ questions from a 60-page employee handbook, but leadership won't approve a $200/month AI budget without proof of concept. Laguna XS.2 lets you load the entire handbook into context and run 2 weeks of real Slack traffic at zero cost to measure adoption and refine prompts. The missing benchmarks mean you can't predict hallucination rates or compare reasoning quality to GPT-3.5, so you'll need to spot-check answers manually during the trial. If the bot gets 50+ queries/day and the team trusts it, the usage data justifies upgrading to a paid model with published safety scores. If adoption stays under 20 queries/day, you've validated the idea without spending a dollar.

Frequently asked

Is Poolside Laguna XS.2 good for coding tasks?

Laguna XS.2 is built by Poolside specifically for code generation and understanding, so yes — it handles typical coding workflows well. With 131k context window, it can process entire codebases for refactoring or debugging. No public benchmarks exist yet, so you're testing blind compared to established models like GPT-4 or Claude. The free pricing makes experimentation risk-free.

How does Laguna XS.2 pricing compare to other code models?

It's completely free — $0 per million tokens in and out. That undercuts every commercial alternative including GPT-4o ($2.50/$10), Claude Sonnet ($3/$15), and Codestral ($0.20/$0.60). The catch is lack of proven performance data and potential rate limits or availability constraints that free tiers typically impose. For prototyping or high-volume batch jobs, the economics are unbeatable.

Can Laguna XS.2 handle large codebases with its context window?

The 131k token window fits roughly 100k tokens of code after system prompts — enough for 3-5 medium Python files or a small module's worth of context. That's comparable to GPT-4 Turbo but half of Claude Opus 4's 256k. You can review substantial context, but monorepo-scale analysis still requires chunking strategies or retrieval augmentation.

Is Laguna XS.2 better than the previous Poolside models?

No prior Poolside model versions are documented in our data, so this appears to be an early public release. The "XS.2" naming suggests a smaller or experimental variant. Without benchmark history or a standard Laguna model for comparison, treat this as a first-generation offering. Monitor Poolside's releases for larger or more capable successors.

Should I use Laguna XS.2 for production code generation?

Only if you're comfortable with zero public performance validation and potential service instability. Free models often lack SLAs, uptime guarantees, or consistent response quality. Use it for internal tooling, experimentation, or cost-sensitive batch jobs where you can tolerate failures. For customer-facing features or mission-critical automation, pay for a model with proven reliability like GPT-4o or Codestral.