Cohere: North Mini Code (free)
North Mini Code is Cohere's first agentic coding model and the debut of its North family. A sparse mixture-of-experts model with 30B total parameters and 3B active, it is optimized...
Anyone in the Space can @-mention Cohere: North Mini Code (free) with the team's shared context - pooled credits, one chat, one memory.
Starter is free forever - 1 Space, 100 credits/month, 1 MCP. No card.
Verdict
Best for
- Zero-cost code prototyping and experimentation
- Large codebase analysis on tight budgets
- Learning prompt engineering without API costs
- Internal tools with unpredictable usage spikes
Strengths
The 256K context window lets you pass small-to-medium repositories or multi-file codebases in a single request, which often removes the need for chunking strategies. Zero per-token pricing eliminates cost anxiety during development cycles: you can iterate freely without watching token meters. The code-specific tuning should handle syntax-aware tasks such as function extraction and basic refactoring competently enough for internal tooling or educational use cases.
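Whether a given repository actually fits in one request depends on its size, so a quick estimate can help before sending anything. The following is a minimal sketch, not Cohere's or Switchy's tooling: the 4-characters-per-token ratio is a rough assumption (the model's real tokenizer will differ), the file extensions are illustrative, and reserving the full 64,000-token max output inside the 256,000-token window is a conservative assumption about how the limits interact.

```python
import os

CONTEXT_TOKENS = 256_000     # context length listed under Specifications
MAX_OUTPUT_TOKENS = 64_000   # max output listed under Specifications
CHARS_PER_TOKEN = 4          # assumption: rough average, not the real tokenizer


def estimate_tokens(text: str) -> int:
    """Rough token estimate: ceiling of len(text) / CHARS_PER_TOKEN."""
    return -(-len(text) // CHARS_PER_TOKEN)


def repo_token_estimate(root: str, extensions=(".py", ".js", ".ts", ".md")) -> int:
    """Sum the rough token estimates of all matching files under root."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if name.endswith(extensions):
                path = os.path.join(dirpath, name)
                with open(path, encoding="utf-8", errors="ignore") as fh:
                    total += estimate_tokens(fh.read())
    return total


def fits_in_context(prompt_tokens: int, reserved_output: int = MAX_OUTPUT_TOKENS) -> bool:
    """Conservative check: prompt plus reserved output must fit the window."""
    return prompt_tokens + reserved_output <= CONTEXT_TOKENS
```

Calling `fits_in_context(repo_token_estimate("path/to/repo"))` gives a first-pass answer; if it returns `False`, some file selection or chunking is still needed.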
Trade-offs
Free model tiers typically lag paid ones on complex reasoning, nuanced code generation, and edge-case handling. Without public benchmarks, it's unclear how North Mini Code stacks up against GPT-4o-mini or Claude Haiku on standardized coding tests like HumanEval. Expect slower responses or stricter rate limits than on paid endpoints. The proprietary license means you can't self-host or fine-tune, which limits flexibility for teams that want model customization.
Specifications
- Provider: cohere
- Category: llm
- Context length: 256,000 tokens
- Max output: 64,000 tokens
- Modalities: text
- License: proprietary
- Released: 2026-06-17
Pricing
- Input: $0.00/Mtok
- Output: $0.00/Mtok
- Model ID: cohere/north-mini-code:free
Per-token prices show what the model costs upstream. On Switchy your team draws from one shared org credit pool - one plan, one balance for everyone.
Team cost calculator
5 seats · 80 msgs/day
Switchy meters this against your org's shared credit pool - one plan, one balance for everyone.
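The arithmetic behind an upstream per-token estimate for a team like this can be sketched as follows. This is an illustration under stated assumptions, not Switchy's calculator: the tokens-per-message figures and the 30-day month are hypothetical inputs, and the result is the upstream token cost only, since the page does not state how Switchy credits map to tokens.

```python
def monthly_usage(seats: int, msgs_per_day: int,
                  input_tokens_per_msg: int, output_tokens_per_msg: int,
                  input_price_per_mtok: float, output_price_per_mtok: float,
                  days: int = 30) -> dict:
    """Estimate monthly message volume, token volume, and upstream cost in USD."""
    messages = seats * msgs_per_day * days
    input_tokens = messages * input_tokens_per_msg
    output_tokens = messages * output_tokens_per_msg
    # Prices are quoted per million tokens (Mtok).
    cost = (input_tokens / 1_000_000) * input_price_per_mtok \
         + (output_tokens / 1_000_000) * output_price_per_mtok
    return {
        "messages": messages,
        "input_tokens": input_tokens,
        "output_tokens": output_tokens,
        "cost_usd": cost,
    }


# 5 seats, 80 msgs/day, at this model's listed $0.00/Mtok prices.
# 2,000 input and 500 output tokens per message are assumed values.
usage = monthly_usage(5, 80, 2_000, 500, 0.00, 0.00)
```

With those assumed inputs the team sends 12,000 messages a month, roughly 24 million input tokens and 6 million output tokens, at an upstream token cost of $0.00.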
Providers
| Provider | Context | Input | Output | P50 latency | Throughput | 30d uptime |
|---|---|---|---|---|---|---|
| cohere | 256k | $0.00/Mtok | $0.00/Mtok | — | — | — |
Works well with
Top MCPs
Compatibility data comes from first-party telemetry; once we have enough co-usage signal, top MCPs for this model will appear here.
How Switchy teams use it
Starter prompts
Codebase Documentation Generator
Review the following codebase files and generate a README with sections for architecture overview, key modules, and setup instructions. Focus on clarity for new contributors.
Multi-File Refactor Suggestions
Analyze these Python files for code duplication, unused imports, and opportunities to extract shared utilities. Provide specific line-number recommendations.
Dependency Conflict Resolver
Given this package.json and yarn.lock, identify version conflicts and suggest compatible dependency ranges that resolve the issues.
API Endpoint Mapper
Parse these Express.js route files and generate an API reference table with endpoints, HTTP methods, required parameters, and response shapes.
Test Coverage Gap Finder
Compare these source modules with the test files and list functions or branches that lack test coverage. Prioritize by complexity.
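Each starter prompt above expects the relevant files to be supplied alongside the instruction. One way to package them into a single message is sketched below. The `build_prompt` helper and the `--- FILE: ... ---` delimiter are hypothetical conventions chosen for illustration, not a format required by the model or by Switchy.

```python
def build_prompt(instruction: str, files: dict[str, str]) -> str:
    """Combine an instruction with file contents, one labeled block per file.

    `files` maps a path (used only as a label) to that file's text.
    Paths are sorted so the same inputs always produce the same prompt.
    """
    parts = [instruction.strip(), ""]
    for path in sorted(files):
        parts.append(f"--- FILE: {path} ---")   # hypothetical delimiter
        parts.append(files[path].rstrip("\n"))
        parts.append("")
    return "\n".join(parts)


prompt = build_prompt(
    "Analyze these Python files for code duplication and unused imports.",
    {"b.py": "y = 2\n", "a.py": "x = 1\n"},
)
```

Labeling each file with its path lets the model refer back to specific files in its answer, which matters for prompts that ask for line-number recommendations or per-module findings.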