LLMnvidia

NVIDIA: Nemotron 3.5 Content Safety (free)

NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model from NVIDIA, fine-tuned from Google Gemma-3-4B. It moderates both inputs to and responses from LLMs and VLMs, accepting...

Anyone in the Space can @-mention NVIDIA: Nemotron 3.5 Content Safety (free) with the team's shared context — pooled credits, one chat, one memory.

All models

Starter is free forever — 1 Space, 100 credits/month, 1 MCP. No card.

Verdict

Nemotron 3.5 Content Safety is a specialized classifier that flags harmful content across text and images at zero cost. It returns structured safety scores rather than generative responses, making it ideal for pre-filtering user inputs or moderating outputs from other models. The 128K context window handles long documents, but you'll need to integrate its JSON responses into your own logic. Reach for this when you need free, high-throughput content moderation without the overhead of a full LLM.

Best for

Pre-filtering user inputs before LLM calls
Moderating generated content at scale
Flagging harmful images in uploads
Long-document safety scanning
Cost-sensitive moderation pipelines

Strengths

Zero-cost operation makes this viable for high-volume moderation where per-token pricing would be prohibitive. The 128K context window processes entire documents or conversation threads in one pass, avoiding chunking complexity. Multimodal support means you can screen both text prompts and image uploads through a single endpoint. Returns granular safety scores across categories rather than binary pass-fail, giving you control over threshold tuning.

Trade-offs

No public benchmarks available yet, so accuracy relative to OpenAI Moderation or Llama Guard remains unverified. As a classifier rather than a generative model, it cannot explain its decisions or suggest safer alternatives — you get scores, not reasoning. Proprietary license limits transparency into training data and decision boundaries. Early-stage offering means documentation and edge-case handling may lag more mature moderation APIs.

Specifications

Provider: nvidia
Category: llm
Context length: 128,000 tokens
Max output: 8,192 tokens
Modalities: text, image
License: proprietary
Released: 2026-06-04

Pricing

Input: $0.00/Mtok
Output: $0.00/Mtok
Model ID: nvidia/nemotron-3.5-content-safety:free

Per-token prices show what the model costs upstream. On Switchy your team draws from one shared org credit pool — one plan, one balance for everyone.

Team cost calculator

Seats5 peopleMessages / seat / day80Avg turn size2 ktokOutput share30 %

Estimated monthly spend

Freeno token cost

17.6M tokens / month
5 seats · 80 msgs/day

Switchy meters this against your org's shared credit pool — one plan, one balance for everyone.

Providers

Provider	Context	Input	Output	P50 latency	Throughput	30d uptime
nvidia	128k	$0.00/Mtok	$0.00/Mtok	—	—	—

Performance

Performance snapshots are collected daily. Check back after the next ingestion run.

Benchmarks

Public benchmark scores are not available yet for this model. Check back after the next ingestion run.

Works well with

Top MCPs

Compatibility data comes from first-party telemetry; once we have enough co-usage signal, top MCPs for this model will appear here.

How Switchy teams use it

Not enough Spaces have used this model yet to share anonymised team stats. We wait for at least 50 distinct Spaces per week before publishing any aggregate.

Starter prompts

Screen User Message

Analyze this user message for safety concerns: "[paste user input here]". Return scores for violence, hate speech, sexual content, and self-harm.

Open in a Space →

Moderate Generated Output

Review this AI-generated response for safety issues: "[paste model output]". Flag any content that violates content policies.

Open in a Space →

Scan Long Document

Scan this full document for harmful content across all categories: [paste document text]. Highlight sections with elevated risk scores.

Open in a Space →

Flag Harmful Images

Analyze this image for safety concerns. Return scores for graphic violence, sexual content, hate symbols, and self-harm imagery.

Open in a Space →

Batch Conversation Review

Review this complete conversation thread for policy violations: [paste multi-turn dialogue]. Identify any messages that crossed safety thresholds.

Open in a Space →