LLMnvidia

NVIDIA: Nemotron 3.5 Content Safety (free)

NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model from NVIDIA, fine-tuned from Google Gemma-3-4B. It moderates both inputs to and responses from LLMs and VLMs, accepting...

Anyone in the Space can @-mention NVIDIA: Nemotron 3.5 Content Safety (free) with the team's shared context — pooled credits, one chat, one memory.

All models

Starter is free forever — 1 Space, 100 credits/month, 1 MCP. No card.

Verdict

Nemotron 3.5 Content Safety is a specialized classifier that flags harmful content across text and images at zero cost. It returns structured safety scores rather than generative responses, making it ideal for pre-filtering user inputs or moderating outputs from other models. The 128K context window handles long documents, but you'll need to integrate its JSON responses into your own logic. Reach for this when you need free, high-throughput content moderation without the overhead of a full LLM.

Best for

  • Pre-filtering user inputs before LLM calls
  • Moderating generated content at scale
  • Flagging harmful images in uploads
  • Long-document safety scanning
  • Cost-sensitive moderation pipelines

Strengths

Zero-cost operation makes this viable for high-volume moderation where per-token pricing would be prohibitive. The 128K context window processes entire documents or conversation threads in one pass, avoiding chunking complexity. Multimodal support means you can screen both text prompts and image uploads through a single endpoint. Returns granular safety scores across categories rather than binary pass-fail, giving you control over threshold tuning.

Trade-offs

No public benchmarks available yet, so accuracy relative to OpenAI Moderation or Llama Guard remains unverified. As a classifier rather than a generative model, it cannot explain its decisions or suggest safer alternatives — you get scores, not reasoning. Proprietary license limits transparency into training data and decision boundaries. Early-stage offering means documentation and edge-case handling may lag more mature moderation APIs.

Specifications

Provider
nvidia
Category
llm
Context length
128,000 tokens
Max output
8,192 tokens
Modalities
text, image
License
proprietary
Released
2026-06-04

Pricing

Input
$0.00/Mtok
Output
$0.00/Mtok
Model ID
nvidia/nemotron-3.5-content-safety:free

Per-token prices show what the model costs upstream. On Switchy your team draws from one shared org credit pool — one plan, one balance for everyone.

Team cost calculator

Estimated monthly spend
Freeno token cost
17.6M tokens / month
5 seats · 80 msgs/day

Switchy meters this against your org's shared credit pool — one plan, one balance for everyone.

Providers

ProviderContextInputOutputP50 latencyThroughput30d uptime
nvidia128k$0.00/Mtok$0.00/Mtok

Performance

Performance snapshots are collected daily. Check back after the next ingestion run.

Benchmarks

Public benchmark scores are not available yet for this model. Check back after the next ingestion run.

Works well with

Top MCPs

Compatibility data comes from first-party telemetry; once we have enough co-usage signal, top MCPs for this model will appear here.

How Switchy teams use it

Not enough Spaces have used this model yet to share anonymised team stats. We wait for at least 50 distinct Spaces per week before publishing any aggregate.

Starter prompts

Screen User Message

Analyze this user message for safety concerns: "[paste user input here]". Return scores for violence, hate speech, sexual content, and self-harm.
Open in a Space →

Moderate Generated Output

Review this AI-generated response for safety issues: "[paste model output]". Flag any content that violates content policies.
Open in a Space →

Scan Long Document

Scan this full document for harmful content across all categories: [paste document text]. Highlight sections with elevated risk scores.
Open in a Space →

Flag Harmful Images

Analyze this image for safety concerns. Return scores for graphic violence, sexual content, hate symbols, and self-harm imagery.
Open in a Space →

Batch Conversation Review

Review this complete conversation thread for policy violations: [paste multi-turn dialogue]. Identify any messages that crossed safety thresholds.
Open in a Space →
Data last verified 2 hours ago.Sources aggregated hourly to weekly. See docs/architecture/model-directory.md.