LLMgoogle

Google: Gemini 3.5 Flash

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...

Anyone in the Space can @-mention Google: Gemini 3.5 Flash with the team's shared context — pooled credits, one chat, one memory.

All models

Starter is free forever — 1 Space, 100 credits/month, 1 MCP. No card.

Verdict

Gemini 3.5 Flash targets high-throughput workloads where speed and cost matter more than peak reasoning quality. With a 1M token context window and $1.50/Mtok input pricing, it handles massive documents and batch processing efficiently. Expect faster responses than Gemini Pro but weaker performance on complex reasoning tasks. Reach for this when you need to process large volumes of straightforward queries or when latency and cost constraints rule out heavier models.

Best for

  • High-volume batch document processing
  • Cost-sensitive chatbot deployments
  • Rapid prototyping with multimodal inputs
  • Long-context summarization tasks
  • Video and audio transcription workflows

Strengths

The 1M token context window makes this model viable for full-length books, lengthy transcripts, and multi-document analysis without chunking. Multimodal support across text, image, video, and audio enables unified workflows for content processing. Input pricing at $1.50/Mtok undercuts many competitors, making it economical for high-volume applications. Response latency is typically faster than Gemini Pro, which matters for user-facing applications where sub-second replies improve experience.

Trade-offs

Reasoning quality lags behind Gemini Pro and frontier models like Claude Sonnet 4.5 or GPT-4o on complex logic, math, and nuanced instruction-following. Output pricing at $9.00/Mtok climbs quickly for verbose responses, eroding cost advantages in conversational use cases. Without public benchmark data, teams must validate performance on their specific tasks before committing. The model occasionally produces less coherent outputs on multi-step reasoning compared to slower, more capable alternatives.

Specifications

Provider
google
Category
llm
Context length
1,048,576 tokens
Max output
65,536 tokens
Modalities
text, image, video, file, audio
License
proprietary
Released
2026-05-19

Pricing

Input
$1.50/Mtok
Output
$9.00/Mtok
Model ID
google/gemini-3.5-flash

Per-token prices show what the model costs upstream. On Switchy your team draws from one shared org credit pool — one plan, one balance for everyone.

Team cost calculator

Estimated monthly spend
$66.00
17.6M tokens / month
5 seats · 80 msgs/day

Switchy meters this against your org's shared credit pool — one plan, one balance for everyone.

Providers

ProviderContextInputOutputP50 latencyThroughput30d uptime
google1049k$1.50/Mtok$9.00/Mtok

Performance

Performance snapshots are collected daily. Check back after the next ingestion run.

Benchmarks

Public benchmark scores are not available yet for this model. Check back after the next ingestion run.

Works well with

Top MCPs

Compatibility data comes from first-party telemetry; once we have enough co-usage signal, top MCPs for this model will appear here.

How Switchy teams use it

Not enough Spaces have used this model yet to share anonymised team stats. We wait for at least 50 distinct Spaces per week before publishing any aggregate.

Starter prompts

Summarize Long Transcript

Read this full transcript and produce a structured summary with three sections: key decisions made, action items assigned, and unresolved questions. Keep each section under 150 words.
Open in a Space →

Extract Data from Invoice

Extract the following fields from this invoice image: vendor name, invoice number, date, total amount, and line items with quantities. Return as JSON.
Open in a Space →

Classify Support Tickets

Read this support ticket and classify it into one of these categories: billing, technical, account access, feature request, or other. Provide a one-sentence reason for your classification.
Open in a Space →

Generate Product Descriptions

Write a 60-word product description for this item based on the specs provided. Focus on benefits for the target customer and include one call-to-action.
Open in a Space →

Transcribe and Tag Video

Transcribe this video and identify timestamps where the speaker discusses pricing, features, or customer testimonials. Format as a timestamped list.
Open in a Space →
Data last verified 2 hours ago.Sources aggregated hourly to weekly. See docs/architecture/model-directory.md.