Gemini 3.5 Flash pricing
Google's May 2026 fast model. At $1.50/$9 per 1M tokens it undercuts Gemini 3.1 Pro by about 25% while beating it on coding and agentic benchmarks.
Input: $1.50 / 1M tokens
Output: $9.00 / 1M tokens
- Context window: 1M
- Max output: 66K
- Cached input: -
- Verified: 2026-07-01
Standard vs batch
| Tier | Input / 1M | Output / 1M | Cached in / 1M |
|---|---|---|---|
| Standard (real-time) | $1.50 | $9.00 | - |
| Batch mode (up to 24h turnaround, 50% off) | $0.75 | $4.50 | - |
Batch mode runs every Gemini model at 50% of list price with a turnaround of up to 24 hours.
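As a rough illustration of how these rates turn into a per-request figure, the sketch below multiplies token counts by the per-1M prices from the table above. The helper name `estimate_cost` and the example token counts are hypothetical, and cached-input pricing is ignored because none is listed.

```python
# Per-1M-token rates in USD, copied from the pricing table above.
RATES = {
    "standard": {"input": 1.50, "output": 9.00},
    "batch": {"input": 0.75, "output": 4.50},
}


def estimate_cost(input_tokens: int, output_tokens: int, tier: str = "standard") -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    rate = RATES[tier]
    total = input_tokens * rate["input"] + output_tokens * rate["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Illustrative request: 200K input tokens and 4K output tokens.
    for tier in RATES:
        print(f"{tier}: ${estimate_cost(200_000, 4_000, tier):.4f}")
```

For that example request the standard tier comes to roughly $0.34 and batch mode to roughly $0.17, so the 50% discount carries through to the per-request cost unchanged.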
Gemini 3.5 Flash launched on 19 May 2026 as Google's new fast-tier model. At $1.50 per 1M input tokens and $9.00 per 1M output tokens it lands about 25% below Gemini 3.1 Pro ($2.00 / $12.00) while beating it on coding and agentic benchmarks – a rare case where the cheaper model is also the stronger one for a wide class of tasks.
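The "about 25%" figure can be checked directly from the two rate cards quoted above. This is a small sketch; the helper name `discount` is made up for illustration.

```python
# Standard per-1M-token rates in USD, as quoted in the paragraph above.
FLASH_35 = {"input": 1.50, "output": 9.00}
PRO_31 = {"input": 2.00, "output": 12.00}


def discount(cheaper: float, pricier: float) -> float:
    """Fraction by which `cheaper` undercuts `pricier` (hypothetical helper)."""
    return 1 - cheaper / pricier


savings = {kind: discount(FLASH_35[kind], PRO_31[kind]) for kind in FLASH_35}

for kind, pct in savings.items():
    print(f"{kind}: {pct:.0%} below Gemini 3.1 Pro")
```

Because the input and output discounts are identical, the saving works out to 25% for any mix of input and output tokens.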
For high-volume or latency-tolerant work, batch mode halves those rates to $0.75 / $4.50 per 1M tokens with a turnaround of up to 24 hours. Combined with a 1M-token context window, that makes 3.5 Flash a strong default for RAG pipelines, document processing, and agent loops where you would otherwise reach for a Pro-tier model.
If you are on Gemini 3.1 Pro today, 3.5 Flash is worth benchmarking on your own workload: for many coding and tool-use tasks it is both cheaper and better, and you keep the same long-context window.
Estimate LLM costs before you send
Paste your prompt into the Calcis estimator to see token counts and per-request cost across every tracked model, then compare Gemini 3.5 Flash against them side by side.
Frequently asked
- How much does Gemini 3.5 Flash cost per 1M tokens?
  - $1.50 for input and $9.00 for output per 1M tokens at standard rates. Batch mode halves both to $0.75 / $4.50.
- Is Gemini 3.5 Flash cheaper than Gemini 3.1 Pro?
  - Yes. At $1.50/$9 it is roughly 25% below Gemini 3.1 Pro's $2.00/$12, and it beats 3.1 Pro on coding and agentic benchmarks.
- What is the Gemini 3.5 Flash context window?
  - 1M tokens, in line with the rest of the Gemini 3.x Flash line.
- How much does batch mode save?
  - Batch mode runs at 50% of the standard list price ($0.75 input / $4.50 output per 1M tokens) with a turnaround of up to 24 hours.
- When should I pick 3.5 Flash over a Pro model?
  - For most coding, RAG, and agentic workloads 3.5 Flash is both cheaper and competitive on quality. Reserve Pro-tier models for tasks where you have measured a quality gap on your own prompts.
Pricing verified 2026-07-01 from the provider's rate card. These figures are informational and not yet wired into the Calcis estimator or billing.