How much does Gemini 3.5 Flash cost per 1M tokens?

$1.50 for input and $9.00 for output per 1M tokens at standard rates. Batch mode halves both to $0.75 / $4.50.

Is Gemini 3.5 Flash cheaper than Gemini 3.1 Pro?

Yes. At $1.50/$9 it is roughly 25% below Gemini 3.1 Pro's $2.00/$12, and it beats 3.1 Pro on coding and agentic benchmarks.

What is the Gemini 3.5 Flash context window?

1M tokens, in line with the rest of the Gemini 3.x Flash line.

How much does batch mode save?

Batch mode runs at 50% of the standard list price ($0.75 input / $4.50 output per 1M tokens) with a turnaround of up to 24 hours.

When should I pick 3.5 Flash over a Pro model?

For most coding, RAG, and agentic workloads 3.5 Flash is both cheaper and competitive on quality. Reserve Pro-tier models for tasks where you have measured a quality gap on your own prompts.

Google

New

Gemini 3.5 Flash pricing

Name: Gemini 3.5 Flash
Brand: Google
Price: 1.5 USD
Availability: InStock

Google's May 2026 fast model. At $1.50/$9 per 1M tokens it undercuts Gemini 3.1 Pro by about 25% while beating it on coding and agentic benchmarks.

Input

$1.50/ 1M tok

Output

$9.00/ 1M tok

Context window: 1M
Max output: 66K
Cached input: -
Verified: 2026-07-01

Standard vs batch

Tier	Input / 1M	Output / 1M	Cached in / 1M
Standard Real-time	$1.50	$9.00	-
Batch mode Up to 24h turnaround, 50% off	$0.75	$4.50	-

Batch mode runs every Gemini model at 50% of list price with a turnaround of up to 24 hours.

Gemini 3.5 Flash launched on 19 May 2026 as Google's new fast-tier model. At $1.50 per 1M input tokens and $9.00 per 1M output tokens it lands about 25% below Gemini 3.1 Pro ($2.00 / $12.00) while beating it on coding and agentic benchmarks – a rare case where the cheaper model is also the stronger one for a wide class of tasks.

For high-volume or latency-tolerant work, batch mode halves those rates to $0.75 / $4.50 per 1M tokens with a turnaround of up to 24 hours. Combined with a 1M-token context window, that makes 3.5 Flash a strong default for RAG pipelines, document processing, and agent loops where you would otherwise reach for a Pro-tier model.

If you are on Gemini 3.1 Pro today, 3.5 Flash is worth benchmarking on your own workload: for many coding and tool-use tasks it is both cheaper and better, and the long-context envelope is unchanged.

Estimate LLM costs before you send

Paste your prompt into the Calcis estimator to see token counts and per-request cost across every tracked model, then compare Gemini 3.5 Flash against them side by side.

Open estimator →Compare all models →

Frequently asked

How much does Gemini 3.5 Flash cost per 1M tokens?: $1.50 for input and $9.00 for output per 1M tokens at standard rates. Batch mode halves both to $0.75 / $4.50.
Is Gemini 3.5 Flash cheaper than Gemini 3.1 Pro?: Yes. At $1.50/$9 it is roughly 25% below Gemini 3.1 Pro's $2.00/$12, and it beats 3.1 Pro on coding and agentic benchmarks.
What is the Gemini 3.5 Flash context window?: 1M tokens, in line with the rest of the Gemini 3.x Flash line.
How much does batch mode save?: Batch mode runs at 50% of the standard list price ($0.75 input / $4.50 output per 1M tokens) with a turnaround of up to 24 hours.
When should I pick 3.5 Flash over a Pro model?: For most coding, RAG, and agentic workloads 3.5 Flash is both cheaper and competitive on quality. Reserve Pro-tier models for tasks where you have measured a quality gap on your own prompts.

Pricing verified 2026-07-01 from the provider's rate card. These figures are informational and not yet wired into the Calcis estimator or billing.