Name: Gemini 3 Flash (preview)
Brand: Google
Price: 0.5 USD
Availability: InStock

Question 1

Does Gemini 3 Flash have a long-context surcharge?

Accepted Answer

No. The $0.50 / $3 per 1M rates apply across the full 1M context window. Only the Pro tier (3.1 Pro and 2.5 Pro) has dual-tier pricing above 200K input tokens.

Question 2

How much does Gemini 3 Flash cost per request?

Accepted Answer

A 1,000-token prompt with a 500-token reply costs about $0.002 ($0.0005 input + $0.0015 output). At 1 million requests a month, that's around $2,000 - very cheap for a frontier-class model.

Question 3

Is Gemini 3 Flash cheaper than Gemini 2.5 Flash?

Accepted Answer

No - 2.5 Flash is cheaper at $0.30 / $2.50 per 1M tokens. 3 Flash is a newer preview with different capabilities; pick the preview if you need the newer generation, stay on 2.5 Flash for cost-optimised GA workloads.

Question 4

What tokenizer does Gemini 3 Flash use?

Accepted Answer

Google uses SentencePiece but doesn't publish it for offline use. Calcis calls Google's countTokens API for the authoritative count, which matches the Google billing boundary exactly.

Gemini 3 Flash (preview) pricing

Estimate your cost on Gemini 3 Flash (preview)

Frequently asked