Google

Gemini 3.1 Flash-Lite (preview) pricing

The cheapest Gemini 3.x preview. 1M context at $0.25 per 1M input - almost too cheap to meter for routine classification and chat.

Input

$0.25 / 1M tok

Output

$1.50 / 1M tok

Context window
1M
Max output
-
Cached input
$0.025 / 1M
Verified
2026-04-06

Flash-Lite is Google's floor tier: $0.25 per 1M input and $1.50 per 1M output, 1M context window, no long-context surcharge. For bulk classification, log enrichment, routing decisions, and other workloads where you need a capable model but not a careful one, the math starts to look like “free.”

A 1,000-token prompt with a 500-token reply costs about $0.001. At ten million requests a month that's $10,000 - still cheap for a production-scale model, and a fraction of what the same traffic would cost on the Pro tier. Cached input at $0.025 per 1M (a 90% discount) pushes the repeat-prefix case close to zero.
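The arithmetic above is a straight linear rate card, so it fits in a few lines. A minimal sketch (rates as published on this page; the function name is ours):

```python
# Flash-Lite's published rates, in $ per 1M tokens.
INPUT_PER_M = 0.25
OUTPUT_PER_M = 1.50
CACHED_PER_M = 0.025  # cached input: 90% off the fresh-input rate

def request_cost(input_tok: int, output_tok: int, cached_tok: int = 0) -> float:
    """Dollar cost of one request; cached_tok is the prefix billed at the cached rate."""
    fresh = input_tok - cached_tok
    return (fresh * INPUT_PER_M
            + cached_tok * CACHED_PER_M
            + output_tok * OUTPUT_PER_M) / 1_000_000

per_req = request_cost(1_000, 500)          # the example above: $0.001
monthly = per_req * 10_000_000              # ten million requests: $10,000
with_cache = request_cost(1_000, 500, cached_tok=1_000)  # fully cached prompt
```

Fully caching the 1,000-token prompt drops the input side from $0.00025 to $0.000025 per request, which is why the repeat-prefix case rounds toward zero.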

Calcis calls Google's countTokens API for an exact input count and predicts response length from the prompt itself, so a full dollar forecast is available before the call fires.
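The exact-count side of that workflow can be sketched with the google-genai Python SDK's `count_tokens` call. This is a sketch, not Calcis's implementation: the model id string is an assumption for the preview, and the chars/4 fallback is a common rough heuristic we add for illustration, not a Google-documented rate.

```python
import os

def approx_tokens(text: str) -> int:
    # Rough heuristic (~4 characters per token); only the API count is billable truth.
    return max(1, len(text) // 4)

def count_input_tokens(prompt: str,
                       model: str = "gemini-3.1-flash-lite-preview") -> int:
    """Exact count via countTokens when credentials are configured,
    heuristic fallback otherwise. The model id is an assumption."""
    if os.environ.get("GOOGLE_API_KEY"):
        from google import genai
        client = genai.Client()
        return client.models.count_tokens(model=model, contents=prompt).total_tokens
    return approx_tokens(prompt)
```

The authoritative number always comes from the API path; the fallback only exists so an estimate is available before credentials are wired up.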

Estimate your cost on Gemini 3.1 Flash-Lite (preview)

Paste your prompt into the estimator, pick Gemini 3.1 Flash-Lite (preview), and see the exact dollar cost - input tokens counted with the provider's own tokenizer, output tokens predicted by our regression model.

Frequently asked

How much does Gemini 3.1 Flash-Lite cost per request?
A 1,000-token prompt with a 500-token reply costs about $0.001. At one million requests a month, that's around $1,000 in API fees - for most teams, cheaper than the infrastructure that calls it.
Does Flash-Lite have a long-context surcharge?
No. The $0.25 / $1.50 per 1M rates apply across the full 1M context window. Long-context surcharges only exist on the Pro tier (3.1 Pro, 2.5 Pro) above 200K input tokens.
Is Flash-Lite cheaper than GPT-5 nano?
On raw rates, no. GPT-5 nano is cheaper on input ($0.05 vs $0.25), cached input ($0.005 vs $0.025), and output ($0.40 vs $1.50). Pick on capability and ecosystem, not on price alone.
What tokenizer does Flash-Lite use?
Google's proprietary SentencePiece variant. Calcis uses Google's countTokens API for the authoritative count - the same boundary Google uses to bill.
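The cost comparisons in the questions above can be checked with a few lines. A minimal sketch using the 1,000-in / 500-out example and the rate figures quoted on this page (the GPT-5 nano rates are taken from the FAQ as-is):

```python
# Per-request cost under a flat rate card (no caching), rates in $ per 1M tokens.
def per_request(in_tok: int, out_tok: int, in_rate: float, out_rate: float) -> float:
    return (in_tok * in_rate + out_tok * out_rate) / 1_000_000

flash_lite = per_request(1_000, 500, 0.25, 1.50)  # Flash-Lite: $0.001
nano = per_request(1_000, 500, 0.05, 0.40)        # GPT-5 nano: $0.00025
```

At these rates the nano request is 4x cheaper, which is why the FAQ's advice is to choose on capability rather than price.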

Pricing verified 2026-04-06 from the provider's rate card.