Question 1

Which is cheaper, Claude Sonnet 4.5 or Gemini 2.5 Pro?

Accepted Answer

On a typical 1,000-input / 2,000-output request, Gemini 2.5 Pro costs ~$0.0213 vs ~$0.0330 on Claude Sonnet 4.5. Input or output rates can flip the answer for very lopsided workloads - see the cost ladder above.

Question 2

What's the difference in per-token pricing?

Accepted Answer

Claude Sonnet 4.5 charges $3.00 per 1M input tokens and $15.00 per 1M output tokens. Gemini 2.5 Pro charges $1.25 / $10.00 per 1M.

Question 3

Which has the bigger context window?

Accepted Answer

Gemini 2.5 Pro is larger (2M) vs 200K on the other.

Question 4

Is there a cached-input discount on either?

Accepted Answer

Claude Sonnet 4.5 does not publish a cached-input rate. Gemini 2.5 Pro caches at $0.125 per 1M (90% off). Workloads with repeated static prefixes see the biggest savings.

Question 5

Does Gemini 2.5 Pro have a long-context surcharge?

Accepted Answer

Yes. Above 200K input tokens, Gemini 2.5 Pro bills at $2.50 input / $15.00 output per 1M instead of the standard rate.

Question 6

How fresh is this comparison?

Accepted Answer

Claude Sonnet 4.5 was re-verified on 2026-04-06 and Gemini 2.5 Pro on 2026-04-06 against each provider's published rate card. Calcis re-checks every row on a rolling schedule and re-deploys when a provider changes pricing.

Scenario	Tokens (in / out)	Claude Sonnet 4.5	Gemini 2.5 Pro	Winner
Short prompt	100 / 200	$0.0033	$0.0021	Gemini 2.5 Pro
Typical request	1,000 / 2,000	$0.0330	$0.0213	Gemini 2.5 Pro
Long document	10,000 / 5,000	$0.1050	$0.0625	Gemini 2.5 Pro
Large prompt	100,000 / 10,000	$0.4500	$0.2250	Gemini 2.5 Pro

Traffic	Req / month	Claude Sonnet 4.5	Gemini 2.5 Pro	Delta
Small SaaS	1,000	$33.00	$21.25	Gemini 2.5 Pro -$11.75
Growing product	10,000	$330.00	$212.50	Gemini 2.5 Pro -$117.50
Heavy usage	100,000	$3,300	$2,125	Gemini 2.5 Pro -$1,175

Claude Sonnet 4.5 vs Gemini 2.5 Pro

Claude Sonnet 4.5

Gemini 2.5 Pro

Cost per request

Monthly bill at scale

Which should you use?

Live cost calculator

Try both in the estimator →

Frequently asked