Question 1

Which is cheaper, Claude Haiku 4.5 or Gemini 2.5 Pro?

Accepted Answer

On a typical 1,000-input / 2,000-output request, Claude Haiku 4.5 costs ~$0.0110 vs ~$0.0213 on Gemini 2.5 Pro. Input or output rates can flip the answer for very lopsided workloads - see the cost ladder above.

Question 2

What's the difference in per-token pricing?

Accepted Answer

Claude Haiku 4.5 charges $1.00 per 1M input tokens and $5.00 per 1M output tokens. Gemini 2.5 Pro charges $1.25 / $10.00 per 1M.

Question 3

Which has the bigger context window?

Accepted Answer

Gemini 2.5 Pro is larger (2M) vs 200K on the other.

Question 4

Is there a cached-input discount on either?

Accepted Answer

Claude Haiku 4.5 does not publish a cached-input rate. Gemini 2.5 Pro caches at $0.125 per 1M (90% off). Workloads with repeated static prefixes see the biggest savings.

Question 5

Does Gemini 2.5 Pro have a long-context surcharge?

Accepted Answer

Yes. Above 200K input tokens, Gemini 2.5 Pro bills at $2.50 input / $15.00 output per 1M instead of the standard rate.

Question 6

How fresh is this comparison?

Accepted Answer

Claude Haiku 4.5 was re-verified on 2026-04-06 and Gemini 2.5 Pro on 2026-04-06 against each provider's published rate card. Calcis re-checks every row on a rolling schedule and re-deploys when a provider changes pricing.

Scenario	Tokens (in / out)	Claude Haiku 4.5	Gemini 2.5 Pro	Winner
Short prompt	100 / 200	$0.0011	$0.0021	Claude Haiku 4.5
Typical request	1,000 / 2,000	$0.0110	$0.0213	Claude Haiku 4.5
Long document	10,000 / 5,000	$0.0350	$0.0625	Claude Haiku 4.5
Large prompt	100,000 / 10,000	$0.1500	$0.2250	Claude Haiku 4.5

Traffic	Req / month	Claude Haiku 4.5	Gemini 2.5 Pro	Delta
Small SaaS	1,000	$11.00	$21.25	Claude Haiku 4.5 -$10.25
Growing product	10,000	$110.00	$212.50	Claude Haiku 4.5 -$102.50
Heavy usage	100,000	$1,100	$2,125	Claude Haiku 4.5 -$1,025

Claude Haiku 4.5 vs Gemini 2.5 Pro

Claude Haiku 4.5

Gemini 2.5 Pro

Cost per request

Monthly bill at scale

Which should you use?

Live cost calculator

Try both in the estimator →

Frequently asked