Chatbot API cost calculator

Estimate what a production chatbot costs per conversation and per month across every major LLM — GPT-5, Claude, Gemini — with realistic defaults.

A customer-facing chatbot is the most common LLM workload, and also the one product teams most often underestimate. The headline $/1M token rate is almost meaningless in isolation — it's the conversation shape (system prompt, message history, output length) that drives the bill.

The calculator below runs every tracked model against your workload. Tune the inputs on the left and watch the table re-sort in real time. Start with the defaults (a 500-token system prompt, 200 tokens of user input, and 300 tokens of assistant reply, a reasonable shape for a mid-sized support bot) and adjust from there.
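To see why conversation shape, not the headline rate, drives the bill, here is a minimal sketch of full-history replay using the default workload above. The per-token rates are hypothetical placeholders, not any provider's actual prices.

```python
# Illustrative only: hypothetical rates in $ per token, not a real rate card.
INPUT_RATE = 0.10 / 1_000_000   # $0.10 per 1M input tokens (assumed)
OUTPUT_RATE = 0.40 / 1_000_000  # $0.40 per 1M output tokens (assumed)

SYSTEM, USER, REPLY = 500, 200, 300  # the calculator's default shape

def turn_cost(turn: int) -> float:
    """Cost of the Nth turn when the full history is replayed each request."""
    history = (turn - 1) * (USER + REPLY)  # every prior turn is re-sent
    input_tokens = SYSTEM + history + USER
    return input_tokens * INPUT_RATE + REPLY * OUTPUT_RATE

for n in (1, 5, 10):
    print(f"turn {n}: ${turn_cost(n):.6f}")
```

By turn 10 the replayed history dwarfs the system prompt and the user's new message, which is why trimming history (see the optimization tips below) matters more than the per-token rate.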

Workload parameters

Costs update live across every model in the table below.

Top 8 cheapest models for this workload

Sorted by total cost per request (input + output, with tokenizer and long-context adjustments applied).

Model | Provider | Per request | Per day | Monthly (×30)
GPT-5 nano | OpenAI | $0.00015 | $0.15 | $4.65
Gemini 2.5 Flash-Lite | Google | $0.00019 | $0.19 | $5.70
GPT-4o mini | OpenAI | $0.00028 | $0.28 | $8.55
GPT-5.4 nano | OpenAI | $0.00051 | $0.51 | $15.45
Gemini 3.1 Flash-Lite (preview) | Google | $0.00063 | $0.63 | $18.75
GPT-4.1 mini | OpenAI | $0.00076 | $0.76 | $22.80
GPT-5 mini | OpenAI | $0.00077 | $0.78 | $23.25
Gemini 2.5 Flash | Google | $0.00096 | $0.96 | $28.80

Scaling GPT-5 nano

What the cheapest option costs as your traffic grows.

Volume | Monthly cost
Baseline | $4.65
2× volume | $9.30
5× volume | $23.25
10× volume | $46.50

Optimization tips

  • Use prompt caching (Anthropic or OpenAI) for the static part of your system prompt — cuts 50-90% off input costs for repeat traffic.
  • Trim conversation history to the last 4-6 turns. Full-history replay is the single biggest cost line on most chatbots.
  • Route simple intents (greetings, small talk) to a cheaper model like GPT-5-nano or Haiku. Reserve Opus or GPT-5 for complex reasoning.
  • Cap max output tokens. Most "runaway" conversations are models that rambled past a natural stopping point.
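The history-trimming tip can be sketched in a few lines. The role/content message shape follows the common chat-API convention; field names and the exact trimming policy will vary by provider and framework.

```python
# Minimal sketch of "trim conversation history to the last N turns".
# Assumes the common {"role": ..., "content": ...} message convention.
def trim_history(messages, keep_turns=5):
    """Keep the system prompt plus the last `keep_turns` user/assistant pairs."""
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    return system + rest[-2 * keep_turns:]  # one turn = user + assistant
```

A 20-turn conversation shrinks to the system prompt plus the last five pairs, so input tokens stop growing without bound.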

Frequently asked

How do I estimate chatbot cost?

Multiply average input tokens per message (system prompt + history + user turn) by the input rate, add average output tokens times the output rate, then multiply by messages per day and 30 for a monthly bill. The calculator above does this for every model at once.
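That arithmetic fits in one small function. The rates below are placeholders in $ per 1M tokens (the unit providers quote), not real prices.

```python
def monthly_cost(input_tokens, output_tokens, messages_per_day,
                 in_rate_per_m, out_rate_per_m, days=30):
    """Monthly bill: (in tokens * in rate + out tokens * out rate) * volume."""
    per_request = (input_tokens * in_rate_per_m
                   + output_tokens * out_rate_per_m) / 1_000_000
    return per_request * messages_per_day * days

# Default workload (700 in / 300 out) at 1,000 messages/day,
# with assumed rates of $0.10/1M in and $0.40/1M out:
print(f"${monthly_cost(700, 300, 1000, 0.10, 0.40):.2f}")
```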

What's the cheapest model for a chatbot?

For typical support chat workloads (~700 in / 300 out), GPT-5-nano and Gemini 2.5 Flash-Lite are usually the cheapest, followed by Claude Haiku 4.5. The ranking flips if you enable prompt caching, because Anthropic's 10× cache read discount is aggressive.

Does the calculator include prompt caching discounts?

No — it shows base input/output rates so the numbers match a worst-case read of the rate card. Caching can cut input cost 50-90% in practice; apply that as a separate mental discount once you've identified your shortlist.
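That "separate mental discount" can be made concrete. The sketch below assumes the static system prompt is a cache hit on repeat traffic and uses a 90% cache-read discount as an example figure; check your provider's actual caching terms and pricing.

```python
def cached_input_cost(system_tokens, dynamic_tokens, rate_per_m,
                      cache_read_discount=0.90):
    """Per-request input cost when the system prompt hits the cache.

    The 90% read discount is an illustrative assumption, not a quoted rate.
    """
    cached = system_tokens * rate_per_m * (1 - cache_read_discount)
    uncached = dynamic_tokens * rate_per_m  # history + user turn, full price
    return (cached + uncached) / 1_000_000
```

With a 500-token system prompt and 200 dynamic tokens at an assumed $0.10/1M, caching drops the input cost from $0.00007 to $0.000025 per request.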

How accurate are the output token estimates?

The calculator uses the number you enter as-is. If you don't know your real output length, sample 50 actual conversations and average. For quick math, the average customer-support reply is 150-400 tokens; a RAG-style answer with citations is usually 300-600.

Should I factor in latency or rate limits?

Not for cost modelling; those affect UX, not the bill. That said, a model with a two-second tail latency will lose conversations, which lowers volume (and, incidentally, cost) for the wrong reasons. Once you've picked a shortlist, use the estimator for a single-prompt deep dive.

Need a precise number for your actual prompt?

Paste a real prompt into the estimator and get token-accurate costs — input tokens counted with the provider's own tokenizer, output tokens predicted by our regression model.