Cost calculators

By workload

Pick the shape of your workload. Each calculator ranks every tracked LLM cheapest-first for that specific request profile, with tuning inputs and optimisation tips.

Classification API cost calculator

Tagging, labelling, routing, and moderation workloads. Short inputs, tiny outputs, huge volume: exactly the shape the cheapest models were built for.

Open calculator →

Embedding API cost calculator

Compute the cost of embedding a corpus. One-off index builds, continuous indexing, and query-time embedding are all priced here.

Open calculator →

Coding agent cost calculator

Agentic coding loops are the most expensive LLM workload there is: many tool calls, big context windows, long outputs. Cost yours honestly.

Open calculator →

Summarisation API cost calculator

Cost out a batch of document summaries across every major LLM. Long input, short output: the inverse of a chatbot.

Open calculator →

Chatbot API cost calculator

Estimate what a production chatbot costs per conversation and per month across every major LLM: GPT-5, Claude, Gemini: with realistic defaults.

Open calculator →

RAG pipeline cost calculator

Cost out a retrieval-augmented generation pipeline: embedding queries, retrieving chunks, and paying for the context-heavy chat call on top.

Open calculator →

Need a precise number for your actual prompt?

The workload calculators are great for shortlisting models. For a real prompt, paste it into the estimator and get token-exact costs.

Open the estimator →