Cost calculators
By workload
Pick the shape of your workload. Each calculator ranks every tracked LLM cheapest-first for that specific request profile, with tuning inputs and optimization tips.
Classification API cost calculator
Tagging, labelling, routing, and moderation workloads. Short inputs, tiny outputs, huge volume — exactly the shape the cheapest models were built for.
Open calculator →
Embedding API cost calculator
Compute the cost of embedding a corpus. One-off index builds, continuous indexing, and query-time embedding are all priced here.
Open calculator →
Coding agent cost calculator
Agentic coding loops are the most expensive LLM workload there is — many tool calls, big context windows, long outputs. Cost yours honestly.
Open calculator →
Summarization API cost calculator
Cost out a batch of document summaries across every major LLM. Long input, short output — the inverse of a chatbot.
Open calculator →
Chatbot API cost calculator
Estimate what a production chatbot costs per conversation and per month across every major LLM — GPT-5, Claude, Gemini — with realistic defaults.
Open calculator →
RAG pipeline cost calculator
Cost out a retrieval-augmented generation pipeline: embedding queries, retrieving chunks, and paying for the context-heavy chat call on top.
Open calculator →
Need a precise number for your actual prompt?
The workload calculators are great for shortlisting models. For a real prompt, paste it into the estimator and get token-exact costs.