Free tool

Universal token counter

Paste any text. See token counts for GPT, Claude, and Gemini side by side — plus characters and words. Fully client-side, nothing leaves your browser.

OpenAI (GPT-5, 4.x)

~0

approx until tokenizer loads

Claude

~0

approx (±10%)

Gemini

~0

approx (±10%)

Characters

0

Words

0

What is a token?

A token is the unit an LLM actually sees. A model doesn't read your characters: a tokenizer splits your text into chunks (usually 2-5 characters each for English, sometimes a whole word, sometimes just a fragment of one), and the model processes those chunks.

Every provider uses a different tokenizer. OpenAI's o200k_base (used by GPT-4o, GPT-4.1, GPT-5) and cl100k_base (older GPTs) are open-source; Anthropic's Claude and Google's Gemini each ship proprietary tokenizers that are only accessible via their APIs. That's why this tool shows exact counts for OpenAI and approximations for the other two.

As a rule of thumb, 1 token ≈ 4 English characters ≈ ¾ of a word. But the rule breaks down for code (closer to 3 chars per token), emoji (1-4 tokens each), and non-English scripts (often 2-3× more tokens per character).
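The rule of thumb translates directly into a quick estimator. A minimal sketch (the function name and the choice to average the two estimates are ours, not part of any real tokenizer):

```typescript
// Rough token estimate from the "1 token ≈ 4 chars ≈ ¾ word" rule of thumb.
// Purely a heuristic -- real counts depend on the tokenizer's vocabulary.
function estimateTokens(text: string): number {
  const byChars = text.length / 4; // 1 token ≈ 4 characters
  const words = text.split(/\s+/).filter(Boolean).length;
  const byWords = (words * 4) / 3; // 1 word ≈ 4/3 tokens
  // Average the two estimates and round up.
  return Math.ceil((byChars + byWords) / 2);
}
```

Expect it to land within roughly ±10% on ordinary English prose and to drift further for code or emoji-heavy text.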

How tokens relate to characters

| Text | Characters | ~Tokens |
| --- | ---: | ---: |
| Hello | 5 | 1 |
| Hello, world! | 13 | 4 |
| The quick brown fox jumps over the lazy dog. | 44 | 11 |
| function add(a, b) { return a + b; } | 36 | 13 |
| pneumonoultramicroscopicsilicovolcanoconiosis | 45 | 11 |
| 🎉🚀✨ emoji counts are surprising | 30 | 12 |

Frequently asked

Why do OpenAI, Claude, and Gemini give different token counts?

Each provider trains its own tokenizer on its own corpus with its own vocabulary size (OpenAI's o200k_base has ~200K tokens, older cl100k_base has ~100K, Claude and Gemini are proprietary but comparable). Different vocabularies produce different splits; a word that's one token for one model might be two or three for another.
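To see why vocabularies produce different splits, here's a toy greedy longest-match tokenizer. Real BPE tokenizers merge character pairs rather than matching prefixes, and both vocabularies below are invented for illustration, but the effect is the same: the vocabulary that knows a word whole emits fewer tokens.

```typescript
// Toy greedy longest-match tokenizer: repeatedly take the longest
// vocabulary entry that prefixes the remaining text.
function tokenize(text: string, vocab: Set<string>): string[] {
  const tokens: string[] = [];
  let i = 0;
  while (i < text.length) {
    let match = text[i]; // single characters always tokenize on their own
    for (let len = text.length - i; len > 1; len--) {
      const piece = text.slice(i, i + len);
      if (vocab.has(piece)) { match = piece; break; }
    }
    tokens.push(match);
    i += match.length;
  }
  return tokens;
}

// Two invented vocabularies: one contains "tokenizer" whole, one doesn't.
const vocabA = new Set(["token", "izer", "tokenizer"]);
const vocabB = new Set(["tok", "en", "izer"]);
tokenize("tokenizer", vocabA); // ["tokenizer"] -- 1 token
tokenize("tokenizer", vocabB); // ["tok", "en", "izer"] -- 3 tokens
```

Same text, same length, a 3× difference in token count, which is exactly the kind of spread you see across providers.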

Is the OpenAI count exact?

Yes, for raw text: it uses the js-tiktoken library (a JavaScript port of OpenAI's tiktoken) with the o200k_base encoding, the same encoding every GPT-4o / GPT-4.1 / GPT-5 model uses, so counts match what the OpenAI API reports for your text. Note that chat requests add a few extra tokens of message formatting on top.

Why are Claude and Gemini counts approximate?

Neither Anthropic nor Google ships an offline tokenizer; the only way to get an exact count is to call their APIs, which would mean a network round trip on every keystroke. Instead we use a character-based approximation (characters ÷ 4) that's accurate to within about ±10% for typical English prose.

Is my text sent anywhere?

No. The counter runs entirely in your browser. OpenAI counting uses js-tiktoken, which encodes locally; the Claude and Gemini counts use a pure-JavaScript heuristic. Nothing is uploaded.

How do I use this for cost estimation?

Multiply the token count by the model's price per million tokens (e.g. $2.50/1M input for GPT-5, so 1,000 tokens = $0.0025). The /estimator and /calculator pages do this math automatically across every model.
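The arithmetic is one line. A sketch using the GPT-5 example price from above (prices change, so treat the number as illustrative):

```typescript
// Cost in USD for a given token count at a per-million-token price.
function costUSD(tokens: number, pricePerMillionUSD: number): number {
  return (tokens * pricePerMillionUSD) / 1_000_000;
}

costUSD(1_000, 2.5);   // 0.0025 -- the example above
costUSD(150_000, 2.5); // 0.375
```

Remember that input and output tokens are usually priced differently, so a full estimate sums the two.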

Know the tokens? Get the cost.

Once you've got a token count, the estimator turns it into an exact dollar forecast across every model.