Question 1

How much does GPT-5 mini cost per request?

Accepted Answer

A 1,000-token prompt with a 500-token reply costs about $0.00125 ($0.00025 input + $0.001 output). At 1 million requests a month, that's around $1,250.

Question 2

Is GPT-5 mini good enough for production?

Accepted Answer

For chat, summarisation, classification, and routing workloads, yes - benchmarks repeatedly show mini at parity with full GPT-5 on these tasks. For complex reasoning, long chain-of-thought, or code generation, test both and compare on your actual workload.

Question 3

Does GPT-5 mini support the same context as full GPT-5?

Accepted Answer

Yes - 400K context window and 128K max output, identical to full GPT-5. The difference is model capacity, not plumbing.

Question 4

What's the cached input discount on GPT-5 mini?

Accepted Answer

$0.025 per 1M cached tokens - a 90% discount on the standard $0.25 input rate. Applied automatically when prompt prefixes repeat within 5 minutes.

GPT-5 mini pricing

Estimate your cost on GPT-5 mini

Frequently asked