OpenAI
GPT-5.4 mini pricing
GPT-5.4 at 1/3 the price. 400K context and the same tokenizer as full 5.4 - the right default for most production workloads.
Input
$0.75 / 1M tok
Output
$4.50 / 1M tok
- Context window: 400K
- Max output: 128K
- Cached input: $0.075 / 1M
- Verified: 2026-04-06
GPT-5.4 mini sits in the mid-tier slot OpenAI reserves for workhorse production traffic. At $0.75 per 1M input tokens and $4.50 per 1M output, it costs roughly a third of full 5.4 and roughly three times nano. The 400K context window is smaller than 5.4's 1M but still large enough for most realistic document workflows.
Cached input at $0.075 per 1M (a 90% discount) makes repeated-prefix workloads cheap. Classification, chat, routing, and light summarisation tasks that'd run fine on 4o mini can typically step up to 5.4 mini for better reasoning without moving the bill significantly - depending on traffic shape it might even land lower.
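As a sketch of how the cached-input discount moves the bill, using the rates from the table above (the request shape and cache hit rate below are illustrative assumptions, not measurements):

```python
# Estimated monthly spend on GPT-5.4 mini with optional prompt caching.
# Rates are from the pricing table above, expressed per token.
INPUT_RATE = 0.75 / 1_000_000    # $ per fresh input token
CACHED_RATE = 0.075 / 1_000_000  # $ per cached input token (90% discount)
OUTPUT_RATE = 4.50 / 1_000_000   # $ per output token

def monthly_cost(requests, prompt_toks, output_toks, cached_fraction=0.0):
    """Estimate monthly API spend; cached_fraction is the share of
    prompt tokens served from the prompt cache (0.0 to 1.0)."""
    cached = prompt_toks * cached_fraction
    fresh = prompt_toks - cached
    per_request = (fresh * INPUT_RATE
                   + cached * CACHED_RATE
                   + output_toks * OUTPUT_RATE)
    return requests * per_request

# 100k requests/month, 1,000-token prompts, 500-token replies:
print(round(monthly_cost(100_000, 1_000, 500), 2))                       # 300.0
print(round(monthly_cost(100_000, 1_000, 500, cached_fraction=0.8), 2))  # 246.0
```

With an 80% cache hit rate on the prompt prefix, the same traffic lands around $246 instead of $300; because output tokens dominate the bill at these rates, caching helps most on long-prompt, short-reply shapes.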
Calcis counts GPT-5.4 mini input tokens with o200k_base (tiktoken), the same tokenizer OpenAI bills against, so the token count on your screen matches the one on your invoice.
Estimate your cost on GPT-5.4 mini
Paste your prompt into the estimator, pick GPT-5.4 mini, and see the exact dollar cost - input tokens counted with the provider's own tokenizer, output tokens predicted by our regression model.
Frequently asked
- How much does GPT-5.4 mini cost per request?
- A 1,000-token prompt with a 500-token reply costs about $0.003 ($0.00075 input + $0.00225 output). At 100,000 requests a month, that's around $300 in API fees.
- Is GPT-5.4 mini better than GPT-4o mini?
- On most benchmarks, yes - newer generation, better reasoning. On price, GPT-4o mini is cheaper ($0.15 / $0.60 vs $0.75 / $4.50 per 1M). Pick 5.4 mini when you need the reasoning lift; stay on 4o mini for pure price-sensitive workloads.
- Does GPT-5.4 mini support the full 1M context?
- No - the context window is 400K tokens vs 1M on the full GPT-5.4. Max output is 128K, same as the full model.
- What's the cached input discount on GPT-5.4 mini?
- $0.075 per 1M cached tokens - a 90% discount on the standard $0.75 input rate. Applied automatically when prompt prefixes repeat within 5 minutes.
Pricing verified 2026-04-06 from the provider's rate card.