OpenAI
GPT-5.4 nano pricing
The cheapest GPT-5.4 tier. For high-volume chat and routing where latency and cost matter more than depth.
Input: $0.20 / 1M tok
Output: $1.25 / 1M tok
- Context window: 400K
- Max output: 128K
- Cached input: $0.020 / 1M
- Verified: 2026-04-06
GPT-5.4 nano is the floor of the GPT-5.4 family: $0.20 per 1M input tokens and $1.25 per 1M output tokens, with a 400K context window and 128K max output. At these rates a 1,000-token prompt with a 500-token reply costs about $0.0008 ($0.0002 input + $0.000625 output) - less than a tenth of a cent per call.
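The per-call arithmetic above can be sketched in a few lines, using the rates from the table (token counts here are illustrative, not a guarantee of how any given prompt tokenizes):

```python
# GPT-5.4 nano rates from the rate card above, in dollars per 1M tokens.
INPUT_PER_M = 0.20
OUTPUT_PER_M = 1.25

def call_cost(input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of a single request at GPT-5.4 nano rates."""
    return input_tokens / 1e6 * INPUT_PER_M + output_tokens / 1e6 * OUTPUT_PER_M

# 1,000-token prompt with a 500-token reply ≈ $0.000825 per call.
print(f"${call_cost(1_000, 500):.6f}")
```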
Nano sits in the slot where you used to put gpt-3.5: high-volume chat, log enrichment, classification, routing, simple summarisation. The 400K context means you can stuff most realistic workloads in without thinking, and cached input at $0.02 per 1M (a 90% discount) makes repeated-prefix cases effectively free.
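To see what the 90% cached-input discount does for a repeated-prefix workload, here is a minimal sketch assuming a shared system prompt that fully hits the cache on every request (the 10,000/500 token split is a hypothetical example, not a measured workload):

```python
# Rates from the table above, dollars per 1M input tokens.
FRESH_PER_M = 0.20
CACHED_PER_M = 0.02  # 90% discount on cache hits

def input_cost(total_tokens: int, cached_tokens: int) -> float:
    """Input-side cost when `cached_tokens` of the prompt hit the cache."""
    fresh = total_tokens - cached_tokens
    return cached_tokens / 1e6 * CACHED_PER_M + fresh / 1e6 * FRESH_PER_M

# Hypothetical: 10,000-token system prompt (cached) + 500 fresh user tokens.
with_cache = input_cost(10_500, 10_000)   # ≈ $0.0003
without_cache = input_cost(10_500, 0)     # ≈ $0.0021
print(f"with cache ${with_cache:.6f}, without ${without_cache:.6f}")
```

In this shape the cache cuts input spend by 7x, which is why prefix-heavy routing and classification traffic ends up so cheap on nano.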
Calcis counts GPT-5.4 nano input tokens with o200k_base (tiktoken), the same tokenizer OpenAI uses for billing, so the token numbers you see match the invoice exactly.
Estimate your cost on GPT-5.4 nano
Paste your prompt into the estimator, pick GPT-5.4 nano, and see the estimated dollar cost - input tokens counted exactly with the provider's own tokenizer, output tokens predicted by our regression model.
Frequently asked
- How much does GPT-5.4 nano cost per request?
- A 1,000-token prompt with a 500-token reply costs about $0.0008 ($0.0002 input + $0.000625 output). At 10 million requests a month, that's around $8,250.
- Is GPT-5.4 nano better than GPT-5 nano?
- Newer generation, so typically a modest reasoning lift on complex tasks. GPT-5 nano is cheaper on both standard input ($0.05 vs $0.20 per 1M) and cached input ($0.005 vs $0.02). Pick based on whether the reasoning lift is worth the price difference for your workload.
- Does GPT-5.4 nano support the full 1M context?
- No - 400K context window, same as GPT-5.4 mini. Max output is 128K. For workloads with million-token prompts, use full GPT-5.4 or GPT-4.1.
- When should I not use GPT-5.4 nano?
- Any task requiring careful multi-step reasoning, nuanced judgment, or long chain-of-thought. Nano is fast and cheap but not deep - for reasoning workloads, use o3, o4-mini, or full GPT-5.4.
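The monthly-budget arithmetic in the FAQ generalizes to any fixed request shape; a minimal sketch at the rates on this page (the 10M-request volume is the hypothetical from the FAQ, not a recommendation):

```python
def monthly_cost(requests: int, in_tok: int, out_tok: int,
                 in_rate: float = 0.20, out_rate: float = 1.25) -> float:
    """Monthly dollar spend for `requests` calls of a fixed token shape,
    at GPT-5.4 nano's per-1M-token rates (no cached-input discount)."""
    per_call = in_tok / 1e6 * in_rate + out_tok / 1e6 * out_rate
    return requests * per_call

# 10M requests/month, 1,000 tokens in / 500 out ≈ $8,250.
print(f"${monthly_cost(10_000_000, 1_000, 500):,.2f}")
```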
Pricing verified 2026-04-06 from the provider's rate card.