Question 1

When should I pick GPT-4.1 mini over GPT-5 mini?

Accepted Answer

Pick 4.1 mini when you need a 1M context window (GPT-5 mini tops out at 400K). On pure price, GPT-5 mini is cheaper at $0.25/$2 per 1M vs 4.1 mini's $0.40/$1.60 - but context size flips the decision for long-document work.

Question 2

How much does GPT-4.1 mini cost per request?

Accepted Answer

A 1,000-token prompt with a 500-token reply costs about $0.0012 ($0.0004 input + $0.0008 output). At 1 million requests a month, that's around $1,200.

Question 3

Does GPT-4.1 mini support the full 1M context?

Accepted Answer

Yes - 1,047,576-token context window, same as full GPT-4.1. Max output is 32,768 tokens.

Question 4

What's the cached input discount on GPT-4.1 mini?

Accepted Answer

$0.10 per 1M cached tokens - a 75% discount on the standard $0.40 input rate. Automatic when prompt prefixes repeat within 5 minutes.

GPT-4.1 mini pricing

Estimate your cost on GPT-4.1 mini

Frequently asked