OpenAI

GPT-4.1 mini pricing

Previous-generation mini at 1M context. 5x cheaper than full 4.1 for compatible workloads.

Input

$0.40/ 1M tok

Output

$1.60/ 1M tok

Context window
1.0M
Max output
33K
Cached input
$0.100 / 1M
Verified
2026-04-06

GPT-4.1 mini carries the 1M context window of the full GPT-4.1 at a fraction of the price: $0.40 per 1M input and $1.60 per 1M output. For document-heavy workloads that need the full million tokens of context but don't need the flagship's capability, this is often the right landing spot.

A 10K-token prompt with a 2K-token reply costs about $0.0072 on 4.1 mini - versus $0.028 on full 4.1 for the same shape. Cached input at $0.10 per 1M (a 75% discount) is less aggressive than the 90% on GPT-5-era models, but still substantial for repeated-prefix document workflows.

Calcis counts GPT-4.1 mini input tokens with o200k_base (tiktoken), the same tokenizer OpenAI uses for billing - so the token count you see before you send matches the one that lands on your invoice.

Estimate your cost on GPT-4.1 mini

Paste your prompt into the estimator, pick GPT-4.1 mini, and see the exact dollar cost - input tokens counted with the provider's own tokenizer, output tokens predicted by our regression model.

Frequently asked

When should I pick GPT-4.1 mini over GPT-5 mini?
Pick 4.1 mini when you need a 1M context window (GPT-5 mini tops out at 400K). On pure price, GPT-5 mini is cheaper at $0.25/$2 per 1M vs 4.1 mini's $0.40/$1.60 - but context size flips the decision for long-document work.
How much does GPT-4.1 mini cost per request?
A 1,000-token prompt with a 500-token reply costs about $0.0012 ($0.0004 input + $0.0008 output). At 1 million requests a month, that's around $1,200.
Does GPT-4.1 mini support the full 1M context?
Yes - 1,047,576-token context window, same as full GPT-4.1. Max output is 32,768 tokens.
What's the cached input discount on GPT-4.1 mini?
$0.10 per 1M cached tokens - a 75% discount on the standard $0.40 input rate. Automatic when prompt prefixes repeat within 5 minutes.

Pricing verified 2026-04-06 from the provider's rate card.