Name: DeepSeek V4 Flash
Brand: DeepSeek
Price: 0.14 USD
Availability: InStock

Question 1

How much does DeepSeek V4 Flash cost per 1M tokens?

Accepted Answer

$0.14 for input and $0.28 for output per 1M tokens. Cached input is $0.0028 per 1M tokens, a 98% discount on cache hits.

Question 2

What is the DeepSeek V4 Flash context window?

Accepted Answer

1M tokens, with up to 384K output tokens - the same envelope as the larger V4 Pro.

Question 3

How much cheaper is V4 Flash than V4 Pro?

Accepted Answer

Roughly 12x on input ($0.14 vs $1.74) and 12x on output ($0.28 vs $3.48) at standard rates. V4 Flash is built for high-volume, latency-sensitive work; V4 Pro is the flagship for the hardest tasks.

Question 4

How does the cached-input discount work?

Accepted Answer

On a cache hit, input tokens bill at $0.0028 per 1M instead of $0.14 - about 98% off. Workloads with large repeated prefixes benefit most.

Question 5

Is DeepSeek V4 Flash open weight?

Accepted Answer

Yes, released under the MIT licence, so you can self-host as well as call the hosted API.

DeepSeek V4 Flash pricing

Estimate LLM costs before you send

Frequently asked