⚡ Curated ranking

Cheapest AI models with prompt caching

Prompt caching can cut costs by ~90% for workloads with repeated system prompts (RAG, long-context chat, structured workflows). Ranked by cached-input price ascending so you can see the actual savings rate per model.

🔔
Depend on one of these models?
Watchlist alerts are coming soon — pick the models you rely on and we'll email you the moment their price, capabilities, or lifecycle change. Reserve a spot and we'll let you know when it's live.
Reserve early access →
# Model Provider Cached input price Lifecycle Verified
1
Gemini 2.5 Flash-Lite
Google Gemini · Active
GG Google Gemini $0.01/M cached Active · 12h ago
2
GPT-5.4 Nano
OpenAI · Active
OP OpenAI $0.02/M cached Active · 11h ago
3
GPT-5 Mini
OpenAI · Active
OP OpenAI $0.025/M cached Active · 11h ago
4
Gemini 3.1 Flash-Lite
Google Gemini · Active
GG Google Gemini $0.025/M cached Active · 11h ago
5
Gemini 2.5 Flash
Google Gemini · Active
GG Google Gemini $0.03/M cached Active · 12h ago
6
Gemini 3 Flash Preview
Google Gemini · Active
GG Google Gemini $0.05/M cached Active · 11h ago
7
GPT-4o Mini
OpenAI · Active
OP OpenAI $0.075/M cached Active · 11h ago
8
GPT-5.4 Mini
OpenAI · Active
OP OpenAI $0.075/M cached Active · 11h ago
9
Claude Haiku 3.5
Anthropic · Active
AN Anthropic $0.08/M cached Active · 12h ago
10
GPT-4.1 Mini
OpenAI · Active
OP OpenAI $0.1/M cached Active · 11h ago
11
Claude Haiku 4.5
Anthropic · Active
AN Anthropic $0.1/M cached Active · 12h ago
12
GPT-5
OpenAI · Active
OP OpenAI $0.125/M cached Active · 11h ago
13
GPT-5.1
OpenAI · Active
OP OpenAI $0.125/M cached Active · 11h ago
14
Gemini 2.5 Pro
Google Gemini · Active
GG Google Gemini $0.125/M cached Active · 12h ago
15
Gemini 3.1 Flash Image 🍌
Google Gemini · Active
GG Google Gemini $0.15/M cached Active · 11h ago
16
Gemini 3.5 Flash
Google Gemini · Active
GG Google Gemini $0.15/M cached Active · 11h ago
17
Gemini 3.1 Flash Image Preview 🍌
Google Gemini · Active
GG Google Gemini $0.151/M cached Active · 11h ago
18
GPT-5.2
OpenAI · Active
OP OpenAI $0.175/M cached Active · 11h ago
19
Gemini 3.1 Flash Live Preview
Google Gemini · Active
GG Google Gemini $0.18/M cached Active · 11h ago
20
Grok 4.20
xAI · Active
XA xAI $0.2/M cached Active · 11h ago
21
Grok 4.3
xAI · Active
XA xAI $0.2/M cached Active · 11h ago
22
Grok Build 0.1
xAI · Active
XA xAI $0.2/M cached Active · 11h ago
23
GPT-5.4
OpenAI · Active
OP OpenAI $0.25/M cached Active · 11h ago
24
o4-mini
OpenAI · Active
OP OpenAI $0.275/M cached Active · 11h ago
25
Claude Sonnet 4
Anthropic · Active
AN Anthropic $0.3/M cached Active · 12h ago
Showing top 25 of 36 matches.
Methodology

How this ranking is built

Active models offering prompt caching with a published cached-input price. Ranked from cheapest cached-input cost upward — that's the rate you actually pay on cache hits, which is where the savings live.

Refreshed as we re-verify each model against its provider's official docs. Every value links to the page it came from. Last updated Jun 3, 2026 19:44 UTC.