Understanding prompt caching pricing and mechanics across LLM providers

| Operation | Multiplier | Example (Claude Sonnet @ $3/MTok) |
|---|---|---|
| Cache Read | 0.1× | $0.30/MTok |
| Cache Write (5 min) | 1.25× | $3.75/MTok |
| Cache Write (1 hour) | 2.0× | $6.00/MTok |
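
The multipliers in this table apply to the base input price, so the example column can be reproduced with simple arithmetic. The following is a minimal Python sketch, assuming the $3/MTok base price from the table header; the names `CACHE_MULTIPLIERS`, `cached_price_per_mtok`, and `break_even_reads` are illustrative and not part of any provider SDK. The break-even helper rests on an additional assumption that does not come from the source: that the alternative to caching is resending the same prefix at the full input price on every request.

```python
# Multipliers from the table above, relative to the base input price.
CACHE_MULTIPLIERS = {
    "cache_read": 0.1,
    "cache_write_5m": 1.25,
    "cache_write_1h": 2.0,
}


def cached_price_per_mtok(base_input_price: float, operation: str) -> float:
    """Price per million tokens for a cache operation, rounded to 4 decimals."""
    return round(base_input_price * CACHE_MULTIPLIERS[operation], 4)


def break_even_reads(write_multiplier: float, read_multiplier: float) -> float:
    """Number of cache reads after which caching becomes cheaper.

    Assumption (not from the source): without caching, every request pays
    the full input price for the prefix. One write plus n reads costs
    (w + n * r) * P with caching versus (1 + n) * P without, so caching
    wins once n > (w - 1) / (1 - r).
    """
    return (write_multiplier - 1.0) / (1.0 - read_multiplier)


if __name__ == "__main__":
    for op in CACHE_MULTIPLIERS:
        print(op, cached_price_per_mtok(3.0, op))
    print("5 min break-even reads:", break_even_reads(1.25, 0.1))
    print("1 hour break-even reads:", break_even_reads(2.0, 0.1))
```

Under that assumption, a 5-minute cache write pays for itself on the first read, while a 1-hour write needs a second read before it comes out ahead.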

| Operation | Multiplier | Storage Cost |
|---|---|---|
| Cache Read | 0.25× | N/A |
| Cache Write | 1.0× | + Storage fee |
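
Under this scheme the cost of a cached prefix has three parts: a one-time write at the normal input rate, reads at 0.25× the input rate, and a storage fee that accrues while the cache exists. The table does not state the storage rate, so the sketch below treats it as a hypothetical parameter, `storage_price_per_mtok_hour`, and assumes (without confirmation from the source) that storage is billed per million tokens per hour.

```python
def storage_cache_cost(
    cached_tokens: int,
    reads: int,
    input_price_per_mtok: float,
    storage_price_per_mtok_hour: float,  # hypothetical: not given in the table
    hours_stored: float,
) -> float:
    """Total dollar cost of writing, storing, and reading a cached prefix."""
    mtok = cached_tokens / 1_000_000
    write_cost = mtok * input_price_per_mtok * 1.0           # Cache Write: 1.0x
    read_cost = mtok * input_price_per_mtok * 0.25 * reads   # Cache Read: 0.25x
    storage_cost = mtok * storage_price_per_mtok_hour * hours_stored
    return write_cost + read_cost + storage_cost


if __name__ == "__main__":
    # Illustrative numbers only: 1M cached tokens, 4 reads, $2.00/MTok input,
    # a made-up $1.00/MTok/hour storage rate, cache kept for half an hour.
    print(storage_cache_cost(1_000_000, 4, 2.0, 1.0, 0.5))
```

Because the write carries no premium here, the storage term is what decides whether a long-lived cache is worthwhile: it grows with time even when no reads occur.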

| Context Size | Input Price | Cache Read | Cache Write (5 min) |
|---|---|---|---|
| ≤200K tokens | $1.25/MTok | $0.31/MTok | $1.625/MTok |
| >200K tokens | $2.50/MTok | $0.625/MTok | $2.875/MTok |
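
Tiered pricing like this can be modeled as a lookup keyed on context size. The sketch below is a minimal Python illustration with two assumptions that the table does not confirm: that the tier is chosen by the total prompt size, and that the chosen tier's rates apply to every token in the request rather than only to tokens above the threshold. The names `Tier`, `tier_for`, and `request_cost` are hypothetical. Two patterns in the figures are worth noting: the cache read price is about 0.25× the input price in both rows ($0.3125 rounded to $0.31, and $0.625), and the 5-minute cache write price is exactly $0.375/MTok above the input price in both rows, which would be consistent with a flat surcharge such as a storage fee, though the source does not say so.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tier:
    input_price: float     # $/MTok
    cache_read: float      # $/MTok
    cache_write_5m: float  # $/MTok


# Values copied from the table above.
TIER_THRESHOLD_TOKENS = 200_000
SHORT_CONTEXT = Tier(input_price=1.25, cache_read=0.31, cache_write_5m=1.625)
LONG_CONTEXT = Tier(input_price=2.50, cache_read=0.625, cache_write_5m=2.875)


def tier_for(context_tokens: int) -> Tier:
    """Pick the pricing tier; assumes total prompt size selects the tier."""
    if context_tokens <= TIER_THRESHOLD_TOKENS:
        return SHORT_CONTEXT
    return LONG_CONTEXT


def request_cost(context_tokens: int, cached_tokens: int) -> float:
    """Dollar cost of a request whose first `cached_tokens` are cache reads.

    Assumes the selected tier's rates apply to all tokens in the request.
    """
    if not 0 <= cached_tokens <= context_tokens:
        raise ValueError("cached_tokens must be between 0 and context_tokens")
    tier = tier_for(context_tokens)
    uncached = context_tokens - cached_tokens
    return (uncached * tier.input_price + cached_tokens * tier.cache_read) / 1_000_000


if __name__ == "__main__":
    print(request_cost(100_000, 0))
    print(request_cost(400_000, 300_000))
```

Keeping the threshold and tier values as named constants makes it easy to update the sketch when published prices change.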