50 models. Gemini 2.5 Pro, Flash, and all Google AI models. Prices per 1M tokens in USD.
Cheapest input
$0.0375/1M
gemini-flash-1.5-8b
Most expensive input
$15/1M
claude-4-opus
Models with cache pricing
32 of 50
| Model | Input $/1M | Output $/1M | Cache $/1M |
|---|---|---|---|
| gemini-flash-1.5-8b | $0.0375 | $0.15 | $0.01 |
| gemini-1.5-flash | $0.075 | $0.3 | $0.01875 |
| gemini-2.0-flash-lite | $0.075 | $0.3 | - |
| gemini-flash-1.5 | $0.075 | $0.3 | $0.01875 |
| gemini-2.0-flash-lite-001 | $0.075 | $0.3 | $0.0187 |
| gemini-2.0-flash | $0.1 | $0.4 | $0.025 |
| gemini-2.5-flash-lite | $0.1 | $0.4 | $0.01 |
| gemini-2.0-flash-001 | $0.1 | $0.4 | $0.025 |
| gemini-2.5-flash-lite-preview-09-2025 | $0.1 | $0.4 | $0.01 |
| gemini-flash-lite-latest | $0.1 | $0.4 | $0.025 |
| gemini-2.5-flash-lite-preview-06-17 | $0.1 | $0.4 | $0.025 |
| gemini-1.0-pro-vision-001 | $0.125 | $0.375 | - |
| gemini-pro | $0.125 | $0.375 | - |
| gemini-2.5-flash-preview | $0.15 | $0.6 | - |
| claude-3-haiku | $0.25 | $1.25 | $0.03 |
| gemini-3.1-flash-lite-preview | $0.25 | $1.5 | $0.025 |
| gemini-2.5-flash | $0.3 | $2.5 | $0.03 |
| gemini-2.5-flash-image | $0.3 | $30 | - |
| gemini-live-2.5-flash-preview-native-audio-09-2025 | $0.3 | $2 | $0.075 |
| gemini-robotics-er-1.5-preview | $0.3 | $2.5 | - |
| gemini-2.5-flash-preview-09-2025 | $0.3 | $2.5 | $0.075 |
| gemini-flash-latest | $0.3 | $2.5 | $0.075 |
| gemini-2.5-flash-preview-tts | $0.3 | $2.5 | - |
| gemini-2.5-flash-native-audio-latest | $0.3 | $2.5 | - |
| gemini-2.5-flash-native-audio-preview-09-2025 | $0.3 | $2.5 | - |
| gemini-2.5-flash-native-audio-preview-12-2025 | $0.3 | $2.5 | - |
| gemini-exp-1206 | $0.3 | $2.5 | $0.03 |
| gemini-gemma-2-27b-it | $0.35 | $1.05 | - |
| gemini-gemma-2-9b-it | $0.35 | $1.05 | - |
| gemini-3-flash-preview | $0.5 | $3 | $0.05 |
| gemini-3.1-flash-image-preview | $0.5 | $60 | - |
| gemini-live-2.5-flash-preview | $0.5 | $2 | - |
| claude-3-5-haiku | $0.8 | $4 | $0.08 |
| gemini-2.5-pro | $1.25 | $10 | $0.125 |
| gemini-1.5-pro | $1.25 | $5 | - |
| gemini-pro-1.5 | $1.25 | $5 | $0.3125 |
| gemini-2.5-computer-use-preview-10-2025 | $1.25 | $10 | - |
| gemini-2.5-pro-preview-tts | $1.25 | $10 | $0.125 |
| gemini-pro-latest | $1.25 | $10 | $0.125 |
| gemini-3-pro-image-preview | $2 | $120 | - |
| gemini-3-pro-preview | $2 | $12 | $0.2 |
| gemini-3.1-pro-preview | $2 | $12 | $0.2 |
| deep-research-pro-preview-12-2025 | $2 | $12 | - |
| gemini-3.1-pro-preview-customtools | $2 | $12 | $0.2 |
| claude-3-5-sonnet | $3 | $15 | $0.3 |
| claude-3-7-sonnet | $3 | $15 | $0.3 |
| claude-4-sonnet | $3 | $15 | $0.3 |
| claude-opus-4-6 | $5 | $25 | $0.5 |
| claude-3-opus | $15 | $75 | $1.5 |
| claude-4-opus | $15 | $75 | $1.5 |
Proxy your Google Gemini requests through LLMKit. Every call gets logged with token counts, dollar costs, and session attribution. Set budget limits that actually reject requests before they hit the provider.
MIT licensed. Built with Claude Code. Source on GitHub