LLM Cost Calculator

Enter your expected token usage. See what it costs across 731 models from 9 providers.

ProviderModelInputOutputPer request ^Monthly
fireworksSSD-1B<$0.001<$0.001<$0.001<$0.001
fireworksjapanese-stable-diffusion-xl<$0.001<$0.001<$0.001<$0.001
fireworksplayground-v2-1024px-aesthetic<$0.001<$0.001<$0.001<$0.001
fireworksplayground-v2-5-1024px-aesthetic<$0.001<$0.001<$0.001<$0.001
fireworksstable-diffusion-xl-1024-v1-0<$0.001<$0.001<$0.001<$0.001
fireworksflux-1-schnell-fp8<$0.001<$0.001<$0.001<$0.001
fireworksflux-1-dev-fp8<$0.001<$0.001<$0.001<$0.001
fireworksflux-1-dev-controlnet-union<$0.001<$0.001<$0.001$0.0015
groqllama-3.2-1b-preview<$0.001<$0.001<$0.001$0.060
fireworksflux-kontext-pro<$0.001<$0.001<$0.001$0.060
mistralministral-3b<$0.001<$0.001<$0.001$0.060
groqllama-3.1-8b-instant<$0.001<$0.001<$0.001$0.090
groqllama3-8b-8192<$0.001<$0.001<$0.001$0.090
mistralmistral-small-24b-instruct-2501<$0.001<$0.001<$0.001$0.090
groqllama-3.2-3b-preview<$0.001<$0.001<$0.001$0.090
groqgemma-7b-it<$0.001<$0.001<$0.001$0.105
geminigemini-flash-1.5-8b<$0.001<$0.001<$0.001$0.113
mistraldevstral-small<$0.001<$0.001<$0.001$0.120
fireworksflux-kontext-max<$0.001<$0.001<$0.001$0.120
mistralmistral-small-3-2-2506<$0.001<$0.001<$0.001$0.150
togethergpt-oss-20b<$0.001<$0.001<$0.001$0.150
togetherQwen/Qwen1.5-0.5B<$0.001<$0.001<$0.001$0.150
togetherQwen/Qwen1.5-1.8B<$0.001<$0.001<$0.001$0.150
togetherQwen/Qwen1.5-4B<$0.001<$0.001<$0.001$0.150
togethergoogle/gemma-2b<$0.001<$0.001<$0.001$0.150
togethermeta-llama/Meta-Llama-3-8B-Instruct-Lite<$0.001<$0.001<$0.001$0.150
togethermicrosoft/phi-2<$0.001<$0.001<$0.001$0.150
togethertogethercomputer/RedPajama-INCITE-Base-3B-v1<$0.001<$0.001<$0.001$0.150
togethertogethercomputer/RedPajama-INCITE-Chat-3B-v1<$0.001<$0.001<$0.001$0.150
togethertogethercomputer/RedPajama-INCITE-Instruct-3B-v1<$0.001<$0.001<$0.001$0.150
togethertogether-ai-up-to-4b<$0.001<$0.001<$0.001$0.150
fireworksgemma-3-27b-it<$0.001<$0.001<$0.001$0.150
fireworksllama-v3p2-1b-instruct<$0.001<$0.001<$0.001$0.150
fireworksllama-v3p2-3b-instruct<$0.001<$0.001<$0.001$0.150
fireworkscodegemma-2b<$0.001<$0.001<$0.001$0.150
fireworkscogito-v1-preview-llama-3b<$0.001<$0.001<$0.001$0.150
fireworksdeepseek-coder-1b-base<$0.001<$0.001<$0.001$0.150
fireworksdeepseek-r1-distill-qwen-1p5b<$0.001<$0.001<$0.001$0.150
fireworksernie-4p5-21b-a3b-pt<$0.001<$0.001<$0.001$0.150
fireworksernie-4p5-300b-a47b-pt<$0.001<$0.001<$0.001$0.150
fireworksflux-1-dev<$0.001<$0.001<$0.001$0.150
fireworksflux-1-schnell<$0.001<$0.001<$0.001$0.150
fireworksgemma-2b-it<$0.001<$0.001<$0.001$0.150
fireworksllama-guard-3-1b<$0.001<$0.001<$0.001$0.150
fireworksllama-v2-70b<$0.001<$0.001<$0.001$0.150
fireworksllama-v3p1-405b-instruct-long<$0.001<$0.001<$0.001$0.150
fireworksllama-v3p1-70b-instruct-1b<$0.001<$0.001<$0.001$0.150
fireworksllama-v3p2-1b<$0.001<$0.001<$0.001$0.150
fireworksllama-v3p2-3b<$0.001<$0.001<$0.001$0.150
fireworksminimax-m1-80k<$0.001<$0.001<$0.001$0.150

Showing top 50 cheapest models. 731 total across 9 providers. Data from pricing.json, updated weekly. Full table | API

Stop guessing, start tracking

LLMKit tracks actual costs per request, per session, per user. Budget limits reject requests before they reach the provider.

MIT licensed. Built with Claude Code. Source on GitHub