โ€”
cheapest capable
โ€”cheapest / month
โ€”flagship spread / mo

Full price table โ€” your workload

Reference $/1M tokens (June 2026), monthly cost on your numbers above.

ProviderModelIn $/1MOut $/1MCost / mo

Cheapest by use case

โš ๏ธ Reference prices, June 2026 โ€” all three providers change pricing regularly. Confirm on each provider's pricing page. Output is billed separately and costs more than input on every model.

How to actually choose

Brand loyalty is the most expensive habit in AI. The three providers leapfrog each other constantly, and the real cost difference comes from two things you control: the model tier you pick and how long your outputs are. A frontier model on short answers can be cheaper than a "cheap" model on rambling ones. Price the workload, not the logo โ€” the table above does exactly that on your own numbers.

Drill into one provider with the GPT-4o calculator, the Claude calculator or the Gemini calculator, compare all models at once on the full AI API cost calculator, or estimate a whole product with the AI app cost estimator.

FAQ

Which is cheapest overall? For capable-but-cheap, Gemini 2.0 Flash and GPT-4o mini. For flagships it's closer and workload-dependent.

Does the cheapest model mean lowest total cost? Not always โ€” a slightly pricier model that answers in fewer tokens can win. Compare on your real token counts.