Which is cheapest: OpenAI, Claude or Gemini?

For the very cheapest capable model, Gemini 2.0 Flash ($0.10 input / $0.40 output per 1M tokens) and GPT-4o mini ($0.15 / $0.60) lead. Among flagships, Gemini 2.5 Pro has the lowest input price, GPT-4o sits in the middle, and Claude Opus is the most expensive. The winner depends on the tier you need, not the brand.

Is Claude more expensive than GPT-4o?

Claude Sonnet 4 ($3 input / $15 output) is a little pricier than GPT-4o ($2.50 / $10), and Claude Opus 4 is far more expensive. But Claude Haiku 3.5 undercuts GPT-4o. Compare on the workload you actually run, since output length dominates the bill.

APICostCalc

AI Models Calculator Tools Learn AI APIs Crypto Payments All APIs 🔔 Alerts Blog

OpenAI vs Claude vs Gemini — pricing compared

The three big LLM providers, side by side. See the full price table, price your own usage on each, and find the cheapest one for your use case.

Input tokens / request

Output tokens / request

Requests / day

—

cheapest capable

—cheapest / month

—flagship spread / mo

Full price table — your workload

Reference $/1M tokens (July 2026), monthly cost on your numbers above.

Provider	Model	In $/1M	Out $/1M	Cost / mo

Cheapest by use case

High-volume / simple (classification, extraction, routing): Gemini 2.0 Flash or GPT-4o mini — pennies per thousand requests.
Chatbots & support: GPT-4o mini or Gemini 2.5 Flash — cheap, fast, good enough; see the chatbot cost calculator.
RAG / long context: Gemini 2.5 Flash or Pro (low input price helps when you stuff in retrieved chunks); see the RAG cost calculator.
Hard reasoning / coding: Claude Sonnet 4 or GPT-4o; Claude Opus / o3 only when you truly need it.
Need a free tier: Gemini (AI Studio) is the only one of the three with a genuine free quota.

⚠️ Reference prices, July 2026 — all three providers change pricing regularly. Confirm on each provider's pricing page. Output is billed separately and costs more than input on every model. · Report outdated price →

✓ Last verified: 2026-07-15· Source: official provider pricing page· Auto-monitored — report change →

How to actually choose

Brand loyalty is the most expensive habit in AI. The three providers leapfrog each other constantly, and the real cost difference comes from two things you control: the model tier you pick and how long your outputs are. A frontier model on short answers can be cheaper than a "cheap" model on rambling ones. Price the workload, not the logo — the table above does exactly that on your own numbers.

Drill into one provider with the GPT-4o calculator, the Claude calculator or the Gemini calculator, compare all models at once on the full AI API cost calculator, or estimate a whole product with the AI app cost estimator.

FAQ

Which is cheapest overall? For capable-but-cheap, Gemini 2.0 Flash and GPT-4o mini. For flagships it's closer and workload-dependent.

Does the cheapest model mean lowest total cost? Not always — a slightly pricier model that answers in fewer tokens can win. Compare on your real token counts.

Host your project:DigitalOcean — $200 free ↗Hostinger VPS

Tools & Hosting

📈 TradingView 🔒 NordVPN 💳 Revolut DigitalOcean $200 Hostinger