HomeAPIs › Cohere

Cohere API — pricing, free trial & how to get a key

Cohere is an enterprise-focused AI provider whose sweet spot is retrieval: its Embed and Rerank models are widely used to power RAG and search, and its Command models handle generation. If you're building a search or knowledge-base product rather than a chatbot, Cohere is often the cheaper, more accurate backbone. Here's what it costs and how to get your key.

Cohere API pricing (reference, June 2026)

ModelInput $/1MOutput $/1MBest for
Command R+$2.50$10.00Flagship generation, RAG
Command R cheap$0.15$0.60High-volume RAG & tools
Embed v3~$0.10 per 1M tokensVector embeddings
Rerank~$2.00 per 1,000 queriesReordering search results
⚠️ Reference prices, June 2026 — Cohere updates pricing and model versions regularly. Confirm on cohere.com/pricing before budgeting. Embed and Rerank are billed differently from chat tokens.

→ Estimate the generation side on the AI API cost calculator, or model a retrieval app with the RAG cost calculator.

Is there a free tier?

Yes — Cohere issues free trial API keys with rate limits, suitable for evaluation and prototyping across Command, Embed and Rerank. You upgrade to a production key with billing when you need higher throughput. For other genuinely free options, compare Gemini and Mistral.

How to get a Cohere API key (step by step)

1. Sign up at dashboard.cohere.com.
2. Open the API Keys page — a free trial key is created for you automatically.
3. Create a production key and add billing when you outgrow the trial limits.
4. Use the official Cohere SDKs, or call the REST endpoints directly.

Test it with a simple request:

# quick test (replace $COHERE_API_KEY)
curl https://api.cohere.com/v2/chat \
-H "Authorization: Bearer $COHERE_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model":"command-r","messages":[{"role":"user","content":"hi"}]}'

Estimate your cost

For chat/generation, use the AI API Cost Calculator to compare Command against GPT-4o, Claude and Gemini. For a full retrieval system — embeddings, vector store, rerank and generation — the RAG cost calculator breaks down where the money actually goes.

Cheaper alternatives

For pure chat, DeepSeek and Gemini Flash undercut Command R+. But for reranking and embeddings specifically, Cohere is often the price-performance leader — compare its Embed/Rerank against OpenAI embeddings before switching. To access Cohere plus many others through one key, see OpenRouter.

FAQ

What is Cohere best at?

Retrieval. Its Embed and Rerank models are built for search and RAG, often delivering better relevance per dollar than general-purpose chat models.

Does Cohere have a free tier?

Yes — free, rate-limited trial keys for evaluation and prototyping, with paid production keys for scale.

How do I get a Cohere API key?

Sign up at dashboard.cohere.com; a free trial key is generated automatically. Create a production key and add billing for higher limits.

Is Cohere good for chatbots?

Command R/R+ handle chat and tool use well, but for open-ended consumer chat the big general models may be cheaper. Cohere shines in RAG and enterprise search.

Not affiliated with Cohere. Prices are reference estimates — always verify on the official pricing page.