Skip to content

Token Pricing Comparison

Token Pricing Comparison

Prices are per 1 million tokens. All figures are approximate and subject to change.

OpenAI

ModelInputOutput
GPT-4o$2.50$10.00
GPT-4o mini$0.15$0.60
GPT-4 Turbo$10.00$30.00
GPT-3.5 Turbo$0.50$1.50

Anthropic

ModelInputOutput
Claude 3.5 Sonnet$3.00$15.00
Claude 3 Opus$15.00$75.00
Claude 3 Sonnet$3.00$15.00
Claude 3 Haiku$0.25$1.25

Google

ModelInputOutput
Gemini 1.5 Pro$3.50$10.50
Gemini 1.5 Flash$0.075$0.30
Gemini 1.0 Pro$0.50$1.50

Mistral

ModelInputOutput
Mistral Large$4.00$12.00
Mistral Medium$2.75$8.10
Mistral Small$1.00$3.00
Mistral Tiny$0.25$0.25

Cohere

ModelInputOutput
Command R+$3.00$15.00
Command R$0.50$1.50

Cost Optimization Tips

  • Use smaller models first. Start with mini/flash/tiny variants and escalate to larger models only when needed.
  • Cache repeated prompts. Prompt caching can reduce costs for repeated context.
  • Shorten output where possible. Set max_tokens to reasonable limits.
  • Batch requests. Some providers offer discounted batch pricing.