LLM API Pricing
Stay up to date with LLM API pricing from providers such as OpenAI, Anthropic, Google, Cohere, and Mistral. The table lists each model's context window and its price per 1M input and output tokens, so you can compare costs for your workload.
Last updated: 2024-11-12
Model | Provider | Context | Input (USD per 1M tokens) | Output (USD per 1M tokens) | Updated |
---|---|---|---|---|---|
gpt-4o | OpenAI | 128K | $2.5 | $10 | 2024-11-06 |
gpt-4o-2024-08-06 | OpenAI | 128K | $2.5 | $10 | 2024-11-06 |
gpt-4o-audio-preview (Text) | OpenAI | 128K | $2.5 | $10 | 2024-11-06 |
gpt-4o-audio-preview (Audio) | OpenAI | 128K | $100 | $200 | 2024-11-06 |
gpt-4o-audio-preview-2024-10-01 (Text) | OpenAI | 128K | $2.5 | $10 | 2024-11-06 |
gpt-4o-audio-preview-2024-10-01 (Audio) | OpenAI | 128K | $100 | $200 | 2024-11-06 |
gpt-4o-2024-05-13 | OpenAI | 128K | $5 | $15 | 2024-11-06 |
gpt-4o-mini | OpenAI | 128K | $0.15 | $0.6 | 2024-11-06 |
gpt-4o-mini-2024-07-18 | OpenAI | 128K | $0.15 | $0.6 | 2024-11-06 |
o1-preview | OpenAI | 128K | $15 | $60 | 2024-11-06 |
o1-preview-2024-09-12 | OpenAI | 128K | $15 | $60 | 2024-11-06 |
o1-mini | OpenAI | 128K | $3 | $12 | 2024-11-06 |
o1-mini-2024-09-12 | OpenAI | 128K | $3 | $12 | 2024-11-06 |
gpt-4o-realtime-preview (Text) | OpenAI | 128K | $5 | $20 | 2024-11-06 |
gpt-4o-realtime-preview (Audio) | OpenAI | 128K | $100 | $200 | 2024-11-06 |
gpt-4o-realtime-preview-2024-10-01 (Text) | OpenAI | 128K | $5 | $20 | 2024-11-06 |
gpt-4o-realtime-preview-2024-10-01 (Audio) | OpenAI | 128K | $100 | $200 | 2024-11-06 |
chatgpt-4o-latest | OpenAI | 128K | $5 | $15 | 2024-11-06 |
gpt-4-turbo | OpenAI | 128K | $10 | $30 | 2024-11-06 |
gpt-4-turbo-2024-04-09 | OpenAI | 128K | $10 | $30 | 2024-11-06 |
gpt-4 | OpenAI | 8K | $30 | $60 | 2024-11-06 |
gpt-4-32k | OpenAI | 32K | $60 | $120 | 2024-11-06 |
gpt-4-0125-preview | OpenAI | 128K | $10 | $30 | 2024-11-06 |
gpt-4-1106-preview | OpenAI | 128K | $10 | $30 | 2024-11-06 |
gpt-4-vision-preview | OpenAI | 128K | $10 | $30 | 2024-11-06 |
gpt-3.5-turbo-0125 | OpenAI | 16K | $0.5 | $1.5 | 2024-11-06 |
gpt-3.5-turbo-instruct | OpenAI | 4K | $1.5 | $2 | 2024-11-06 |
gpt-3.5-turbo-1106 | OpenAI | 16K | $1 | $2 | 2024-11-06 |
gpt-3.5-turbo-0613 | OpenAI | 4K | $1.5 | $2 | 2024-11-06 |
gpt-3.5-turbo-16k-0613 | OpenAI | 16K | $3 | $4 | 2024-11-06 |
gpt-3.5-turbo-0301 | OpenAI | 4K | $1.5 | $2 | 2024-11-06 |
davinci-002 | OpenAI | | $2 | $2 | 2024-11-06 |
babbage-002 | OpenAI | | $0.4 | $0.4 | 2024-11-06 |
Claude 3.5 Sonnet | Anthropic | 200K | $3 | $15 | 2024-11-06 |
Claude 3.5 Haiku | Anthropic | 200K | $1 | $5 | 2024-11-06 |
Claude 3 Opus | Anthropic | 200K | $15 | $75 | 2024-11-06 |
Claude 3 Sonnet | Anthropic | 200K | $3 | $15 | 2024-11-06 |
Claude 3 Haiku | Anthropic | 200K | $0.25 | $1.25 | 2024-11-06 |
Claude 2.1 | Anthropic | 200K | $8 | $24 | 2024-11-06 |
Claude 2 | Anthropic | 100K | $8 | $24 | 2024-11-06 |
Claude Instant 1.2 | Anthropic | 100K | $0.8 | $2.4 | 2024-11-06 |
Gemini 1.5 Pro | Google | 128K | $1.25 | $5 | 2024-11-06 |
Gemini 1.5 Pro | Google | 2M | $2.5 | $10 | 2024-11-06 |
Gemini 1.5 Flash | Google | 128K | $0.075 | $0.3 | 2024-11-06 |
Gemini 1.5 Flash | Google | 1M | $0.15 | $0.6 | 2024-11-06 |
Gemini 1.5 Flash-8B | Google | 128K | $0.0375 | $0.15 | 2024-11-06 |
Gemini 1.5 Flash-8B | Google | 1M | $0.075 | $0.3 | 2024-11-06 |
Command R+ | Cohere | 128K | $2.5 | $10 | 2024-11-06 |
Command R | Cohere | 128K | $0.15 | $0.6 | 2024-11-06 |
Mistral Large 2 | Mistral | 128K | $2 | $6 | 2024-11-06 |
Mistral Small 24.09 | Mistral | 128K | $0.2 | $0.6 | 2024-11-06 |
Codestral | Mistral | 32K | $0.2 | $0.6 | 2024-11-06 |
Ministral 3B 24.10 | Mistral | 128K | $0.04 | $0.04 | 2024-11-06 |
Ministral 8B 24.10 | Mistral | 128K | $0.1 | $0.1 | 2024-11-06 |
FAQ
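What is a token?
A token is a unit of text used by AI models. It can be a word, part of a word, or even a single character. Providers bill per token, with separate rates for input and output, so the number of tokens in a request and its response determines the cost and also affects processing time.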
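To make the link between token counts and cost concrete, here is a minimal Python sketch that estimates the price of a single request from the per-1M-token rates in the table. The `PRICES` dictionary and the `request_cost` function are hypothetical names chosen for illustration, and the token counts in the usage example are made-up inputs rather than measurements.

```python
# Prices in USD per 1M tokens as (input, output), copied from the table.
# PRICES and request_cost are illustrative names, not part of any provider SDK.
PRICES = {
    "gpt-4o": (2.5, 10.0),
    "gpt-4o-mini": (0.15, 0.6),
    "Claude 3.5 Sonnet": (3.0, 15.0),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request for the given model."""
    input_price, output_price = PRICES[model]
    # Rates are quoted per 1M tokens, so scale the raw token counts down.
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Assumed workload: a 2,000-token prompt and a 500-token reply.
    for model in PRICES:
        print(f"{model}: ${request_cost(model, 2_000, 500):.6f}")
```

Because output tokens are priced higher than input tokens for most models in the table, a workload with long responses can cost noticeably more than the input rate alone suggests.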