Model Costs
Current pricing for all supported AI models per 1 million tokens.
note
Prices are subject to change by the model providers. This page reflects our current rate cards and may not include the most recent provider updates.
OpenAI
Language Models
Model | Input Cost (per 1M tokens) | Output Cost (per 1M tokens) |
---|---|---|
gpt-3.5-turbo, gpt-3.5-turbo-0613 | $0.5000 | $1.5000 |
gpt-3.5-turbo-instruct | $1.5000 | $2.0000 |
gpt-4, gpt-4-0613 | $30.0000 | $30.0000 |
gpt-4o-mini, gpt-4o-mini-2024-07-18 | $0.1500 | $0.6000 |
gpt-4-32k | $60.0000 | $120.0000 |
gpt-4-vision-preview, gpt-4-turbo-preview | $10.0000 | $30.0000 |
gpt-4-turbo | $10.0000 | $30.0000 |
gpt-4o-2024-05-13 | $5.0000 | $15.0000 |
gpt-4o-2024-08-06, gpt-4o | $2.5000 | $10.0000 |
o1, o1-2024-12-17 | $15.0000 | $60.0000 |
o1-mini, o1-mini-2024-09-12 | $3.0000 | $12.0000 |
o3-mini, o3-mini-2025-01-31, o3-mini-low, o3-mini-medium, o3-mini-high | $1.1000 | $4.4000 |
o3 | $10.0000 | $40.0000 |
o4-mini, o4-mini-low, o4-mini-medium, o4-mini-high | $1.1000 | $4.4000 |
gpt-4.5-preview | $75.0000 | $150.0000 |
gpt-4.1, gpt-4.1-2025-04-14 | $2.0000 | $8.0000 |
gpt-4.1-mini, gpt-4.1-mini-2025-04-14 | $0.4000 | $1.6000 |
gpt-4.1-nano, gpt-4.1-nano-2025-04-14 | $0.1000 | $0.4000 |
gpt-5, gpt-5-2025-08-07 | $1.2500 | $10.0000 |
gpt-5-mini, gpt-5-mini-2025-08-07 | $0.2500 | $2.0000 |
gpt-5-nano, gpt-5-nano-2025-08-07 | $0.0500 | $0.4000 |
Embedding Models
Model | Cost (per 1M tokens) |
---|---|
text-embedding-3-large | $0.1300 |
text-embedding-4-small | $0.0200 |
text-embedding-ada-002 | $0.1000 |
Anthropic
Language Models
Model | Input Cost (per 1M tokens) | Output Cost (per 1M tokens) |
---|---|---|
claude-opus-4-1-20250805 | $15.0000 | $75.0000 |
claude-opus-4-20250514 | $15.0000 | $75.0000 |
claude-sonnet-4-20250514 | $3.0000 | $15.0000 |
claude-3-5-haiku-20241022 | $0.8000 | $4.0000 |
claude-3-haiku-20240307 | $0.2500 | $1.2500 |
claude-3-sonnet-20240229 | $3.0000 | $15.0000 |
claude-3-5-sonnet-20240620, claude-3-5-sonnet-20241022 | $3.0000 | $15.0000 |
claude-3-7-sonnet-20250219 | $3.0000 | $15.0000 |
claude-3-opus-20240229 | $15.0000 | $75.0000 |
Google Gemini
Language Models
Model | Input Cost (per 1M tokens) | Output Cost (per 1M tokens) |
---|---|---|
gemini-pro | $1.2500 | $3.7500 |
gemini-2.0-flash-exp | $0.0000 | $0.0000 |
gemini-2.0-flash-001 | $0.3750 | $1.5000 |
gemini-2.5-pro-preview-03-25 | $1.2500 | $10.0000 |
gemini-2.5-flash-preview-04-17 | $0.1500 | $3.5000 |
Replicate
Language Models
Model | Input Cost (per 1M tokens) | Output Cost (per 1M tokens) |
---|---|---|
meta/llama-2-70b, meta/llama-2-70b-chat | $0.6500 | $2.7500 |
meta/llama-2-13b, meta/llama-2-13b-chat | $0.1000 | $0.5000 |
meta/llama-2-7b, meta/llama-2-7b-chat | $0.0500 | $0.2500 |
mistralai/mistral-7b-v0.1, mistralai/mistral-7b-instruct-v0.2 | $0.0500 | $0.2500 |
mistralai/mixtral-8x7b-instruct-v0.1 | $0.3000 | $1.0000 |
Groq
Language Models
Model | Input Cost (per 1M tokens) | Output Cost (per 1M tokens) |
---|---|---|
openai/gpt-oss-20b | $0.1000 | $0.5000 |
openai/gpt-oss-120b | $0.1500 | $0.7500 |
moonshotai/kimi-k2-instruct | $1.0000 | $3.0000 |
meta-llama/llama-4-maverick-17b-128e-instruct | $0.5000 | $0.7700 |
meta-llama/llama-4-scout-17b-16e-instruct | $0.1100 | $0.3400 |
deepseek-r1-distill-llama-70b, deepseek-r1-distill-llama-70b-specdec | $8.0000 | $8.0000 |
llama-3.1-405b-reasoning | $0.5900 | $0.7900 |
llama-3.3-70b-versatile, llama-3.1-70b-versatile, llama3-groq-70b-8192-tool-use-preview | $0.5900 | $0.7900 |
llama-3.1-8b-instant, llama3-groq-8b-8192-tool-use-preview | $0.0500 | $0.1000 |
llama3-70b-8192 | $0.5900 | $0.7900 |
llama3-8b-8192 | $0.0500 | $0.1000 |
llama2-70b-4096 | $0.6400 | $0.8000 |
mixtral-8x7b-32768 | $0.2700 | $0.2700 |
gemma-7b-it | $0.1000 | $0.1000 |
OctoAI
Language Models
Model | Input Cost (per 1M tokens) | Output Cost (per 1M tokens) |
---|---|---|
meta-llama-3.1-8b-instruct | $0.1500 | $0.1500 |
meta-llama-3.1-70b-instruct | $0.9000 | $0.9000 |
meta-llama-3-8b-instruct | $0.1500 | $0.1500 |
meta-llama-3-70b-instruct | $0.9000 | $0.9000 |
mistral-7b-instruct | $0.1500 | $0.1500 |
mixtral-8x7b-instruct | $0.4500 | $0.4500 |
nous-hermes-2-mixtral-8x7b-dpo | $0.4500 | $0.4500 |
mixtral-8x22b-instruct | $1.2000 | $1.2000 |
wizardlm-2-8x22b | $1.2000 | $1.2000 |
llamaguard-2-7b | $0.1500 | $0.1500 |
Embedding Models
Model | Cost (per 1M tokens) |
---|---|
gte-large | $0.0500 |
Perplexity
Language Models
Model | Input Cost (per 1M tokens) | Output Cost (per 1M tokens) |
---|---|---|
sonar | $1.0000 | $1.0000 |
sonar-pro | $3.0000 | $15.0000 |
sonar-reasoning | $1.0000 | $5.0000 |
sonar-deep-research | $2.0000 | $8.0000 |
llama-3.1-sonar-small-128k-online | $0.2000 | $0.2000 |
llama-3.1-sonar-large-128k-online | $1.0000 | $1.0000 |
llama-3.1-sonar-huge-128k-online | $5.0000 | $5.0000 |
xAI
Language Models
Model | Input Cost (per 1M tokens) | Output Cost (per 1M tokens) |
---|---|---|
grok-4-0709 | $3.0000 | $15.0000 |
grok-3, grok-3-latest | $3.0000 | $15.0000 |
grok-3-fast, grok-3-fast-latest | $5.0000 | $25.0000 |
grok-3-mini, grok-3-mini-latest | $0.3000 | $0.5000 |
grok-3-mini-fast, grok-3-mini-fast-latest | $0.6000 | $4.0000 |
grok-beta | $5.0000 | $15.0000 |
grok-vision-beta | $5.0000 | $15.0000 |
AWS Bedrock
Language Models
Model | Input Cost (per 1M tokens) | Output Cost (per 1M tokens) |
---|---|---|
ai21.jamba-instruct-v1:0 | $0.5000 | $0.7000 |
ai21.j2-mid-v1 | $12.5000 | $12.5000 |
ai21.j2-ultra-v1 | $18.8000 | $18.8000 |
amazon.titan-text-express-v1, amazon.titan-text-lite-v1, amazon.titan-text-premier-v1:0 | $0.3000 | $0.4000 |
anthropic.claude-v2, anthropic.claude-v2:1, anthropic.claude-2.1 | $8.0000 | $24.0000 |
anthropic.claude-3-sonnet-20240229-v1:0, anthropic.claude-3-5-sonnet-20240620-v1:0, anthropic.claude-3-5-sonnet-20241022-v2:0 | $3.0000 | $15.0000 |
anthropic.claude-3-5-haiku-20241022-v1:0 | $0.8000 | $4.0000 |
anthropic.claude-3-haiku-20240307-v1:0 | $0.2500 | $1.2500 |
anthropic.claude-3-opus-20240229-v1:0 | $15.0000 | $75.0000 |
anthropic.claude-instant-v1 | $0.8000 | $2.4000 |
cohere.command-text-v14 | $1.5000 | $2.0000 |
cohere.command-light-text-v14 | $0.3000 | $0.6000 |
cohere.command-r-v1:0 | $0.5000 | $1.5000 |
cohere.command-r-plus-v1:0 | $3.0000 | $15.0000 |
meta.llama3-8b-instruct-v1:0, meta.llama3-1-8b-instruct-v1:0 | $0.7500 | $1.0000 |
meta.llama3-70b-instruct-v1:0, meta.llama3-1-70b-instruct-v1:0, meta.llama3-1-405b-instruct-v1:0 | $1.9500 | $2.5600 |
mistral.mistral-7b-instruct-v0:2, mistral.mistral-small-2402-v1:0 | $0.1500 | $0.2000 |
mistral.mixtral-8x7b-instruct-v0:1 | $0.4500 | $0.7000 |
mistral.mistral-large-2402-v1:0, mistral.mistral-large-2407-v1:0 | $8.0000 | $24.0000 |
Embedding Models
Model | Cost (per 1M tokens) |
---|---|
cohere.embed-english-v3.0, cohere.embed-multilingual-v3.0 | $0.1000 |
amazon.titan-embed-text-v1, amazon.titan-embed-text-v2:0 | $0.1000 |
Azure
Language Models
Model | Input Cost (per 1M tokens) | Output Cost (per 1M tokens) |
---|---|---|
gpt-4o-mini | $0.1500 | $0.6000 |
gpt-4o | $5.0000 | $15.0000 |
gpt-4 | $30.0000 | $60.0000 |
gpt-35-turbo | $0.5000 | $1.5000 |
gpt-4.1 | $2.0000 | $8.0000 |
gpt-4.1-mini | $0.4000 | $1.6000 |
gpt-4.1-nano | $0.1000 | $0.4000 |
Embedding Models
Model | Cost (per 1M tokens) |
---|---|
text-embedding-3-large | $0.1300 |
text-embedding-3-small | $0.0200 |
Fireworks
Language Models
Model | Input Cost (per 1M tokens) | Output Cost (per 1M tokens) |
---|---|---|
accounts/fireworks/models/gpt-oss-20b | $0.0700 | $0.3000 |
accounts/fireworks/models/gpt-oss-120b | $0.1500 | $0.6000 |
accounts/fireworks/models/llama4-maverick-instruct-basic | $0.2200 | $0.8800 |
accounts/fireworks/models/qwen3-235b-a22b | $0.1000 | $0.1000 |
accounts/fireworks/models/llama4-scout-instruct-basic | $0.1500 | $0.6000 |
accounts/fireworks/models/deepseek-r1 | $3.0000 | $8.0000 |
accounts/fireworks/models/deepseek-v3 | $0.9000 | $0.9000 |
accounts/fireworks/models/deepseek-v3-0324 | $1.2000 | $1.2000 |
accounts/fireworks/models/llama-v3p3-70b-instruct | $0.9000 | $0.9000 |
accounts/fireworks/models/llama-v3p3-70b-instruct | $3.0000 | $3.0000 |
accounts/fireworks/models/llama-v3p1-70b-instruct | $0.9000 | $0.9000 |
Last Updated: 8/28/2025
tip
For the most up-to-date pricing and model availability, please check the individual provider's pricing pages.