Model Costs

Current pricing for all supported AI models, listed in USD per 1 million tokens.

Note: Prices are subject to change by the model providers. This page reflects our current rate cards and may not include the most recent provider updates.
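
Because every rate is quoted per 1 million tokens, the cost of a single request is each token count multiplied by its rate and divided by 1,000,000. The following is a minimal Python sketch of that arithmetic. It assumes billing is linear in tokens, with no caching discounts, minimum charges, or other adjustments, since this page does not describe how billing is computed. `RATES` and `estimate_cost` are hypothetical names used for illustration, and the two sample rates are copied from the tables on this page.

```python
from decimal import Decimal

# All rates on this page are quoted per 1 million tokens.
TOKENS_PER_UNIT = Decimal(1_000_000)

# Hypothetical lookup table: model -> (input rate, output rate) in USD per 1M tokens.
# The values are copied from the tables on this page and may be out of date.
RATES = {
    "gpt-4o-mini": (Decimal("0.15"), Decimal("0.60")),
    "claude-sonnet-4-20250514": (Decimal("3.00"), Decimal("15.00")),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request, assuming purely linear per-token billing."""
    input_rate, output_rate = RATES[model]
    total = input_tokens * input_rate + output_tokens * output_rate
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # 12,000 input tokens and 3,000 output tokens on gpt-4o-mini
    print(estimate_cost("gpt-4o-mini", 12_000, 3_000))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding error when summed over many requests.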

OpenAI

Language Models

Model	Input Cost (per 1M tokens)	Output Cost (per 1M tokens)
gpt-3.5-turbo, gpt-3.5-turbo-0613	$0.5000	$1.5000
gpt-3.5-turbo-instruct	$1.5000	$2.0000
gpt-4, gpt-4-0613	$30.0000	$60.0000
gpt-4o-mini, gpt-4o-mini-2024-07-18	$0.1500	$0.6000
gpt-4-32k	$60.0000	$120.0000
gpt-4-vision-preview, gpt-4-turbo-preview	$10.0000	$30.0000
gpt-4-turbo	$10.0000	$30.0000
gpt-4o-2024-05-13	$5.0000	$15.0000
gpt-4o-2024-08-06, gpt-4o	$2.5000	$10.0000
o1, o1-2024-12-17	$15.0000	$60.0000
o1-mini, o1-mini-2024-09-12	$3.0000	$12.0000
o3-mini, o3-mini-2025-01-31, o3-mini-low, o3-mini-medium, o3-mini-high	$1.1000	$4.4000
o3	$10.0000	$40.0000
o4-mini, o4-mini-low, o4-mini-medium, o4-mini-high	$1.1000	$4.4000
gpt-4.5-preview	$75.0000	$150.0000
gpt-4.1, gpt-4.1-2025-04-14	$2.0000	$8.0000
gpt-4.1-mini, gpt-4.1-mini-2025-04-14	$0.4000	$1.6000
gpt-4.1-nano, gpt-4.1-nano-2025-04-14	$0.1000	$0.4000
gpt-5, gpt-5-2025-08-07	$1.2500	$10.0000
gpt-5-mini, gpt-5-mini-2025-08-07	$0.2500	$2.0000
gpt-5-nano, gpt-5-nano-2025-08-07	$0.0500	$0.4000

Embedding Models

Model	Cost (per 1M tokens)
text-embedding-3-large	$0.1300
text-embedding-3-small	$0.0200
text-embedding-ada-002	$0.1000

Anthropic

Language Models

Model	Input Cost (per 1M tokens)	Output Cost (per 1M tokens)
claude-opus-4-1-20250805	$15.0000	$75.0000
claude-opus-4-20250514	$15.0000	$75.0000
claude-sonnet-4-20250514	$3.0000	$15.0000
claude-3-5-haiku-20241022	$0.8000	$4.0000
claude-3-haiku-20240307	$0.2500	$1.2500
claude-3-sonnet-20240229	$3.0000	$15.0000
claude-3-5-sonnet-20240620, claude-3-5-sonnet-20241022	$3.0000	$15.0000
claude-3-7-sonnet-20250219	$3.0000	$15.0000
claude-3-opus-20240229	$15.0000	$75.0000

Google Gemini

Language Models

Model	Input Cost (per 1M tokens)	Output Cost (per 1M tokens)
gemini-pro	$1.2500	$3.7500
gemini-2.0-flash-exp	$0.0000	$0.0000
gemini-2.0-flash-001	$0.3750	$1.5000
gemini-2.5-pro-preview-03-25	$1.2500	$10.0000
gemini-2.5-flash-preview-04-17	$0.1500	$3.5000

Replicate

Language Models

Model	Input Cost (per 1M tokens)	Output Cost (per 1M tokens)
meta/llama-2-70b, meta/llama-2-70b-chat	$0.6500	$2.7500
meta/llama-2-13b, meta/llama-2-13b-chat	$0.1000	$0.5000
meta/llama-2-7b, meta/llama-2-7b-chat	$0.0500	$0.2500
mistralai/mistral-7b-v0.1, mistralai/mistral-7b-instruct-v0.2	$0.0500	$0.2500
mistralai/mixtral-8x7b-instruct-v0.1	$0.3000	$1.0000

Groq

Language Models

Model	Input Cost (per 1M tokens)	Output Cost (per 1M tokens)
openai/gpt-oss-20b	$0.1000	$0.5000
openai/gpt-oss-120b	$0.1500	$0.7500
moonshotai/kimi-k2-instruct	$1.0000	$3.0000
meta-llama/llama-4-maverick-17b-128e-instruct	$0.5000	$0.7700
meta-llama/llama-4-scout-17b-16e-instruct	$0.1100	$0.3400
deepseek-r1-distill-llama-70b, deepseek-r1-distill-llama-70b-specdec	$8.0000	$8.0000
llama-3.1-405b-reasoning	$0.5900	$0.7900
llama-3.3-70b-versatile, llama-3.1-70b-versatile, llama3-groq-70b-8192-tool-use-preview	$0.5900	$0.7900
llama-3.1-8b-instant, llama3-groq-8b-8192-tool-use-preview	$0.0500	$0.1000
llama3-70b-8192	$0.5900	$0.7900
llama3-8b-8192	$0.0500	$0.1000
llama2-70b-4096	$0.6400	$0.8000
mixtral-8x7b-32768	$0.2700	$0.2700
gemma-7b-it	$0.1000	$0.1000

OctoAI

Language Models

Model	Input Cost (per 1M tokens)	Output Cost (per 1M tokens)
meta-llama-3.1-8b-instruct	$0.1500	$0.1500
meta-llama-3.1-70b-instruct	$0.9000	$0.9000
meta-llama-3-8b-instruct	$0.1500	$0.1500
meta-llama-3-70b-instruct	$0.9000	$0.9000
mistral-7b-instruct	$0.1500	$0.1500
mixtral-8x7b-instruct	$0.4500	$0.4500
nous-hermes-2-mixtral-8x7b-dpo	$0.4500	$0.4500
mixtral-8x22b-instruct	$1.2000	$1.2000
wizardlm-2-8x22b	$1.2000	$1.2000
llamaguard-2-7b	$0.1500	$0.1500

Embedding Models

Model	Cost (per 1M tokens)
gte-large	$0.0500

Perplexity

Language Models

Model	Input Cost (per 1M tokens)	Output Cost (per 1M tokens)
sonar	$1.0000	$1.0000
sonar-pro	$3.0000	$15.0000
sonar-reasoning	$1.0000	$5.0000
sonar-deep-research	$2.0000	$8.0000
llama-3.1-sonar-small-128k-online	$0.2000	$0.2000
llama-3.1-sonar-large-128k-online	$1.0000	$1.0000
llama-3.1-sonar-huge-128k-online	$5.0000	$5.0000

xAI

Language Models

Model	Input Cost (per 1M tokens)	Output Cost (per 1M tokens)
grok-4-0709	$3.0000	$15.0000
grok-3, grok-3-latest	$3.0000	$15.0000
grok-3-fast, grok-3-fast-latest	$5.0000	$25.0000
grok-3-mini, grok-3-mini-latest	$0.3000	$0.5000
grok-3-mini-fast, grok-3-mini-fast-latest	$0.6000	$4.0000
grok-beta	$5.0000	$15.0000
grok-vision-beta	$5.0000	$15.0000

AWS Bedrock

Language Models

Model	Input Cost (per 1M tokens)	Output Cost (per 1M tokens)
ai21.jamba-instruct-v1:0	$0.5000	$0.7000
ai21.j2-mid-v1	$12.5000	$12.5000
ai21.j2-ultra-v1	$18.8000	$18.8000
amazon.titan-text-express-v1, amazon.titan-text-lite-v1, amazon.titan-text-premier-v1:0	$0.3000	$0.4000
anthropic.claude-v2, anthropic.claude-v2:1, anthropic.claude-2.1	$8.0000	$24.0000
anthropic.claude-3-sonnet-20240229-v1:0, anthropic.claude-3-5-sonnet-20240620-v1:0, anthropic.claude-3-5-sonnet-20241022-v2:0	$3.0000	$15.0000
anthropic.claude-3-5-haiku-20241022-v1:0	$0.8000	$4.0000
anthropic.claude-3-haiku-20240307-v1:0	$0.2500	$1.2500
anthropic.claude-3-opus-20240229-v1:0	$15.0000	$75.0000
anthropic.claude-instant-v1	$0.8000	$2.4000
cohere.command-text-v14	$1.5000	$2.0000
cohere.command-light-text-v14	$0.3000	$0.6000
cohere.command-r-v1:0	$0.5000	$1.5000
cohere.command-r-plus-v1:0	$3.0000	$15.0000
meta.llama3-8b-instruct-v1:0, meta.llama3-1-8b-instruct-v1:0	$0.7500	$1.0000
meta.llama3-70b-instruct-v1:0, meta.llama3-1-70b-instruct-v1:0, meta.llama3-1-405b-instruct-v1:0	$1.9500	$2.5600
mistral.mistral-7b-instruct-v0:2, mistral.mistral-small-2402-v1:0	$0.1500	$0.2000
mistral.mixtral-8x7b-instruct-v0:1	$0.4500	$0.7000
mistral.mistral-large-2402-v1:0, mistral.mistral-large-2407-v1:0	$8.0000	$24.0000

Embedding Models

Model	Cost (per 1M tokens)
cohere.embed-english-v3.0, cohere.embed-multilingual-v3.0	$0.1000
amazon.titan-embed-text-v1, amazon.titan-embed-text-v2:0	$0.1000

Azure

Language Models

Model	Input Cost (per 1M tokens)	Output Cost (per 1M tokens)
gpt-4o-mini	$0.1500	$0.6000
gpt-4o	$5.0000	$15.0000
gpt-4	$30.0000	$60.0000
gpt-35-turbo	$0.5000	$1.5000
gpt-4.1	$2.0000	$8.0000
gpt-4.1-mini	$0.4000	$1.6000
gpt-4.1-nano	$0.1000	$0.4000

Embedding Models

Model	Cost (per 1M tokens)
text-embedding-3-large	$0.1300
text-embedding-3-small	$0.0200

Fireworks

Language Models

Model	Input Cost (per 1M tokens)	Output Cost (per 1M tokens)
accounts/fireworks/models/gpt-oss-20b	$0.0700	$0.3000
accounts/fireworks/models/gpt-oss-120b	$0.1500	$0.6000
accounts/fireworks/models/llama4-maverick-instruct-basic	$0.2200	$0.8800
accounts/fireworks/models/qwen3-235b-a22b	$0.1000	$0.1000
accounts/fireworks/models/llama4-scout-instruct-basic	$0.1500	$0.6000
accounts/fireworks/models/deepseek-r1	$3.0000	$8.0000
accounts/fireworks/models/deepseek-v3	$0.9000	$0.9000
accounts/fireworks/models/deepseek-v3-0324	$1.2000	$1.2000
accounts/fireworks/models/llama-v3p3-70b-instruct	$0.9000	$0.9000
accounts/fireworks/models/llama-v3p1-405b-instruct	$3.0000	$3.0000
accounts/fireworks/models/llama-v3p1-70b-instruct	$0.9000	$0.9000

Last Updated: 8/28/2025

Tip: For the most up-to-date pricing and model availability, check each provider's pricing page.
