Skip to main content

Models

Language Model Providers

Ollama

Model NameModel IDDescriptionMax TokensSupports ImagesSupports JSON SchemaSupports Function Calls
DeepSeek V3deepseek-v3DeepSeek V3 model163840
DeepSeek R1 1.5Bdeepseek-r1:1.5bDeepSeek R1 1.5B Qwen model131072
DeepSeek R1 7Bdeepseek-r1:7bDeepSeek R1 7B Qwen model131072
DeepSeek R1 8Bdeepseek-r1:8bDeepSeek R1 8B Llama model131072
DeepSeek R1 14Bdeepseek-r1:14bDeepSeek R1 14B Qwen model131072
DeepSeek R1 32Bdeepseek-r1:32bDeepSeek R1 32B Qwen model131072
DeepSeek R1 70Bdeepseek-r1:70bDeepSeek R1 70B Llama model131072
DeepSeek R1 671Bdeepseek-r1:671bDeepSeek R1 671B model131072
Llama3 7bllama3:latestLlama 38192
Llama 2-7bllama2:latestLlama 28192
Mistralmistral:latestMistral8192
Code Llamacodellama:7b-codeCode Llama8192

Replicate

Model NameModel IDDescriptionMax TokensSupports ImagesSupports JSON SchemaSupports Function Calls
Mixtral 8x7b instructmistralai/mixtral-8x7b-instruct-v0.1Mixtral 8x7b instruct128000
Mistral 7b instruct v0.2mistralai/mistral-7b-instruct-v0.2The Mistral-7B-Instruct-v0.2 Large Language Model (LLM) is an improved instruct fine-tuned version of Mistral-7B-Instruct-v0.1.128000
Mistral 7b instruct v0.1mistral-7b-instruct-v0.1An instruction-tuned 7 billion parameter language model from Mistral128000
Mixtral 8x7b instruct v0.1mistralai/mixtral-8x7b-instruct-v0.1The Mixtral-8x7B-instruct-v0.1 Large Language Model (LLM) is a pretrained generative Sparse Mixture of Experts tuned to be a helpful assistant.128000
Llama 2 13b chatmeta/llama-2-13b-chatA 13 billion parameter language model from Meta, fine tuned for chat completions128000
Llama 2 70b chatmeta/llama-2-70b-chatA 70 billion parameter language model from Meta, fine tuned for chat completions128000

OpenAI

Model NameModel IDDescriptionMax TokensSupports ImagesSupports JSON SchemaSupports Function Calls
4.1gpt-4.1OpenAI's flagship model for complex tasks. It is well suited for problem solving across domains.1047576
4.1 minigpt-4.1-miniGPT 4.1 mini provides a balance between intelligence, speed, and cost that makes it an attractive model for many use cases.1047576
4.1 nanogpt-4.1-nanoGPT-4.1 nano is the fastest, most cost-effective GPT 4.1 model.1047576
4ogpt-4oAdvanced, multimodal flagship model that's cheaper and faster than GPT-4 Turbo128000
4o-minigpt-4o-miniAffordable and intelligent small model for fast, lightweight tasks. GPT-4o mini is cheaper and more capable than GPT-3.5 Turbo. Currently points to gpt-4o-mini-2024-07-18.128000
o3-mini (Low Reasoning)o3-mini-lowFast and efficient o3-mini model with low reasoning effort. Optimized for quick responses with basic reasoning.200000
o3-mini (Medium Reasoning)o3-mini-mediumBalanced o3-mini model with medium reasoning effort. Good for general-purpose tasks requiring moderate analysis.200000
o3-mini (High Reasoning)o3-mini-highThorough o3-mini model with high reasoning effort. Best for complex tasks requiring deep analysis.200000
o3o3o3 is a powerful reasoning model designed for complex problem-solving across domains. It combines advanced reasoning capabilities with high performance for demanding tasks.200000
o4-mini (Low Reasoning)o4-mini-lowFast and efficient o4-mini model with low reasoning effort. Optimized for quick responses with basic reasoning.200000
o4-mini (Medium Reasoning)o4-mini-mediumBalanced o4-mini model with medium reasoning effort. Good for general-purpose tasks requiring moderate analysis.200000
o4-mini (High Reasoning)o4-mini-highThorough o4-mini model with high reasoning effort. Best for complex tasks requiring deep analysis.200000
o4-minio4-minio4-mini is a compact and efficient model that delivers strong performance for a wide range of tasks. It offers a good balance of capabilities and resource efficiency.200000
o1o1o1 is a reasoning model designed to solve hard problems across domains. The o1 series of models are trained with reinforcement learning to perform complex reasoning. o1 models think before they answer, producing a long internal chain of thought before responding to the user.200000
o1-minio1-minio1-mini is a fast and affordable reasoning model for specialized tasks. The o1-mini series of models are trained with reinforcement learning to perform complex reasoning. o1-mini models think before they answer, producing a long internal chain of thought before responding to the user.128000
4.5gpt-4.5-previewThis is a research preview of GPT-4.5, OpenAI's largest and most capable GPT model yet. Its deep world knowledge and better understanding of user intent makes it good at creative tasks and agentic planning.128000
4.1 2025-04-14gpt-4.1-2025-04-14OpenAI's flagship model for complex tasks. It is well suited for problem solving across domains.1047576
4.1 mini 2025-04-14gpt-4.1-mini-2025-04-14GPT 4.1 mini provides a balance between intelligence, speed, and cost that makes it an attractive model for many use cases.1047576
4.1 nano 2025-04-14gpt-4.1-nano-2025-04-14GPT-4.1 nano is the fastest, most cost-effective GPT 4.1 model.1047576
4o 2024-08-06gpt-4o-2024-08-062024-08-06 version of gpt-4o128000
4o-mini 2024-07-18gpt-4o-mini-2024-07-182024-07-18 version of gpt-4o-mini128000
o1 2024-12-17o1-2024-12-172024-12-17 version of o1200000
o1-mini 2024-09-12o1-mini-2024-09-122024-09-12 version of o1-mini128000
4 Turbogpt-4-turboThe latest GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more.128000
4 Turbo Previewgpt-4-turbo-previewThe latest GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Returns a maximum of 4,096 output tokens. This preview model is not yet suited for production traffic.128000
4 Visiongpt-4-vision-previewGPT-4 with the ability to understand images, in addition to all other GPT-4 Turbo capabilities.128000
4gpt-4More capable than any GPT-3.5 model, able to do more complex tasks, and optimized for chat. Will be updated with our latest model iteration.8192
4 32Kgpt-4-32kSame capabilities as the base gpt-4 mode but with 4x the context length. Will be updated with our latest model iteration.32768
4 Turbo 2024-04-09gpt-4-turbo-2024-04-09Advanced, multimodal flagship model that's cheaper and faster than GPT-4 Turbo128000
3.5 Turbogpt-3.5-turboMost capable GPT-3.5 model and optimized for chat at 1/10th the cost of text-davinci-003. Will be updated with our latest model iteration.4096
3.5 Turbo 16Kgpt-3.5-turbo-16kSame capabilities as the base gpt-3.5-turbo model but with 4x the context length. Will be updated with our latest model iteration.16384
4 0613gpt-4-0613More capable than any GPT-3.5 model, able to do more complex tasks, and optimized for chat. Will be updated with our latest model iteration.8192
3.5 Turbo 0613gpt-3.5-turbo-0613Most capable GPT-3.5 model and optimized for chat at 1/10th the cost of text-davinci-003. Will be updated with our latest model iteration.4096

Groq

Model NameModel IDDescriptionMax TokensSupports ImagesSupports JSON SchemaSupports Function Calls
GPT-OSS 20Bopenai/gpt-oss-20bOpenAI's flagship open source model, built on a Mixture-of-Experts (MoE) architecture with 20 billion parameters and 32 experts. Features tool use, browser search, code execution, JSON object mode, and reasoning capabilities.131072
GPT-OSS 120Bopenai/gpt-oss-120bOpenAI's flagship open source model, built on a Mixture-of-Experts (MoE) architecture with 20 billion parameters and 128 experts. Features tool use, browser search, code execution, JSON object mode, and reasoning capabilities.131072
Llama 4 Maverickmeta-llama/llama-4-maverick-17b-128e-instructLlama 4 Maverick131072
Llama 4 Scoutmeta-llama/llama-4-scout-17b-16e-instructLlama 4 Scout131072
DeepSeek R1 Distilled Llama 70Bdeepseek-r1-distill-llama-70bDeepSeek R1 Distilled Llama 70B128000
DeepSeek R1 Distilled Llama 70B SpecDecdeepseek-r1-distill-llama-70b-specdecDeepSeek R1 Distilled Llama 70B SpecDec128000
Llama 3.1 405B Reasoningllama-3.1-405b-reasoningLlama 3.1 405B Reasoning131072
Llama 3.3 70B Versatilellama-3.3-70b-versatileLlama 3.3 70B Versatile32768
Llama 3.3 70B SpecDecllama-3.3-70b-specdecLlama 3.3 70B SpecDec8192
Llama 3.1 70B Versatile (Tool Use Preview)llama3-groq-70b-8192-tool-use-previewLlama 3.1 70B Versatile (Tool Use Preview)8192
Llama 3.1 70B Versatilellama-3.1-70b-versatileLlama 3.1 70B Versatile131072
Llama 3.1 8B Instant (Tool Use Preview)llama3-groq-8b-8192-tool-use-previewLlama 3.1 8B Instant (Tool Use Preview)8192
Llama 3.1 8B Instantllama-3.1-8b-instantLlama 3.1 8B Instant131072
LLaMA3-70bllama3-70b-8192LLaMA3-70b8192
LLaMA3-8bllama3-8b-8192LLaMA3-8b8192
LLaMA2-70bllama2-70b-4096LLaMA2-70b4096
Mixtral-8x7bmixtral-8x7b-32768Mixtral-8x7b32768
Gemma-7b-itgemma-7b-itGemma-7b-it8192

Google Generative AI

Model NameModel IDDescriptionMax TokensSupports ImagesSupports JSON SchemaSupports Function Calls
Gemini 2.5 Pro Preview 03-25gemini-2.5-pro-preview-03-25Gemini 2.5 Pro Preview 03-251000000
Gemini 2.5 Flash Preview 04-17gemini-2.5-flash-preview-04-17Gemini 2.5 Flash Preview 04-171000000
Gemini 2.0 Flashgemini-2.0-flash-001Gemini 2.0 Flash1000000
Gemini 2.0 Flash Experimentalgemini-2.0-flash-expGemini 2.0 Flash Experimental1000000
Gemini 1.0 Progemini-proGemini 1.0 Pro32000

Anthropic Claude

Model NameModel IDDescriptionMax TokensSupports ImagesSupports JSON SchemaSupports Function Calls
Claude Opus 4.1claude-opus-4-1-20250805Anthropic's most capable and intelligent model yet. Claude Opus 4.1 sets new standards in complex reasoning and advanced coding.200000
Claude Opus 4claude-opus-4-20250514Anthropic's most capable model with highest level of intelligence and capability. Features extended thinking and priority tier access.200000
Claude Sonnet 4claude-sonnet-4-20250514Anthropic's high-performance model with balanced intelligence and speed. Features extended thinking and priority tier access.200000
Claude 3.7 Sonnetclaude-3-7-sonnet-20250219Anthropic's most intelligent model. Highest level of intelligence and capability with toggleable extended thinking. This is the latest version of the model.200000
Claude 3.5 Sonnet (V2)claude-3-5-sonnet-20241022Anthropic's previous most intelligent model. High level of intelligence and capability.200000
Claude 3.5 Sonnet (V1)claude-3-5-sonnet-20240620Anthropic's previous most intelligent model. High level of intelligence and capability.200000
Claude 3.5 Haikuclaude-3-5-haiku-20241022Anthropic's fastest model that can execute lightweight actions, with industry-leading speed.200000
Claude 3 Opusclaude-3-opus-20240229Most powerful model for highly complex tasks, offering top-level performance with multilingual and vision capabilities.200000
Claude 3 Sonnetclaude-3-sonnet-20240229Ideal balance of intelligence and speed for enterprise workloads, with multilingual and vision support.200000
Claude 3 Haikuclaude-3-haiku-20240307Fastest and most compact model for near-instant responsiveness, includes multilingual and vision capabilities.200000

OctoAI

Model NameModel IDDescriptionMax TokensSupports ImagesSupports JSON SchemaSupports Function Calls
Llama-3.1-Instruct (8B)meta-llama-3.1-8b-instructMeta's Llama-3.1-Instruct model with 8 billion parameters for chat use cases.131072
Llama-3.1-Instruct (70B)meta-llama-3.1-70b-instructMeta's Llama-3.1-Instruct model with 70 billion parameters for chat use cases.131072
Llama3-Instruct (8B)meta-llama-3-8b-instructMeta's Llama3-Instruct model with 8 billion parameters for chat use cases.8192
Llama3-Instruct (70B)meta-llama-3-70b-instructMeta's Llama3-Instruct model with 70 billion parameters for chat use cases.8192
Mistral Instruct v0.3 (7B)mistral-7b-instructMistral's Instruct v0.3 model with 7 billion parameters for chat and coding use cases.32768
Mixtral Instruct (8x7B)mixtral-8x7b-instructMistral's Mixtral Instruct model with 8x7 billion parameters for chat and coding use cases.32768
Nous Hermes 2 Mixtral DPO (8x7B)nous-hermes-2-mixtral-8x7b-dpoNous Research's Hermes 2 Mixtral DPO model with 8x7 billion parameters for content moderation.32768
Mixtral Instruct (8x22B)mixtral-8x22b-instructMistral's Mixtral Instruct model with 8x22 billion parameters for chat and coding use cases.65536
WizardLM-2 (8x22B)wizardlm-2-8x22bMicrosoft's WizardLM-2 model with 8x22 billion parameters for chat and coding use cases.65536
Llama Guard 2llamaguard-2-7bMeta's Llama Guard 2 model with 7 billion parameters for content moderation.4096

Perplexity AI

Model NameModel IDDescriptionMax TokensSupports ImagesSupports JSON SchemaSupports Function Calls
Llama-3.1-Sonar-Small (8B)llama-3.1-sonar-small-128k-onlineMeta's Llama-3.1-Sonar-Small model with 8 billion parameters for chat use cases.127072
Llama-3.1-Sonar-Large (70B)llama-3.1-sonar-large-128k-onlineMeta's Llama-3.1-Sonar-Large model with 70 billion parameters for chat use cases.127072
Llama-3.1-Sonar-Huge (405B)llama-3.1-sonar-huge-128k-onlineMeta's Llama-3.1-Sonar-Huge model with 405 billion parameters for chat use cases.127072

Amazon Bedrock

Model NameModel IDDescriptionMax TokensSupports ImagesSupports JSON SchemaSupports Function Calls
Claude 3.7 Sonnetanthropic.claude-3-7-sonnet-20250219-v1:0Anthropic's Claude 3.7 Sonnet model on Amazon Bedrock200000
Claude 3.5 Sonnet (V2)anthropic.claude-3-5-sonnet-20241022-v2:0Anthropic's Claude 3.5 Sonnet model on Amazon Bedrock200000
Claude 3.5 Sonnetanthropic.claude-3-5-sonnet-20240620-v1:0Anthropic's Claude 3.5 Sonnet model on Amazon Bedrock200000
Claude 3 Sonnetanthropic.claude-3-sonnet-20240229-v1:0Anthropic's Claude 3 Sonnet model on Amazon Bedrock200000
Claude 3.5 Haikuanthropic.claude-3-5-haiku-20241022-v1:0Anthropic's Claude 3.5 Haiku model on Amazon Bedrock200000
Claude 3 Haikuanthropic.claude-3-haiku-20240307-v1:0Anthropic's Claude 3 Haiku model on Amazon Bedrock200000
Claude 3 Opusanthropic.claude-3-opus-20240229-v1:0Anthropic's Claude 3 Opus model on Amazon Bedrock200000
Llama 3 8B Instructmeta.llama3-8b-instruct-v1:0Meta's Llama 3 8B Instruct model on Amazon Bedrock4096
Llama 3 70B Instructmeta.llama3-70b-instruct-v1:0Meta's Llama 3 70B Instruct model on Amazon Bedrock4096
Llama 3.1 8B Instructmeta.llama3-1-8b-instruct-v1:0Meta's Llama 3.1 8B Instruct model on Amazon Bedrock128000
Llama 3.1 70B Instructmeta.llama3-1-70b-instruct-v1:0Meta's Llama 3.1 70B Instruct model on Amazon Bedrock128000
Llama 3.1 405B Instructmeta.llama3-1-405b-instruct-v1:0Meta's Llama 3.1 405B Instruct model on Amazon Bedrock128000
Llama 3.2 1B Instructus.meta.llama3-2-1b-instruct-v1:0Meta's Llama 3.2 1B Instruct model on Amazon Bedrock128000
Llama 3.2 3B Instructus.meta.llama3-2-3b-instruct-v1:0Meta's Llama 3.2 3B Instruct model on Amazon Bedrock128000
Llama 3.2 11B Instructus.meta.llama3-2-11b-instruct-v1:0Meta's Llama 3.2 11B Instruct model on Amazon Bedrock128000
Llama 3.2 90B Instructus.meta.llama3-2-90b-instruct-v1:0Meta's Llama 3.2 90B Instruct model on Amazon Bedrock128000

Azure OpenAI

Model NameModel IDDescriptionMax TokensSupports ImagesSupports JSON SchemaSupports Function Calls
GPT-4.1gpt-4.1Most capable GPT-4.1 model for tasks requiring deep understanding and advanced reasoning.1047576
GPT-4.1 Minigpt-4.1-miniSmaller, faster version of GPT-4.1 optimized for efficiency.1047576
GPT-4.1 Nanogpt-4.1-nanoSmallest version of GPT-4.1 optimized for speed and cost efficiency.1047576
GPT-4ogpt-4oLatest large GA model with structured outputs, text/image processing, enhanced accuracy and superior performance in non-English languages and vision tasks.128000
GPT-4o minigpt-4o-miniLatest small GA model optimized for fast, inexpensive tasks. Supports text and image processing, JSON Mode, and parallel function calling.128000
o1o1o1 is a reasoning model designed to solve hard problems across domains. The o1 series of models are trained with reinforcement learning to perform complex reasoning. o1 models think before they answer, producing a long internal chain of thought before responding to the user.200000
o1-minio1-minio1-mini is a fast and affordable reasoning model for specialized tasks. The o1-mini series of models are trained with reinforcement learning to perform complex reasoning. o1-mini models think before they answer, producing a long internal chain of thought before responding to the user.128000
GPT-4gpt-4Most capable GPT-4 model for tasks requiring deep understanding and advanced reasoning.8192
GPT-3.5 Turbogpt-35-turboMost capable GPT-3.5 model, optimized for chat at 1/10th the cost of GPT-4.16385

xAI

Model NameModel IDDescriptionMax TokensSupports ImagesSupports JSON SchemaSupports Function Calls
Grok 3grok-3Grok 3 model with high performance capabilities. Choose this for reduced cost compared to grok-3-fast.131072
Grok 3 Latestgrok-3-latestLatest version of Grok 3 model with high performance capabilities.131072
Grok 3 Fastgrok-3-fastSame as Grok 3 model but optimized for latency-sensitive applications. Choose this for better response time at higher cost.131072
Grok 3 Fast Latestgrok-3-fast-latestLatest faster version of Grok 3 model with optimized response time.131072
Grok 3 Minigrok-3-miniLightweight version of Grok 3 model with lower cost and good performance.131072
Grok 3 Mini Latestgrok-3-mini-latestLatest lightweight version of Grok 3 model with lower cost and good performance.131072
Grok 3 Mini Fastgrok-3-mini-fastFaster lightweight version of Grok 3 model with balanced cost and performance.131072
Grok 3 Mini Fast Latestgrok-3-mini-fast-latestLatest faster lightweight version of Grok 3 model with balanced cost and performance.131072
Grok Betagrok-betaComparable performance to Grok 2 but with improved efficiency, speed and capabilities.131072
Grok Vision Betagrok-vision-betaComparable performance to Grok 2 but with improved efficiency, speed and capabilities and with ability to process images.8192

Fireworks

Model NameModel IDDescriptionMax TokensSupports ImagesSupports JSON SchemaSupports Function Calls
GPT-OSS 20Baccounts/fireworks/models/gpt-oss-20bA compact, open-weight language model optimized for low-latency and resource-constrained environments, including local and edge deployments. It shares the same Harmony training foundation and capabilities as 120B, with faster inference and easier deployment that is ideal for specialized or offline use cases, fast responsive performance, chain-of-thought output and adjustable reasoning levels, and agentic workflows.131072
GPT-OSS 120Baccounts/fireworks/models/gpt-oss-120bA high-performance, open-weight language model designed for production-grade, general-purpose use cases. It fits on a single H100 GPU, making it accessible without requiring multi-GPU infrastructure. Trained on the Harmony response format, it excels at complex reasoning and supports configurable reasoning effort, full chain-of-thought transparency for easier debugging and trust, and native agentic capabilities for function calling, tool use, and structured outputs.131072
Llama 4 Maverick Instruct (Basic)accounts/fireworks/models/llama4-maverick-instruct-basicThe Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.1000000
Llama 4 Scout Instruct (Basic)accounts/fireworks/models/llama4-scout-instruct-basicThe Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.128000
Qwen3 235B A22Baccounts/fireworks/models/qwen3-235b-a22bQwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models32768
DeepSeek R1accounts/fireworks/models/deepseek-r1DeepSeek R1 is a large language model optimized for instruction following and coding tasks.160000
DeepSeek V3 03-24accounts/fireworks/models/deepseek-v3-0324DeepSeek V3 is a large language model optimized for instruction following. This model is the version of the DeepSeek V3 model as of 3/24/2025.128000
DeepSeek V3accounts/fireworks/models/deepseek-v3DeepSeek V3 is a large language model optimized for instruction following.128000
Llama 3.3 70B Instructaccounts/fireworks/models/llama-v3p3-70b-instructLlama 3.3 70B Instruct is a large language model that is optimized for instruction following.128000
Llama 3.1 405B Instructaccounts/fireworks/models/llama-v3p1-405b-instructLlama 3.1 405B Instruct is a large language model that is optimized for instruction following.128000
Llama 3.1 70B Instructaccounts/fireworks/models/llama-v3p1-70b-instructLlama 3.1 70B Instruct is a large language model that is optimized for instruction following.128000

Embedding Models

OpenAI

Model NameModel IDDescriptionMax TokensMax Output DimensionsSupports Reduced Dimensions
Text Embedding Ada 002text-embedding-ada-002Text Embedding Ada 00281911536
Text Embedding 3 Smalltext-embedding-3-smallIncreased performance over 2nd generation ada embedding model81911536
Text Embedding 3 Largetext-embedding-3-largeMost capable embedding model for both english and non-english tasks81913072

Cohere

Model NameModel IDDescriptionMax TokensMax Output DimensionsSupports Reduced Dimensions
Embed English v3.0embed-english-v3.0A model that allows for text to be classified or turned into embeddings. English only.5121024
Embed English Light v3.0embed-english-light-v3.0A smaller, faster version of embed-english-v3.0. Almost as capable, but a lot faster. English only.512384
Embed English v2.0embed-english-v2.0Our older embeddings model that allows for text to be classified or turned into embeddings. English only5124096
Embed English Light v2.0embed-english-light-v2.0A smaller, faster version of embed-english-v2.0. Almost as capable, but a lot faster. English only.5121024
Embed Multilingual v3.0embed-multilingual-v3.0Provides multilingual classification and embedding support. See supported languages here.5121024
Embed Multilingual Light v3.0embed-multilingual-light-v3.0A smaller, faster version of embed-multilingual-v3.0. Almost as capable, but a lot faster. Supports multiple languages.512384
Embed Multilingual v2.0embed-multilingual-v2.0Provides multilingual classification and embedding support. See supported languages here.256768

Amazon Bedrock

Model NameModel IDDescriptionMax TokensMax Output DimensionsSupports Reduced Dimensions
Cohere Embed Englishcohere.embed-english-v3Cohere English Embedding Model hosted on AWS Bedrock5121024
Cohere Embed Multilingualcohere.embed-multilingual-v3Cohere Multilingual Embedding Model hosted on AWS Bedrock5121024
Amazon Titan Embeddings G1 - Textamazon.titan-embed-text-v1Amazon's G1 Test Embedding Model hosted on AWS Bedrock81921024
Amazon Titan Embeddings V2 - Textamazon.titan-embed-text-v2:0Amazon's G2 Text Embedding Model hosted on AWS Bedrock81921024

Azure OpenAI

Model NameModel IDDescriptionMax TokensMax Output DimensionsSupports Reduced Dimensions
OpenAI embedding Largetext-embedding-3-largeOpenAI's Large Text Embedding Model hosted on Microsoft Azure81923072
OpenAI embedding Smalltext-embedding-3-smallOpenAI's Small Text Embedding Model hosted on Microsoft Azure81921536