Models
Language Model Providers
Ollama
Model Name | Model ID | Description | Max Tokens | Supports Images | Supports JSON Schema | Supports Function Calls |
---|---|---|---|---|---|---|
Llama3 7b | llama3:latest | Llama 3 | 8192 | ❌ | ❌ | ❌ |
Llama 2-7b | llama2:latest | Llama 2 | 8192 | ❌ | ❌ | ❌ |
Mistral | mistral:latest | Mistral | 8192 | ❌ | ❌ | ❌ |
Code Llama | codellama:7b-code | Code Llama | 8192 | ❌ | ❌ | ❌ |
Replicate
Model Name | Model ID | Description | Max Tokens | Supports Images | Supports JSON Schema | Supports Function Calls |
---|---|---|---|---|---|---|
Mixtral 8x7b instruct | mistralai/mixtral-8x7b-instruct-v0.1 | Mixtral 8x7b instruct | 128000 | ❌ | ❌ | ❌ |
Mistral 7b instruct v0.2 | mistralai/mistral-7b-instruct-v0.2 | The Mistral-7B-Instruct-v0.2 Large Language Model (LLM) is an improved instruct fine-tuned version of Mistral-7B-Instruct-v0.1. | 128000 | ❌ | ❌ | ❌ |
Mistral 7b instruct v0.1 | mistral-7b-instruct-v0.1 | An instruction-tuned 7 billion parameter language model from Mistral | 128000 | ❌ | ❌ | ❌ |
Mixtral 8x7b instruct v0.1 | mistralai/mixtral-8x7b-instruct-v0.1 | The Mixtral-8x7B-instruct-v0.1 Large Language Model (LLM) is a pretrained generative Sparse Mixture of Experts tuned to be a helpful assistant. | 128000 | ❌ | ❌ | ❌ |
Llama 2 13b chat | meta/llama-2-13b-chat | A 13 billion parameter language model from Meta, fine tuned for chat completions | 128000 | ❌ | ❌ | ❌ |
Llama 2 70b chat | meta/llama-2-70b-chat | A 70 billion parameter language model from Meta, fine tuned for chat completions | 128000 | ❌ | ❌ | ❌ |
OpenAI
Model Name | Model ID | Description | Max Tokens | Supports Images | Supports JSON Schema | Supports Function Calls |
---|---|---|---|---|---|---|
GPT-4o | gpt-4o | Advanced, multimodal flagship model that's cheaper and faster than GPT-4 Turbo | 128000 | ✅ | ✅ | ✅ |
GPT-4o 2024-04-09 | gpt-4-turbo-2024-04-09 | Advanced, multimodal flagship model that's cheaper and faster than GPT-4 Turbo | 128000 | ✅ | ✅ | ✅ |
GPT-4o-mini | gpt-4o-mini | Affordable and intelligent small model for fast, lightweight tasks. GPT-4o mini is cheaper and more capable than GPT-3.5 Turbo. Currently points to gpt-4o-mini-2024-07-18. | 128000 | ✅ | ✅ | ✅ |
GPT-4o-mini 2024-07-18 | gpt-4o-mini-2024-07-18 | 2024-07-18 version of gpt-4o-mini | 128000 | ✅ | ✅ | ✅ |
GPT-4o 2024-08-06 | gpt-4o-2024-08-06 | 2024-08-06 version of gpt-4o | 128000 | ✅ | ✅ | ✅ |
GPT-4 Turbo | gpt-4-turbo | The latest GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. | 128000 | ✅ | ✅ | ✅ |
GPT-4 Turbo Preview | gpt-4-turbo-preview | The latest GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Returns a maximum of 4,096 output tokens. This preview model is not yet suited for production traffic. | 128000 | ✅ | ✅ | ✅ |
GPT-4 Vision | gpt-4-vision-preview | GPT-4 with the ability to understand images, in addition to all other GPT-4 Turbo capabilities. | 128000 | ✅ | ✅ | ✅ |
GPT-4 | gpt-4 | More capable than any GPT-3.5 model, able to do more complex tasks, and optimized for chat. Will be updated with our latest model iteration. | 8192 | ❌ | ✅ | ✅ |
GPT-4 0613 | gpt-4-0613 | More capable than any GPT-3.5 model, able to do more complex tasks, and optimized for chat. Will be updated with our latest model iteration. | 8192 | ❌ | ✅ | ✅ |
GPT-4 32K | gpt-4-32k | Same capabilities as the base gpt-4 mode but with 4x the context length. Will be updated with our latest model iteration. | 32768 | ❌ | ✅ | ✅ |
GPT-3.5 Turbo 0613 | gpt-3.5-turbo-0613 | Most capable GPT-3.5 model and optimized for chat at 1/10th the cost of text-davinci-003. Will be updated with our latest model iteration. | 4096 | ❌ | ✅ | ✅ |
GPT-3.5 Turbo | gpt-3.5-turbo | Most capable GPT-3.5 model and optimized for chat at 1/10th the cost of text-davinci-003. Will be updated with our latest model iteration. | 4096 | ❌ | ✅ | ✅ |
GPT-3.5 Turbo 16K | gpt-3.5-turbo-16k | Same capabilities as the base gpt-3.5-turbo model but with 4x the context length. Will be updated with our latest model iteration. | 16384 | ❌ | ✅ | ✅ |
Groq
Model Name | Model ID | Description | Max Tokens | Supports Images | Supports JSON Schema | Supports Function Calls |
---|---|---|---|---|---|---|
Llama 3.1 405B Reasoning | llama-3.1-405b-reasoning | Llama 3.1 405B Reasoning | 131072 | ❌ | ✅ | ❌ |
Llama 3.1 70B Versatile (Tool Use Preview) | llama3-groq-70b-8192-tool-use-preview | Llama 3.1 70B Versatile (Tool Use Preview) | 8192 | ❌ | ✅ | ✅ |
Llama 3.1 70B Versatile | llama-3.1-70b-versatile | Llama 3.1 70B Versatile | 131072 | ❌ | ✅ | ❌ |
Llama 3.1 8B Instant (Tool Use Preview) | llama3-groq-8b-8192-tool-use-preview | Llama 3.1 8B Instant (Tool Use Preview) | 8192 | ❌ | ✅ | ✅ |
Llama 3.1 8B Instant | llama-3.1-8b-instant | Llama 3.1 8B Instant | 131072 | ❌ | ✅ | ✅ |
LLaMA3-70b | llama3-70b-8192 | LLaMA3-70b | 8192 | ❌ | ✅ | ✅ |
LLaMA3-8b | llama3-8b-8192 | LLaMA3-8b | 8192 | ❌ | ✅ | ✅ |
LLaMA2-70b | llama2-70b-4096 | LLaMA2-70b | 4096 | ❌ | ❌ | ❌ |
Mixtral-8x7b | mixtral-8x7b-32768 | Mixtral-8x7b | 32768 | ❌ | ✅ | ✅ |
Gemma-7b-it | gemma-7b-it | Gemma-7b-it | 8192 | ❌ | ✅ | ✅ |
Google Generative AI
Model Name | Model ID | Description | Max Tokens | Supports Images | Supports JSON Schema | Supports Function Calls |
---|---|---|---|---|---|---|
gemini-pro | gemini-pro | gemini-pro | 32000 | ❌ | ❌ | ❌ |
Anthropic Claude
Model Name | Model ID | Description | Max Tokens | Supports Images | Supports JSON Schema | Supports Function Calls |
---|---|---|---|---|---|---|
Claude 3.5 Sonnet (V2) | claude-3-5-sonnet-20241022 | Most intelligent model, includes multilingual and vision capabilities. This is the latest version of the model. | 200000 | ✅ | ✅ | ✅ |
Claude 3.5 Sonnet (V1) | claude-3-5-sonnet-20240620 | Most intelligent model, includes multilingual and vision capabilities. This is the first version of the model. | 200000 | ✅ | ✅ | ✅ |
Claude 3 Opus | claude-3-opus-20240229 | Most powerful model for highly complex tasks, offering top-level performance with multilingual and vision capabilities. | 200000 | ✅ | ✅ | ✅ |
Claude 3 Sonnet | claude-3-sonnet-20240229 | Ideal balance of intelligence and speed for enterprise workloads, with multilingual and vision support. | 200000 | ✅ | ✅ | ✅ |
Claude 3 Haiku | claude-3-haiku-20240307 | Fastest and most compact model for near-instant responsiveness, includes multilingual and vision capabilities. | 200000 | ✅ | ✅ | ✅ |
OctoAI
Model Name | Model ID | Description | Max Tokens | Supports Images | Supports JSON Schema | Supports Function Calls |
---|---|---|---|---|---|---|
Llama-3.1-Instruct (8B) | meta-llama-3.1-8b-instruct | Meta's Llama-3.1-Instruct model with 8 billion parameters for chat use cases. | 131072 | ❌ | ✅ | ✅ |
Llama-3.1-Instruct (70B) | meta-llama-3.1-70b-instruct | Meta's Llama-3.1-Instruct model with 70 billion parameters for chat use cases. | 131072 | ❌ | ✅ | ✅ |
Llama3-Instruct (8B) | meta-llama-3-8b-instruct | Meta's Llama3-Instruct model with 8 billion parameters for chat use cases. | 8192 | ❌ | ✅ | ✅ |
Llama3-Instruct (70B) | meta-llama-3-70b-instruct | Meta's Llama3-Instruct model with 70 billion parameters for chat use cases. | 8192 | ❌ | ✅ | ✅ |
Mistral Instruct v0.3 (7B) | mistral-7b-instruct | Mistral's Instruct v0.3 model with 7 billion parameters for chat and coding use cases. | 32768 | ❌ | ❌ | ❌ |
Mixtral Instruct (8x7B) | mixtral-8x7b-instruct | Mistral's Mixtral Instruct model with 8x7 billion parameters for chat and coding use cases. | 32768 | ❌ | ❌ | ❌ |
Nous Hermes 2 Mixtral DPO (8x7B) | nous-hermes-2-mixtral-8x7b-dpo | Nous Research's Hermes 2 Mixtral DPO model with 8x7 billion parameters for content moderation. | 32768 | ❌ | ❌ | ❌ |
Mixtral Instruct (8x22B) | mixtral-8x22b-instruct | Mistral's Mixtral Instruct model with 8x22 billion parameters for chat and coding use cases. | 65536 | ❌ | ❌ | ❌ |
WizardLM-2 (8x22B) | wizardlm-2-8x22b | Microsoft's WizardLM-2 model with 8x22 billion parameters for chat and coding use cases. | 65536 | ❌ | ❌ | ❌ |
Llama Guard 2 | llamaguard-2-7b | Meta's Llama Guard 2 model with 7 billion parameters for content moderation. | 4096 | ❌ | ❌ | ❌ |
Perplexity AI
Model Name | Model ID | Description | Max Tokens | Supports Images | Supports JSON Schema | Supports Function Calls |
---|---|---|---|---|---|---|
Llama-3.1-Sonar-Small (8B) | llama-3.1-sonar-small-128k-online | Meta's Llama-3.1-Sonar-Small model with 8 billion parameters for chat use cases. | 127072 | ❌ | ❌ | ❌ |
Llama-3.1-Sonar-Large (70B) | llama-3.1-sonar-large-128k-online | Meta's Llama-3.1-Sonar-Large model with 70 billion parameters for chat use cases. | 127072 | ❌ | ❌ | ❌ |
Llama-3.1-Sonar-Huge (405B) | llama-3.1-sonar-huge-128k-online | Meta's Llama-3.1-Sonar-Huge model with 405 billion parameters for chat use cases. | 127072 | ❌ | ❌ | ❌ |
Amazon Bedrock
Model Name | Model ID | Description | Max Tokens | Supports Images | Supports JSON Schema | Supports Function Calls |
---|---|---|---|---|---|---|
Claude 3 Sonnet | anthropic.claude-3-sonnet-20240229-v1:0 | Anthropic's Claude 3 Sonnet model on Amazon Bedrock | 200000 | ✅ | ✅ | ❌ |
Claude 3.5 Sonnet | anthropic.claude-3-5-sonnet-20240620-v1:0 | Anthropic's Claude 3.5 Sonnet model on Amazon Bedrock | 200000 | ✅ | ✅ | ✅ |
Claude 3 Haiku | anthropic.claude-3-haiku-20240307-v1:0 | Anthropic's Claude 3 Haiku model on Amazon Bedrock | 200000 | ✅ | ✅ | ❌ |
Claude 3 Opus | anthropic.claude-3-opus-20240229-v1:0 | Anthropic's Claude 3 Opus model on Amazon Bedrock | 200000 | ✅ | ✅ | ❌ |
Llama 3 8B Instruct | meta.llama3-8b-instruct-v1:0 | Meta's Llama 3 8B Instruct model on Amazon Bedrock | 4096 | ❌ | ❌ | ❌ |
Llama 3 70B Instruct | meta.llama3-70b-instruct-v1:0 | Meta's Llama 3 70B Instruct model on Amazon Bedrock | 4096 | ❌ | ❌ | ❌ |
Llama 3.1 8B Instruct | meta.llama3-1-8b-instruct-v1:0 | Meta's Llama 3.1 8B Instruct model on Amazon Bedrock | 128000 | ❌ | ❌ | ❌ |
Llama 3.1 70B Instruct | meta.llama3-1-70b-instruct-v1:0 | Meta's Llama 3.1 70B Instruct model on Amazon Bedrock | 128000 | ❌ | ❌ | ❌ |
Llama 3.1 405B Instruct | meta.llama3-1-405b-instruct-v1:0 | Meta's Llama 3.1 405B Instruct model on Amazon Bedrock | 128000 | ❌ | ❌ | ❌ |
Llama 3.2 1B Instruct | us.meta.llama3-2-1b-instruct-v1:0 | Meta's Llama 3.2 1B Instruct model on Amazon Bedrock | 128000 | ❌ | ❌ | ❌ |
Llama 3.2 3B Instruct | us.meta.llama3-2-3b-instruct-v1:0 | Meta's Llama 3.2 3B Instruct model on Amazon Bedrock | 128000 | ❌ | ❌ | ❌ |
Llama 3.2 11B Instruct | us.meta.llama3-2-11b-instruct-v1:0 | Meta's Llama 3.2 11B Instruct model on Amazon Bedrock | 128000 | ❌ | ❌ | ❌ |
Llama 3.2 90B Instruct | us.meta.llama3-2-90b-instruct-v1:0 | Meta's Llama 3.2 90B Instruct model on Amazon Bedrock | 128000 | ❌ | ✅ | ✅ |
Azure OpenAI
Model Name | Model ID | Description | Max Tokens | Supports Images | Supports JSON Schema | Supports Function Calls |
---|---|---|---|---|---|---|
GPT-4o mini | gpt-4o-mini | Latest small GA model optimized for fast, inexpensive tasks. Supports text and image processing, JSON Mode, and parallel function calling. | 128000 | ✅ | ✅ | ✅ |
GPT-4o | gpt-4o | Latest large GA model with structured outputs, text/image processing, enhanced accuracy and superior performance in non-English languages and vision tasks. | 128000 | ✅ | ✅ | ✅ |
GPT-4 | gpt-4 | Most capable GPT-4 model for tasks requiring deep understanding and advanced reasoning. | 8192 | ❌ | ✅ |