You are here:
Large Language Model Support
Understand supported large language models (LLMs) from multiple providers for embedded features, such as Prompt Builder. Identify the Salesforce-managed models that are available out of the box. Learn how you can bring your own model (BYOLLM) by using AI Models (formerly Einstein Studio).
Agentforce Models
This page focuses on supported LLMs for embedded features, such as Prompt Builder. For model options in Agentforce, see Select Agentforce Model Option.
Salesforce-Managed Models
Quickly get started with generative AI features by choosing a Salesforce-managed model. Features like Prompt Builder and the Models API allow you to customize AI implementations with different models and use them in your apps. Salesforce-managed models are enabled by default to speed up the configuration process.
This table lists the Salesforce-managed models that are available for embedded features, such as Prompt Builder. For Agentforce, see the Agentforce Models section.
| Model Provider | Model Family | Version | Usage Type | Model Regions | Notes |
|---|---|---|---|---|---|
| Bedrock (Amazon) | Nova Lite | nova-lite-2024-12-04 | Basic Prompts | Inference profiles: apac.amazon.nova-lite-v1:0, ca.amazon.nova-lite-v1:0, eu.amazon.nova-lite-v1:0, us.amazon.nova-lite-v1:0 | |
| Bedrock (Amazon) | Nova Pro | nova-pro-2024-12-04 | Standard Prompts | Inference profiles: apac.amazon.nova-pro-v1:0, eu.amazon.nova-pro-v1:0, us.amazon.nova-pro-v1:0 | |
| Bedrock (Anthropic) | Claude Haiku 4.5 | claude-haiku-4-5-20251001 | Standard Prompts | Inference profiles: eu.anthropic.claude-haiku-4-5-20251001-v1:0, global.anthropic.claude-haiku-4-5-20251001-v1:0, us.anthropic.claude-haiku-4-5-20251001-v1:0 | Reasoning is not supported |
| Bedrock (Anthropic) | Claude Opus 4.5 | claude-opus-4-5-20251101 | Advanced Prompts | United States | Reasoning is not supported |
| Bedrock (Anthropic) | Claude Opus 4.6 (Beta) | claude-opus-4-6-2026-02-05 | Advanced Prompts | Inference profiles: au.anthropic.claude-opus-4-6-v1, eu.anthropic.claude-opus-4-6-v1, us.anthropic.claude-opus-4-6-v1 | Reasoning is not supported |
| Bedrock (Anthropic) | Claude Opus 4.7 (Beta) | claude-opus-4-7-2026-04-16 | Advanced Prompts | Inference profiles: eu.anthropic.claude-opus-4-7, jp.anthropic.claude-opus-4-7, us.anthropic.claude-opus-4-7 | Reasoning is not supported |
| Bedrock (Anthropic) | Claude Sonnet 4 | claude-sonnet-4-20250514 | Standard Prompts | Inference profiles: apac.anthropic.claude-sonnet-4-20250514-v1:0, eu.anthropic.claude-sonnet-4-20250514-v1:0, us.anthropic.claude-sonnet-4-20250514-v1:0 | |
| Bedrock (Anthropic) | Claude Sonnet 4.5 | claude-sonnet-4-5-20250929 | Standard Prompts | Inference profiles: au.anthropic.claude-sonnet-4-5-20250929-v1:0, eu.anthropic.claude-sonnet-4-5-20250929-v1:0, global.anthropic.claude-sonnet-4-5-20250929-v1:0, jp.anthropic.claude-sonnet-4-5-20250929-v1:0, us.anthropic.claude-sonnet-4-5-20250929-v1:0 | Reasoning is not supported |
| Bedrock (Anthropic) | Claude Sonnet 4.6 | claude-sonnet-4-6-2026-02-17 | Standard Prompts | Inference profiles: au.anthropic.claude-sonnet-4-6, eu.anthropic.claude-sonnet-4-6, jp.anthropic.claude-sonnet-4-6, us.anthropic.claude-sonnet-4-6 | Reasoning is not supported |
| Bedrock (NVIDIA) | Nemotron 3 Nano 30B (Beta) | nvidia.nemotron-nano-3-30b | Basic Prompts | Brazil, India, Italy, Japan, United Kingdom, United States | |
| OpenAI and Azure OpenAI | GPT-4o (GPT 4 Omni) | gpt-4o-2024-11-20 | Standard Prompts | OpenAI: United States Azure OpenAI: Australia, Brazil, Canada, France, Germany, India, Japan, Sweden, Switzerland, United Kingdom, United States |
See Geo-Aware Routing for OpenAI and Azure OpenAI. |
| OpenAI | GPT-4o Mini | gpt-4o-mini-2024-07-18 | Basic Prompts | United States | |
| OpenAI and Azure OpenAI | GPT-4o-mini (GPT 4 Omni Mini) | gpt-4o-mini-2024-07-18 | Basic Prompts | OpenAI: United States Azure OpenAI: France, Germany, Japan, Sweden, United Kingdom, United States |
See Geo-Aware Routing for OpenAI and Azure OpenAI. |
| OpenAI and Azure OpenAI | GPT-4.1 | gpt-4.1-2025-04-14 | Standard Prompts | OpenAI: United States Azure OpenAI: Australia, Brazil, France, Germany, India, Japan, Singapore, Sweden, United Kingdom, United States |
See Geo-Aware Routing for OpenAI and Azure OpenAI. |
| OpenAI and Azure OpenAI | GPT-4.1 Mini | gpt-4.1-mini-2025-04-14 | Basic Prompts | OpenAI: United States Azure OpenAI: Australia, Canada, India, Japan, United Kingdom, United States |
See Geo-Aware Routing for OpenAI and Azure OpenAI. |
| OpenAI and Azure OpenAI | GPT-5 | gpt-5-2025-08-07 | Standard Prompts | OpenAI: United States Azure OpenAI: Sweden, United States |
See Geo-Aware Routing for OpenAI and Azure OpenAI. |
| OpenAI and Azure OpenAI | GPT-5 Mini | gpt-5-mini-2025-08-07 | Basic Prompts | OpenAI: United States Azure OpenAI: Sweden, United States |
See Geo-Aware Routing for OpenAI and Azure OpenAI. |
| OpenAI and Azure OpenAI | GPT 5.1 | gpt-5.1-2025-11-13 | Standard Prompts | OpenAI: United States Azure OpenAI: Sweden, United States |
Reasoning is not supported |
| OpenAI and Azure OpenAI | GPT 5.2 | gpt-5.2-2025-12-11 | Standard Prompts | OpenAI: United States Azure OpenAI:United States |
Reasoning is not supported |
| OpenAI and Azure OpenAI | GPT 5.4 | gpt-5.4-2026-03-05 | Standard Prompts | OpenAI: United States Azure OpenAI:United States |
Reasoning is not supported |
| OpenAI and Azure OpenAI | GPT 5.4 Mini (Beta) | gpt-5.4-mini-2026-03-17 | Basic Prompts | United States | Reasoning is not supported |
| OpenAI and Azure OpenAI | GPT 5.5 (Beta) | gpt-5.5-2026-04-24 | Advanced Prompts | United States | Reasoning is not supported |
| OpenAI and Azure OpenAI | O3 | o3-2025-04-16 | Standard Prompts | OpenAI: United States Azure OpenAI: France, Germany, Sweden, United States |
|
| OpenAI and Azure OpenAI | O4 Mini | o4-mini-2025-04-16 | Standard Prompts | OpenAI: United States Azure OpenAI: France, Germany, Sweden, United States |
|
| Vertex AI (Google) | Gemini 2.5 Flash | gemini-2.5-flash-2025-06-17 | Basic Prompts | Australia, Canada, India, Japan, Netherlands, Singapore, South Korea, United Kingdom, United States | |
| Vertex AI (Google) | Gemini 2.5 Flash Lite | gemini-2.5-flash-lite-2025-07-22 | Basic Prompts | Netherlands, United States | |
| Vertex AI (Google) | Gemini 2.5 Pro | gemini-2.5-pro-2025-06-17 | Standard Prompts | Netherlands, United States | |
| Vertex AI (Google) | Gemini 3 Flash (Beta) | gemini-3-flash-preview-2025-12-17 | Basic Prompts | United States | |
| Vertex AI (Google) | Gemini 3 Pro (Beta) | gemini-3-pro-preview-2025-11-18 | Standard Prompts | United States | Reasoning is not supported. Retiring on April 23, 2026. |
| Vertex AI (Google) | Gemini 3.1 Flash Lite (Beta) | gemini-3.1-flash-lite-preview-2026-03-03 | Basic Prompts | United States | |
| Vertex AI (Google) | Gemini 3.1 Pro (Beta) | gemini-3.1-pro-preview-2026-02-19 | Standard Prompts | United States | Reasoning is not supported |
- In Setup, a Salesforce admin can disable a model provider. See Manage Model Provider Access.
- In AI Models, a Salesforce admin can hide an LLM configuration from being selected in Prompt Builder. See Manage Large Language Model (LLM) Access by Hiding Configurations.
For more details about Usage Type, see Agentforce and Generative AI Usage and Billing.
For more details about these supported models, see Supported Models in the Agentforce Developer Guide.
Inference Profiles
Some Anthropic models are accessible in particular AWS Regions only as a cross-region inference profile. Cross-region inference requests are kept within the AWS regions that are part of the geography where the request originates. For example, a request made within the United States to Claude Sonnet 4 is kept within the AWS Regions in the United States. A request made in Japan can be serviced by any of the destinations in the apac.anthropic.claude-sonnet-4-20250514-v1:0 inference profile.
For more information, see Amazon Bedrock documentation:
For routing for each Salesforce org region for Claude Sonnet 4, see Geo-Aware Routing for Anthropic.
Model Limits
For information about limits per model, such as requests per minute (RPM) and token limits, see Large Language Model Limits.
Beta Models
Beta models are new models from model providers that Salesforce is beta testing. Beta models typically have lower rate limits and may not be available in all regions. A beta model has (Beta) appended to its name. If beta models aren't turned on, they appear as (Disabled) in AI Models.
Before you can turn on beta generative AI models, you must turn on Data 360. Data 360 is automatically provisioned as soon as a Data Cloud license is added to your Salesforce org. See Turn On Data 360.
To turn on beta generative AI models, go to Einstein Setup. After beta models are enabled, you can see them in AI Models and use them just like any Salesforce-managed model.
We recommend that you enable beta models in sandbox or development orgs only.
Bring Your Own Large Language Model (BYOLLM)
The Einstein platform allows you to customize your AI experience by bringing in your own models to Salesforce. You can bring in your own model by using AI Models and write a prompt template in Prompt Builder, which you can then integrate into your own apps or an agent. Some common reasons companies want to use different models with Einstein include:
- Your company has an LLM fine-tuned to your data.
- You can use your Azure, Bedrock, OpenAI, or Vertex account.
BYOLLM supports many of the Salesforce-managed models and these additional models:
| Model Provider | Model Family | Notes |
| Bedrock (Anthropic) | Claude 3 Opus | Retired by Bedrock |
| Bedrock (Anthropic) | Claude 3 Sonnet | Retired by Bedrock |
| Bedrock (Anthropic) | Claude 3.5 Sonnet | Retired by Bedrock |
| Vertex AI (Google) | Gemini 1.5 Pro | Retired by Vertex AI |
To use models and providers not listed on this page, see the LLM Open Connector.
Develop LLM Solutions with the Models API
Developers can use Models API to code custom solutions. See the Models API Developer Guide.
Sustainability
Sustainability is a core value at Salesforce. Selecting the appropriate model is one of the most effective ways to reduce energy consumption, water usage, and carbon emissions. Compare the environmental impact of these models by using the relative Sustainability Score in the bottom-right section of the Agentic Benchmark page on the AI Research site. For more details on Salesforce's approach to AI sustainability, see Sustainability at Salesforce.
Deprecated Models
Model deprecation is the process by which a model provider gradually phases out a model, usually in favor of a new and improved model. A deprecated model may reroute to a preferred model to ensure continuity of service. To learn more, see Prepare for Model Deprecation and Rerouting.
We recommend that you start migrating your applications as soon as the deprecation is announced. During migration, update and test each part of your application with the replacement model that we recommend.
These models are deprecated or rerouted.
| Deprecated Model | Recommended Replacement | Deprecated Date | Reroute Date |
| Bedrock (Anthropic) Claude 4 Sonnet | Claude Sonnet 4.6 | May 4, 2026 | May 26, 2026 |
| Vertex AI (Google) Gemini 3 Pro (Beta) | Gemini 3.1 Pro (Beta) | Mar 23, 2026 | N/A. Retiring on Apr 23, 2026. |
| Bedrock (Anthropic) Claude 3.7 Sonnet | Claude Sonnet 4.5 | Jan 6, 2026 | Feb 26, 2026 |
| Bedrock (Anthropic) Claude 3 Haiku | Claude Haiku 4.5 | Jan 6, 2026 | Feb 26, 2026 |
| Vertex AI (Google) Gemini 2.0 Flash | Gemini 2.5 Flash | Jan 20, 2026 | Feb 20, 2026 |
| Vertex AI (Google) Gemini 2.0 Flash | Gemini 2.5 Flash Lite | Jan 20, 2026 | Feb 20, 2026 |
| Azure OpenAI GPT 3.5 Turbo | GPT 4 Omni | Jun 16, 2025 | Jul 16, 2025 |
| OpenAI GPT 3.5 Turbo | GPT 4 Omni Mini | Jun 16, 2025 | Jul 16, 2025 |
| OpenAI GPT 4 | GPT 4 Omni | Jun 2, 2025 | Jun 30, 2025 |
| OpenAI GPT 4 Turbo | GPT 4 Omni | May 6, 2025 | Jun 30, 2025 |
| OpenAI GPT 4 32k | GPT 4 Omni | Jun 6, 2024 | Jun 6, 2025 |
| Azure OpenAI GPT 4 Turbo | GPT 4 Omni | April 7, 2025 | May 1, 2025 |
| Azure OpenAI GPT 3.5 Turbo 16k | Azure OpenAI GPT 3.5 Turbo | Nov 6, 2023 | Nov 13, 2024 |
| OpenAI GPT 3.5 Turbo 16k | GPT 3.5 Turbo | Nov 6, 2023 | Sep 13, 2024 |
Rerouted Models
These models are rerouted.
| Model Provider | Model Family | Version | Rerouted To |
|---|---|---|---|
| Azure OpenAI | GPT-3.5 Turbo | gpt-3.5-turbo-0613 | GPT 4 Omni Mini |
| Azure OpenAI | GPT-3.5 Turbo 16K | gpt-35-turbo-16k-0613 | GPT 4 Omni Mini |
| Azure OpenAI | GPT-4 Turbo | gpt-4-1106-Preview | GPT 4 Omni |
| Bedrock (Anthropic) | Claude 3 Haiku | claude-3-haiku-20240307 | Claude Haiku 4.5 |
| Bedrock (Anthropic) | Claude 3.7 Sonnet | claude-3-7-sonnet-20250219 | Claude Sonnet 4.5 |
| OpenAI | GPT-3.5 Turbo | gpt-3.5-turbo-0125 | OpenAI GPT 4 Omni Mini |
| OpenAI | GPT-3.5 Turbo 16K | gpt-3.5-turbo-16k | OpenAI GPT 4 Omni Mini |
| OpenAI | GPT-4 | gpt-4-0613 | GPT 4 Omni |
| OpenAI | GPT-4 32K | gpt-4-32k-0613 | GPT 4 Omni |
| OpenAI | GPT-4 Turbo | gpt-4-0125-preview | GPT 4 Omni |
| Vertex AI (Google) | Gemini 2.0 Flash | gemini-2.0-flash-001 | Gemini 2.5 Flash |
| Vertex AI (Google) | Gemini 2.0 Flash Lite | gemini-2.0-flash-lite-001 | Gemini 2.5 Flash Lite |
Announcing New and Deprecated Models
New model announcements and model deprecation announcements are part of the monthly Einstein Platform release notes.
- Manage Model Provider Access
Choose which Large Language Model (LLM) providers to allow or not allow in your organization. When access to a model provider is turned on, you can use its language learning models (LLMs) in agents, prompt templates, APIs, and other features in generative AI solutions. Turn off a model provider to block access to its models in your org. - Prepare for Model Deprecation and Rerouting
This document provides guidance on retesting Salesforce AI implementations during model deprecation and rerouting. Deprecation and rerouting are considered temporary. If your model is in one of these states, consider switching to a new model. - Large Language Model Multimodal Support
Salesforce-managed models have different levels of support and limits for including JPG, PNG, or PDF files in a model request. - Large Language Model Limits
Understand limits for supported large language models (LLMs) from multiple providers for embedded features, such as Prompt Builder. Limits for each model include requests per minute and token limits. - Salesforce-Owned Models
Salesforce AI Research creates, trains, and fine tunes models to address specific Salesforce use cases. These models are hosted on AWS within the Salesforce Trust Boundary. - Batch Models
Use Prompt Template Batch Processing to generate large quantities of responses for prompt templates asynchronously.

