Large Language Model Support

Understand supported large language models (LLMs) from multiple providers for embedded features, such as Prompt Builder. Identify the Salesforce-managed models that are available out of the box. Learn how you can bring your own model (BYOLLM) by using AI Models (formerly Einstein Studio).

Note: Changing the model can affect your usage. See Einstein Usage.

Agentforce Models

This page focuses on supported LLMs for embedded features, such as Prompt Builder. For model options in Agentforce, see Select Agentforce Model Option.

Salesforce-Managed Models

Quickly get started with generative AI features by choosing a Salesforce-managed model. Features like Prompt Builder and the Models API allow you to customize AI implementations with different models and use them in your apps. Salesforce-managed models are enabled by default to speed up the configuration process.

This table lists the Salesforce-managed models that are available for embedded features, such as Prompt Builder. For Agentforce, see the Agentforce Models section.

Model Provider	Model Family	Version	Usage Type	Model Regions	Notes
Bedrock (Amazon)	Nova Lite	nova-lite-2024-12-04	Basic Prompts	Inference profiles: apac.amazon.nova-lite-v1:0, ca.amazon.nova-lite-v1:0, eu.amazon.nova-lite-v1:0, us.amazon.nova-lite-v1:0
Bedrock (Amazon)	Nova Pro	nova-pro-2024-12-04	Standard Prompts	Inference profiles: apac.amazon.nova-pro-v1:0, eu.amazon.nova-pro-v1:0, us.amazon.nova-pro-v1:0
Bedrock (Anthropic)	Claude Haiku 4.5	claude-haiku-4-5-20251001	Standard Prompts	Inference profiles: eu.anthropic.claude-haiku-4-5-20251001-v1:0, global.anthropic.claude-haiku-4-5-20251001-v1:0, us.anthropic.claude-haiku-4-5-20251001-v1:0	Reasoning is not supported
Bedrock (Anthropic)	Claude Opus 4.5	claude-opus-4-5-20251101	Advanced Prompts	United States	Reasoning is not supported
Bedrock (Anthropic)	Claude Opus 4.6 (Beta)	claude-opus-4-6-2026-02-05	Advanced Prompts	Inference profiles: au.anthropic.claude-opus-4-6-v1, eu.anthropic.claude-opus-4-6-v1, us.anthropic.claude-opus-4-6-v1	Reasoning is not supported
Bedrock (Anthropic)	Claude Opus 4.7 (Beta)	claude-opus-4-7-2026-04-16	Advanced Prompts	Inference profiles: eu.anthropic.claude-opus-4-7, jp.anthropic.claude-opus-4-7, us.anthropic.claude-opus-4-7	Reasoning is not supported
Bedrock (Anthropic)	Claude Sonnet 4	claude-sonnet-4-20250514	Standard Prompts	Inference profiles: apac.anthropic.claude-sonnet-4-20250514-v1:0, eu.anthropic.claude-sonnet-4-20250514-v1:0, us.anthropic.claude-sonnet-4-20250514-v1:0
Bedrock (Anthropic)	Claude Sonnet 4.5	claude-sonnet-4-5-20250929	Standard Prompts	Inference profiles: au.anthropic.claude-sonnet-4-5-20250929-v1:0, eu.anthropic.claude-sonnet-4-5-20250929-v1:0, global.anthropic.claude-sonnet-4-5-20250929-v1:0, jp.anthropic.claude-sonnet-4-5-20250929-v1:0, us.anthropic.claude-sonnet-4-5-20250929-v1:0	Reasoning is not supported
Bedrock (Anthropic)	Claude Sonnet 4.6	claude-sonnet-4-6-2026-02-17	Standard Prompts	Inference profiles: au.anthropic.claude-sonnet-4-6, eu.anthropic.claude-sonnet-4-6, jp.anthropic.claude-sonnet-4-6, us.anthropic.claude-sonnet-4-6	Reasoning is not supported
Bedrock (NVIDIA)	Nemotron 3 Nano 30B (Beta)	nvidia.nemotron-nano-3-30b	Basic Prompts	Brazil, India, Italy, Japan, United Kingdom, United States
OpenAI and Azure OpenAI	GPT-4o (GPT 4 Omni)	gpt-4o-2024-11-20	Standard Prompts	OpenAI: United States Azure OpenAI: Australia, Brazil, Canada, France, Germany, India, Japan, Sweden, Switzerland, United Kingdom, United States	See Geo-Aware Routing for OpenAI and Azure OpenAI.
OpenAI	GPT-4o Mini	gpt-4o-mini-2024-07-18	Basic Prompts	United States
OpenAI and Azure OpenAI	GPT-4o-mini (GPT 4 Omni Mini)	gpt-4o-mini-2024-07-18	Basic Prompts	OpenAI: United States Azure OpenAI: France, Germany, Japan, Sweden, United Kingdom, United States	See Geo-Aware Routing for OpenAI and Azure OpenAI.
OpenAI and Azure OpenAI	GPT-4.1	gpt-4.1-2025-04-14	Standard Prompts	OpenAI: United States Azure OpenAI: Australia, Brazil, France, Germany, India, Japan, Singapore, Sweden, United Kingdom, United States	See Geo-Aware Routing for OpenAI and Azure OpenAI.
OpenAI and Azure OpenAI	GPT-4.1 Mini	gpt-4.1-mini-2025-04-14	Basic Prompts	OpenAI: United States Azure OpenAI: Australia, Canada, India, Japan, United Kingdom, United States	See Geo-Aware Routing for OpenAI and Azure OpenAI.
OpenAI and Azure OpenAI	GPT-5	gpt-5-2025-08-07	Standard Prompts	OpenAI: United States Azure OpenAI: Sweden, United States	See Geo-Aware Routing for OpenAI and Azure OpenAI.
OpenAI and Azure OpenAI	GPT-5 Mini	gpt-5-mini-2025-08-07	Basic Prompts	OpenAI: United States Azure OpenAI: Sweden, United States	See Geo-Aware Routing for OpenAI and Azure OpenAI.
OpenAI and Azure OpenAI	GPT 5.1	gpt-5.1-2025-11-13	Standard Prompts	OpenAI: United States Azure OpenAI: Sweden, United States	Reasoning is not supported
OpenAI and Azure OpenAI	GPT 5.2	gpt-5.2-2025-12-11	Standard Prompts	OpenAI: United States Azure OpenAI: United States	Reasoning is not supported
OpenAI and Azure OpenAI	GPT 5.4	gpt-5.4-2026-03-05	Standard Prompts	OpenAI: United States Azure OpenAI: United States	Reasoning is not supported
OpenAI and Azure OpenAI	GPT 5.4 Mini (Beta)	gpt-5.4-mini-2026-03-17	Basic Prompts	United States	Reasoning is not supported
OpenAI and Azure OpenAI	GPT 5.5 (Beta)	gpt-5.5-2026-04-24	Advanced Prompts	United States	Reasoning is not supported
OpenAI and Azure OpenAI	O3	o3-2025-04-16	Standard Prompts	OpenAI: United States Azure OpenAI: France, Germany, Sweden, United States
OpenAI and Azure OpenAI	O4 Mini	o4-mini-2025-04-16	Standard Prompts	OpenAI: United States Azure OpenAI: France, Germany, Sweden, United States
Vertex AI (Google)	Gemini 2.5 Flash	gemini-2.5-flash-2025-06-17	Basic Prompts	Australia, Canada, India, Japan, Netherlands, Singapore, South Korea, United Kingdom, United States
Vertex AI (Google)	Gemini 2.5 Flash Lite	gemini-2.5-flash-lite-2025-07-22	Basic Prompts	Netherlands, United States
Vertex AI (Google)	Gemini 2.5 Pro	gemini-2.5-pro-2025-06-17	Standard Prompts	Netherlands, United States
Vertex AI (Google)	Gemini 3 Flash (Beta)	gemini-3-flash-preview-2025-12-17	Basic Prompts	United States
Vertex AI (Google)	Gemini 3 Pro (Beta)	gemini-3-pro-preview-2025-11-18	Standard Prompts	United States	Reasoning is not supported. Retiring on April 23, 2026.
Vertex AI (Google)	Gemini 3.1 Flash Lite (Beta)	gemini-3.1-flash-lite-preview-2026-03-03	Basic Prompts	United States
Vertex AI (Google)	Gemini 3.1 Pro (Beta)	gemini-3.1-pro-preview-2026-02-19	Standard Prompts	United States	Reasoning is not supported

Note: Not all Salesforce-managed models are necessarily available in your org.

In Setup, a Salesforce admin can disable a model provider. See Manage Model Provider Access.
In AI Models, a Salesforce admin can hide an LLM configuration from being selected in Prompt Builder. See Manage Large Language Model (LLM) Access by Hiding Configurations.

For more details about Usage Type, see Agentforce and Generative AI Usage and Billing.

For more details about these supported models, see Supported Models in the Agentforce Developer Guide.

Inference Profiles

Some Anthropic models are accessible in particular AWS Regions only through a cross-region inference profile. Cross-region inference requests stay within the AWS Regions that belong to the geography where the request originates. For example, a request made within the United States to Claude Sonnet 4 stays within the AWS Regions in the United States. A request made in Japan can be served by any of the destinations in the apac.anthropic.claude-sonnet-4-20250514-v1:0 inference profile.

For more information, see the Amazon Bedrock documentation.

For Claude Sonnet 4 routing by Salesforce org region, see Geo-Aware Routing for Anthropic.
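
The inference profile IDs in the table above appear to follow a geography.vendor.model pattern. As a minimal sketch of that reading, the snippet below splits an ID into those three parts. The names parse_inference_profile, InferenceProfile, and KNOWN_GEOS are hypothetical helpers for illustration, not part of any Salesforce or AWS API, and the set of geography prefixes is only what this page lists.

```python
from typing import NamedTuple

# Geography prefixes that appear in the inference profile IDs on this page.
# This set is an assumption drawn from the table, not an official list.
KNOWN_GEOS = {"apac", "au", "ca", "eu", "global", "jp", "us"}


class InferenceProfile(NamedTuple):
    geo: str     # geography prefix, for example "apac"
    vendor: str  # vendor segment, for example "anthropic"
    model: str   # remainder of the ID, for example "claude-sonnet-4-20250514-v1:0"


def parse_inference_profile(profile_id: str) -> InferenceProfile:
    """Split '<geo>.<vendor>.<model>' into its three parts."""
    parts = profile_id.split(".", 2)
    if len(parts) != 3 or parts[0] not in KNOWN_GEOS:
        raise ValueError(f"not a cross-region inference profile ID: {profile_id!r}")
    return InferenceProfile(*parts)


profile = parse_inference_profile("apac.anthropic.claude-sonnet-4-20250514-v1:0")
print(profile.geo, profile.vendor, profile.model)
```

The geography prefix tells you which group of AWS Regions can serve the request. It does not tell you which single Region does.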

Model Limits

For information about limits per model, such as requests per minute (RPM) and token limits, see Large Language Model Limits.

Beta Models

Beta models are new models from model providers that Salesforce is beta testing. Beta models typically have lower rate limits and may not be available in all regions. A beta model has (Beta) appended to its name. If beta models aren't turned on, they appear as (Disabled) in AI Models.

Note: This feature is a pilot or beta service that is subject to the Beta Services Terms at Agreements - Salesforce.com or a written Unified Pilot Agreement if executed by Customer, and applicable terms in the Product Terms Directory. Use of this pilot or beta service is at the Customer's sole discretion.

Before you can turn on beta generative AI models, you must turn on Data 360. Data 360 is automatically provisioned as soon as a Data Cloud license is added to your Salesforce org. See Turn On Data 360.

To turn on beta generative AI models, go to Einstein Setup. After beta models are enabled, you can see them in AI Models and use them just like any Salesforce-managed model.

We recommend that you enable beta models in sandbox or development orgs only.

Bring Your Own Large Language Model (BYOLLM)

The Einstein platform allows you to customize your AI experience by bringing your own models to Salesforce. You can bring in your own model by using AI Models, and then write a prompt template in Prompt Builder that you can integrate into your own apps or an agent. Some common reasons companies want to use different models with Einstein include:

Your company has an LLM fine-tuned to your data.
You want to use your own Azure, Bedrock, OpenAI, or Vertex AI account.

BYOLLM supports many of the Salesforce-managed models and these additional models:

Model Provider	Model Family	Notes
Bedrock (Anthropic)	Claude 3 Opus	Retired by Bedrock
Bedrock (Anthropic)	Claude 3 Sonnet	Retired by Bedrock
Bedrock (Anthropic)	Claude 3.5 Sonnet	Retired by Bedrock
Vertex AI (Google)	Gemini 1.5 Pro	Retired by Vertex AI

To use models and providers not listed on this page, see the LLM Open Connector.

Note: For BYOLLM and LLM Open Connector, Salesforce uses a set of IP addresses to communicate with an LLM that you host yourself. If your security policy requires a strict network access control list (ACL), add the IP addresses listed in the BYO Models and Open Connector IP Addresses section of Salesforce Core Services - IP Addresses and Domains to Allow to your allowlists.
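
To illustrate the allowlist idea in the note above, here is a minimal sketch of a source-address check that a self-hosted LLM endpoint might run. The CIDR ranges are placeholders from the reserved documentation blocks, not real Salesforce addresses; take the actual values from the Salesforce IP address page named in the note. The names ALLOWLIST and is_allowed are hypothetical.

```python
import ipaddress

# Placeholder ranges from reserved documentation blocks, NOT real Salesforce IPs.
# Replace them with the published BYO Models and Open Connector addresses.
ALLOWLIST = [
    ipaddress.ip_network("192.0.2.0/24"),
    ipaddress.ip_network("198.51.100.16/28"),
]


def is_allowed(source_ip: str) -> bool:
    """Return True if source_ip falls inside any allowlisted network."""
    addr = ipaddress.ip_address(source_ip)
    return any(addr in network for network in ALLOWLIST)


print(is_allowed("192.0.2.10"))
print(is_allowed("203.0.113.5"))
```

In practice this rule usually lives in a firewall or load balancer rather than in application code. The logic is the same either way.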

Develop LLM Solutions with the Models API

Developers can use the Models API to code custom solutions. See the Models API Developer Guide.
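
As a rough sketch of what a text generation call might look like, the snippet below builds an HTTP request without sending it. The base URL, the /models/{name}/generations path, the two x- headers, and the model API name sfdc_ai__DefaultGPT4Omni are assumptions that this page does not confirm, so check each one against the Models API Developer Guide before relying on it. build_generation_request is a hypothetical helper name.

```python
import json
import urllib.request

# Assumed base URL; confirm it in the Models API Developer Guide.
BASE_URL = "https://api.salesforce.com/einstein/platform/v1"


def build_generation_request(model_api_name: str, prompt: str, access_token: str):
    """Build (but do not send) a POST request for a single text generation."""
    url = f"{BASE_URL}/models/{model_api_name}/generations"
    body = json.dumps({"prompt": prompt}).encode("utf-8")
    headers = {
        "Authorization": f"Bearer {access_token}",
        "Content-Type": "application/json",
        # Assumed header names and values; verify them before use.
        "x-sfdc-app-context": "EinsteinGPT",
        "x-client-feature-id": "ai-platform-models-connected-app",
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


request = build_generation_request(
    "sfdc_ai__DefaultGPT4Omni",  # assumed model API name
    "Summarize this case in two sentences.",
    "<ACCESS_TOKEN>",  # placeholder; obtain a real token through your org's OAuth flow
)
print(request.get_method(), request.full_url)
```

To send the request, pass it to urllib.request.urlopen once a valid access token is in place.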

Sustainability

Sustainability is a core value at Salesforce. Selecting the appropriate model is one of the most effective ways to reduce energy consumption, water usage, and carbon emissions. Compare the environmental impact of these models by using the relative Sustainability Score in the bottom-right section of the Agentic Benchmark page on the AI Research site. For more details on Salesforce's approach to AI sustainability, see Sustainability at Salesforce.

Deprecated Models

Model deprecation is the process by which a model provider gradually phases out a model, usually in favor of a new and improved model. A deprecated model may reroute to a preferred model to ensure continuity of service. To learn more, see Prepare for Model Deprecation and Rerouting.

We recommend that you start migrating your applications as soon as the deprecation is announced. During migration, update and test each part of your application with the replacement model that we recommend.

These models are deprecated or rerouted.

Deprecated Model	Recommended Replacement	Deprecated Date	Reroute Date
Bedrock (Anthropic) Claude Sonnet 4	Claude Sonnet 4.6	May 4, 2026	May 26, 2026
Vertex AI (Google) Gemini 3 Pro (Beta)	Gemini 3.1 Pro (Beta)	Mar 23, 2026	N/A. Retiring on Apr 23, 2026.
Bedrock (Anthropic) Claude 3.7 Sonnet	Claude Sonnet 4.5	Jan 6, 2026	Feb 26, 2026
Bedrock (Anthropic) Claude 3 Haiku	Claude Haiku 4.5	Jan 6, 2026	Feb 26, 2026
Vertex AI (Google) Gemini 2.0 Flash	Gemini 2.5 Flash	Jan 20, 2026	Feb 20, 2026
Vertex AI (Google) Gemini 2.0 Flash Lite	Gemini 2.5 Flash Lite	Jan 20, 2026	Feb 20, 2026
Azure OpenAI GPT 3.5 Turbo	GPT 4 Omni	Jun 16, 2025	Jul 16, 2025
OpenAI GPT 3.5 Turbo	GPT 4 Omni Mini	Jun 16, 2025	Jul 16, 2025
OpenAI GPT 4	GPT 4 Omni	Jun 2, 2025	Jun 30, 2025
OpenAI GPT 4 Turbo	GPT 4 Omni	May 6, 2025	Jun 30, 2025
OpenAI GPT 4 32k	GPT 4 Omni	Jun 6, 2024	Jun 6, 2025
Azure OpenAI GPT 4 Turbo	GPT 4 Omni	Apr 7, 2025	May 1, 2025
Azure OpenAI GPT 3.5 Turbo 16k	Azure OpenAI GPT 3.5 Turbo	Nov 6, 2023	Nov 13, 2024
OpenAI GPT 3.5 Turbo 16k	GPT 3.5 Turbo	Nov 6, 2023	Sep 13, 2024
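
When planning a migration, it can help to work out which model is expected to serve a request on a given date. The sketch below copies three rows from the table above into a lookup. It assumes that rerouting takes effect on the listed reroute date itself, which this page does not state explicitly. REROUTES and effective_model are hypothetical names.

```python
from datetime import date

# deprecated model -> (recommended replacement, reroute date),
# copied from three rows of the Deprecated Models table.
REROUTES = {
    "Bedrock (Anthropic) Claude 3.7 Sonnet": ("Claude Sonnet 4.5", date(2026, 2, 26)),
    "Bedrock (Anthropic) Claude 3 Haiku": ("Claude Haiku 4.5", date(2026, 2, 26)),
    "OpenAI GPT 4 Turbo": ("GPT 4 Omni", date(2025, 6, 30)),
}


def effective_model(model: str, on: date) -> str:
    """Return the model expected to serve a request for `model` on date `on`."""
    entry = REROUTES.get(model)
    if entry is None:
        return model  # not in the lookup, so no reroute is assumed
    replacement, reroute_date = entry
    # Assumption: the reroute applies from the reroute date onward (inclusive).
    return replacement if on >= reroute_date else model


print(effective_model("OpenAI GPT 4 Turbo", date(2025, 6, 29)))
print(effective_model("OpenAI GPT 4 Turbo", date(2025, 6, 30)))
```

A check like this can flag prompt templates that still reference a model whose reroute date is near, so you can retest them with the replacement first.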

Rerouted Models

These models are rerouted.

Model Provider	Model Family	Version	Rerouted To
Azure OpenAI	GPT-3.5 Turbo	gpt-3.5-turbo-0613	GPT 4 Omni Mini
Azure OpenAI	GPT-3.5 Turbo 16K	gpt-35-turbo-16k-0613	GPT 4 Omni Mini
Azure OpenAI	GPT-4 Turbo	gpt-4-1106-Preview	GPT 4 Omni
Bedrock (Anthropic)	Claude 3 Haiku	claude-3-haiku-20240307	Claude Haiku 4.5
Bedrock (Anthropic)	Claude 3.7 Sonnet	claude-3-7-sonnet-20250219	Claude Sonnet 4.5
OpenAI	GPT-3.5 Turbo	gpt-3.5-turbo-0125	GPT 4 Omni Mini
OpenAI	GPT-3.5 Turbo 16K	gpt-3.5-turbo-16k	GPT 4 Omni Mini
OpenAI	GPT-4	gpt-4-0613	GPT 4 Omni
OpenAI	GPT-4 32K	gpt-4-32k-0613	GPT 4 Omni
OpenAI	GPT-4 Turbo	gpt-4-0125-preview	GPT 4 Omni
Vertex AI (Google)	Gemini 2.0 Flash	gemini-2.0-flash-001	Gemini 2.5 Flash
Vertex AI (Google)	Gemini 2.0 Flash Lite	gemini-2.0-flash-lite-001	Gemini 2.5 Flash Lite

Announcing New and Deprecated Models

New model announcements and model deprecation announcements are part of the monthly Einstein Platform release notes.

Manage Model Provider Access
Choose which large language model (LLM) providers to allow or block in your organization. When access to a model provider is turned on, you can use its LLMs in agents, prompt templates, APIs, and other features in generative AI solutions. Turn off a model provider to block access to its models in your org.
Prepare for Model Deprecation and Rerouting
This document provides guidance on retesting Salesforce AI implementations during model deprecation and rerouting. Deprecation and rerouting are considered temporary. If your model is in one of these states, consider switching to a new model.
Large Language Model Multimodal Support
Salesforce-managed models have different levels of support and limits for including JPG, PNG, or PDF files in a model request.
Large Language Model Limits
Understand limits for supported large language models (LLMs) from multiple providers for embedded features, such as Prompt Builder. Limits for each model include requests per minute and token limits.
Salesforce-Owned Models
Salesforce AI Research creates, trains, and fine-tunes models to address specific Salesforce use cases. These models are hosted on AWS within the Salesforce Trust Boundary.
Batch Models
Use Prompt Template Batch Processing to generate large quantities of responses for prompt templates asynchronously.
