Loading
Feature Degradation | Agentforce Voice Read More
Agentforce and Einstein Generative AI
Table of Contents
Select Filters

          No results
          No results
          Here are some search tips

          Check the spelling of your keywords.
          Use more general search terms.
          Select fewer filters to broaden your search.

          Search all of Salesforce Help
          Large Language Model Support

          Large Language Model Support

          Understand supported large language models (LLMs) from multiple providers for embedded features, such as Prompt Builder. Identify the Salesforce-managed models that are available out of the box. Learn how you can bring your own model (BYOLLM) by using AI Models (formerly Einstein Studio).

          Note
          Note Changing the model can affect your usage, see Einstein Usage.

          Agentforce Models

          This page focuses on supported LLMs for embedded features, such as Prompt Builder. For model options in Agentforce, see Select Agentforce Model Option.

          Salesforce-Managed Models

          Quickly get started with generative AI features by choosing a Salesforce-managed model. Features like Prompt Builder and the Models API allow you to customize AI implementations with different models and use them in your apps. Salesforce-managed models are enabled by default to speed up the configuration process.

          This table lists the Salesforce-managed models that are available for embedded features, such as Prompt Builder. For Agentforce, see the Agentforce Models section.

          Model Provider Model Family Version Usage Type Model Regions Notes
          Bedrock (Amazon) Nova Lite nova-lite-2024-12-04 Basic Prompts Inference profiles: apac.amazon.nova-lite-v1:0, ca.amazon.nova-lite-v1:0, eu.amazon.nova-lite-v1:0, us.amazon.nova-lite-v1:0  
          Bedrock (Amazon) Nova Pro nova-pro-2024-12-04 Standard Prompts Inference profiles: apac.amazon.nova-pro-v1:0, eu.amazon.nova-pro-v1:0, us.amazon.nova-pro-v1:0  
          Bedrock (Anthropic) Claude Haiku 4.5 claude-haiku-4-5-20251001 Standard Prompts Inference profiles: eu.anthropic.claude-haiku-4-5-20251001-v1:0, global.anthropic.claude-haiku-4-5-20251001-v1:0, us.anthropic.claude-haiku-4-5-20251001-v1:0 Reasoning is not supported
          Bedrock (Anthropic) Claude Opus 4.5 claude-opus-4-5-20251101 Advanced Prompts United States Reasoning is not supported
          Bedrock (Anthropic) Claude Opus 4.6 (Beta) claude-opus-4-6-2026-02-05 Advanced Prompts Inference profiles: au.anthropic.claude-opus-4-6-v1, eu.anthropic.claude-opus-4-6-v1, us.anthropic.claude-opus-4-6-v1 Reasoning is not supported
          Bedrock (Anthropic) Claude Opus 4.7 (Beta) claude-opus-4-7-2026-04-16 Advanced Prompts Inference profiles: eu.anthropic.claude-opus-4-7, jp.anthropic.claude-opus-4-7, us.anthropic.claude-opus-4-7 Reasoning is not supported
          Bedrock (Anthropic) Claude Sonnet 4 claude-sonnet-4-20250514 Standard Prompts Inference profiles: apac.anthropic.claude-sonnet-4-20250514-v1:0, eu.anthropic.claude-sonnet-4-20250514-v1:0, us.anthropic.claude-sonnet-4-20250514-v1:0  
          Bedrock (Anthropic) Claude Sonnet 4.5 claude-sonnet-4-5-20250929 Standard Prompts Inference profiles: au.anthropic.claude-sonnet-4-5-20250929-v1:0, eu.anthropic.claude-sonnet-4-5-20250929-v1:0, global.anthropic.claude-sonnet-4-5-20250929-v1:0, jp.anthropic.claude-sonnet-4-5-20250929-v1:0, us.anthropic.claude-sonnet-4-5-20250929-v1:0 Reasoning is not supported
          Bedrock (Anthropic) Claude Sonnet 4.6 claude-sonnet-4-6-2026-02-17 Standard Prompts Inference profiles: au.anthropic.claude-sonnet-4-6, eu.anthropic.claude-sonnet-4-6, jp.anthropic.claude-sonnet-4-6, us.anthropic.claude-sonnet-4-6 Reasoning is not supported
          Bedrock (NVIDIA) Nemotron 3 Nano 30B (Beta) nvidia.nemotron-nano-3-30b Basic Prompts Brazil, India, Italy, Japan, United Kingdom, United States  
          OpenAI and Azure OpenAI GPT-4o (GPT 4 Omni) gpt-4o-2024-11-20 Standard Prompts

          OpenAI: United States

          Azure OpenAI: Australia, Brazil, Canada, France, Germany, India, Japan, Sweden, Switzerland, United Kingdom, United States

          See Geo-Aware Routing for OpenAI and Azure OpenAI.
          OpenAI GPT-4o Mini gpt-4o-mini-2024-07-18 Basic Prompts United States  
          OpenAI and Azure OpenAI GPT-4o-mini (GPT 4 Omni Mini) gpt-4o-mini-2024-07-18 Basic Prompts

          OpenAI: United States

          Azure OpenAI: France, Germany, Japan, Sweden, United Kingdom, United States

          See Geo-Aware Routing for OpenAI and Azure OpenAI.
          OpenAI and Azure OpenAI GPT-4.1 gpt-4.1-2025-04-14 Standard Prompts

          OpenAI: United States

          Azure OpenAI: Australia, Brazil, France, Germany, India, Japan, Singapore, Sweden, United Kingdom, United States

          See Geo-Aware Routing for OpenAI and Azure OpenAI.
          OpenAI and Azure OpenAI GPT-4.1 Mini gpt-4.1-mini-2025-04-14 Basic Prompts

          OpenAI: United States

          Azure OpenAI: Australia, Canada, India, Japan, United Kingdom, United States

          See Geo-Aware Routing for OpenAI and Azure OpenAI.
          OpenAI and Azure OpenAI GPT-5 gpt-5-2025-08-07 Standard Prompts

          OpenAI: United States

          Azure OpenAI: Sweden, United States

          See Geo-Aware Routing for OpenAI and Azure OpenAI.
          OpenAI and Azure OpenAI GPT-5 Mini gpt-5-mini-2025-08-07 Basic Prompts

          OpenAI: United States

          Azure OpenAI: Sweden, United States

          See Geo-Aware Routing for OpenAI and Azure OpenAI.
          OpenAI and Azure OpenAI GPT 5.1 gpt-5.1-2025-11-13 Standard Prompts

          OpenAI: United States

          Azure OpenAI: Sweden, United States

          Reasoning is not supported
          OpenAI and Azure OpenAI GPT 5.2 gpt-5.2-2025-12-11 Standard Prompts

          OpenAI: United States

          Azure OpenAI:United States

          Reasoning is not supported
          OpenAI and Azure OpenAI GPT 5.4 gpt-5.4-2026-03-05 Standard Prompts

          OpenAI: United States

          Azure OpenAI:United States

          Reasoning is not supported
          OpenAI and Azure OpenAI GPT 5.4 Mini (Beta) gpt-5.4-mini-2026-03-17 Basic Prompts United States Reasoning is not supported
          OpenAI and Azure OpenAI GPT 5.5 (Beta) gpt-5.5-2026-04-24 Advanced Prompts United States Reasoning is not supported
          OpenAI and Azure OpenAI O3 o3-2025-04-16 Standard Prompts

          OpenAI: United States

          Azure OpenAI: France, Germany, Sweden, United States

           
          OpenAI and Azure OpenAI O4 Mini o4-mini-2025-04-16 Standard Prompts

          OpenAI: United States

          Azure OpenAI: France, Germany, Sweden, United States

           
          Vertex AI (Google) Gemini 2.5 Flash gemini-2.5-flash-2025-06-17 Basic Prompts Australia, Canada, India, Japan, Netherlands, Singapore, South Korea, United Kingdom, United States  
          Vertex AI (Google) Gemini 2.5 Flash Lite gemini-2.5-flash-lite-2025-07-22 Basic Prompts Netherlands, United States  
          Vertex AI (Google) Gemini 2.5 Pro gemini-2.5-pro-2025-06-17 Standard Prompts Netherlands, United States  
          Vertex AI (Google) Gemini 3 Flash (Beta) gemini-3-flash-preview-2025-12-17 Basic Prompts United States  
          Vertex AI (Google) Gemini 3 Pro (Beta) gemini-3-pro-preview-2025-11-18 Standard Prompts United States Reasoning is not supported. Retiring on April 23, 2026.
          Vertex AI (Google) Gemini 3.1 Flash Lite (Beta) gemini-3.1-flash-lite-preview-2026-03-03 Basic Prompts United States  
          Vertex AI (Google) Gemini 3.1 Pro (Beta) gemini-3.1-pro-preview-2026-02-19 Standard Prompts United States Reasoning is not supported
          Note
          Note All the Salesforce-managed models might not be available in your org.

          For more details about Usage Type, see Agentforce and Generative AI Usage and Billing.

          For more details about these supported models, see Supported Models in the Agentforce Developer Guide.

          Inference Profiles

          Some Anthropic models are accessible in particular AWS Regions only as a cross-region inference profile. Cross-region inference requests are kept within the AWS regions that are part of the geography where the request originates. For example, a request made within the United States to Claude Sonnet 4 is kept within the AWS Regions in the United States. A request made in Japan can be serviced by any of the destinations in the apac.anthropic.claude-sonnet-4-20250514-v1:0 inference profile.

          For more information, see Amazon Bedrock documentation:

          For routing for each Salesforce org region for Claude Sonnet 4, see Geo-Aware Routing for Anthropic.

          Model Limits

          For information about limits per model, such as requests per minute (RPM) and token limits, see Large Language Model Limits.

          Beta Models

          Beta models are new models from model providers that Salesforce is beta testing. Beta models typically have lower rate limits and may not be available in all regions. A beta model has (Beta) appended to its name. If beta models aren't turned on, they appear as (Disabled) in AI Models.

          Note
          Note This feature is a pilot or beta service that is subject to the Beta Services Terms at Agreements - Salesforce.com or a written Unified Pilot Agreement if executed by Customer, and applicable terms in the Product Terms Directory. Use of this pilot or beta service is at the Customer's sole discretion.

          Before you can turn on beta generative AI models, you must turn on Data 360. Data 360 is automatically provisioned as soon as a Data Cloud license is added to your Salesforce org. See Turn On Data 360.

          To turn on beta generative AI models, go to Einstein Setup. After beta models are enabled, you can see them in AI Models and use them just like any Salesforce-managed model.

          We recommend that you enable beta models in sandbox or development orgs only.

          Bring Your Own Large Language Model (BYOLLM)

          The Einstein platform allows you to customize your AI experience by bringing in your own models to Salesforce. You can bring in your own model by using AI Models and write a prompt template in Prompt Builder, which you can then integrate into your own apps or an agent. Some common reasons companies want to use different models with Einstein include:

          • Your company has an LLM fine-tuned to your data.
          • You can use your Azure, Bedrock, OpenAI, or Vertex account.

          BYOLLM supports many of the Salesforce-managed models and these additional models:

          Model Provider Model Family Notes
          Bedrock (Anthropic) Claude 3 Opus Retired by Bedrock
          Bedrock (Anthropic) Claude 3 Sonnet Retired by Bedrock
          Bedrock (Anthropic) Claude 3.5 Sonnet Retired by Bedrock
          Vertex AI (Google) Gemini 1.5 Pro Retired by Vertex AI

          To use models and providers not listed on this page, see the LLM Open Connector.

          Note
          Note For BYOLLM and LLM Open Connector, Salesforce uses a set of IP addresses to communicate with an LLM that you host yourself. If your security policy requires a strict network Access Control List (ACL), make sure to add to your allowlists the IP addresses in the BYO Models and Open Connector IP Addresses section of Salesforce Core Services - IP Addresses and Domains to Allow.

          Develop LLM Solutions with the Models API

          Developers can use Models API to code custom solutions. See the Models API Developer Guide.

          Sustainability

          Sustainability is a core value at Salesforce. Selecting the appropriate model is one of the most effective ways to reduce energy consumption, water usage, and carbon emissions. Compare the environmental impact of these models by using the relative Sustainability Score in the bottom-right section of the Agentic Benchmark page on the AI Research site. For more details on Salesforce's approach to AI sustainability, see Sustainability at Salesforce.

          Deprecated Models

          Model deprecation is the process by which a model provider gradually phases out a model, usually in favor of a new and improved model. A deprecated model may reroute to a preferred model to ensure continuity of service. To learn more, see Prepare for Model Deprecation and Rerouting.

          We recommend that you start migrating your applications as soon as the deprecation is announced. During migration, update and test each part of your application with the replacement model that we recommend.

          These models are deprecated or rerouted.

          Deprecated Model Recommended Replacement Deprecated Date Reroute Date
          Bedrock (Anthropic) Claude 4 Sonnet Claude Sonnet 4.6 May 4, 2026 May 26, 2026
          Vertex AI (Google) Gemini 3 Pro (Beta) Gemini 3.1 Pro (Beta) Mar 23, 2026 N/A. Retiring on Apr 23, 2026.
          Bedrock (Anthropic) Claude 3.7 Sonnet Claude Sonnet 4.5 Jan 6, 2026 Feb 26, 2026
          Bedrock (Anthropic) Claude 3 Haiku Claude Haiku 4.5 Jan 6, 2026 Feb 26, 2026
          Vertex AI (Google) Gemini 2.0 Flash Gemini 2.5 Flash Jan 20, 2026 Feb 20, 2026
          Vertex AI (Google) Gemini 2.0 Flash Gemini 2.5 Flash Lite Jan 20, 2026 Feb 20, 2026
          Azure OpenAI GPT 3.5 Turbo GPT 4 Omni Jun 16, 2025 Jul 16, 2025
          OpenAI GPT 3.5 Turbo GPT 4 Omni Mini Jun 16, 2025 Jul 16, 2025
          OpenAI GPT 4 GPT 4 Omni Jun 2, 2025 Jun 30, 2025
          OpenAI GPT 4 Turbo GPT 4 Omni May 6, 2025 Jun 30, 2025
          OpenAI GPT 4 32k GPT 4 Omni Jun 6, 2024 Jun 6, 2025
          Azure OpenAI GPT 4 Turbo GPT 4 Omni April 7, 2025 May 1, 2025
          Azure OpenAI GPT 3.5 Turbo 16k Azure OpenAI GPT 3.5 Turbo Nov 6, 2023 Nov 13, 2024
          OpenAI GPT 3.5 Turbo 16k GPT 3.5 Turbo Nov 6, 2023 Sep 13, 2024

          Rerouted Models

          These models are rerouted.

          Model Provider Model Family Version Rerouted To
          Azure OpenAI GPT-3.5 Turbo gpt-3.5-turbo-0613 GPT 4 Omni Mini
          Azure OpenAI GPT-3.5 Turbo 16K gpt-35-turbo-16k-0613 GPT 4 Omni Mini
          Azure OpenAI GPT-4 Turbo gpt-4-1106-Preview GPT 4 Omni
          Bedrock (Anthropic) Claude 3 Haiku claude-3-haiku-20240307 Claude Haiku 4.5
          Bedrock (Anthropic) Claude 3.7 Sonnet claude-3-7-sonnet-20250219 Claude Sonnet 4.5
          OpenAI GPT-3.5 Turbo gpt-3.5-turbo-0125 OpenAI GPT 4 Omni Mini
          OpenAI GPT-3.5 Turbo 16K gpt-3.5-turbo-16k OpenAI GPT 4 Omni Mini
          OpenAI GPT-4 gpt-4-0613 GPT 4 Omni
          OpenAI GPT-4 32K gpt-4-32k-0613 GPT 4 Omni
          OpenAI GPT-4 Turbo gpt-4-0125-preview GPT 4 Omni
          Vertex AI (Google) Gemini 2.0 Flash gemini-2.0-flash-001 Gemini 2.5 Flash
          Vertex AI (Google) Gemini 2.0 Flash Lite gemini-2.0-flash-lite-001 Gemini 2.5 Flash Lite

          Announcing New and Deprecated Models

          New model announcements and model deprecation announcements are part of the monthly Einstein Platform release notes.

          • Manage Model Provider Access
            Choose which Large Language Model (LLM) providers to allow or not allow in your organization. When access to a model provider is turned on, you can use its language learning models (LLMs) in agents, prompt templates, APIs, and other features in generative AI solutions. Turn off a model provider to block access to its models in your org.
          • Prepare for Model Deprecation and Rerouting
            This document provides guidance on retesting Salesforce AI implementations during model deprecation and rerouting. Deprecation and rerouting are considered temporary. If your model is in one of these states, consider switching to a new model.
          • Large Language Model Multimodal Support
            Salesforce-managed models have different levels of support and limits for including JPG, PNG, or PDF files in a model request.
          • Large Language Model Limits
            Understand limits for supported large language models (LLMs) from multiple providers for embedded features, such as Prompt Builder. Limits for each model include requests per minute and token limits.
          • Salesforce-Owned Models
            Salesforce AI Research creates, trains, and fine tunes models to address specific Salesforce use cases. These models are hosted on AWS within the Salesforce Trust Boundary.
          • Batch Models
            Use Prompt Template Batch Processing to generate large quantities of responses for prompt templates asynchronously.
           
          Loading
          Salesforce Help | Article