Geo-Aware Routing for OpenAI and Azure OpenAI

A geo-aware model, such as GPT 4 Omni, uses OpenAI or Azure OpenAI to service LLM requests. The same underlying model is used by OpenAI and Azure OpenAI.

Required Editions

Available in: Enterprise, Performance, and Unlimited Editions with an Einstein for Sales, Einstein for Platform, Einstein for Service, Einstein 1 Service, or Einstein GPT Service add-on. To purchase add-ons, contact your Salesforce account executive.

Requests in the United States are serviced in the United States by OpenAI, and in some cases Azure OpenAI. Requests outside the United States are routed to a supported Azure OpenAI endpoint that is closest to where your Einstein generative AI platform instance is located.

Azure OpenAI

This table shows geo-aware routing for Azure OpenAI requests for GPT 4 Omni. Each geo-aware model has its own set of model regions where model endpoints are deployed. Models other than GPT 4 Omni may not have model endpoints in all the regions listed in the table. For more information for each model, see Large Language Model Support.

The columns are:

Salesforce Org Region: The region where a Salesforce org is hosted
Primary Model Endpoint Region: The primary region for the Azure OpenAI endpoint
Secondary Model Endpoint Regions: The secondary in-region model endpoints where model requests are retried if the primary model endpoint isn't responsive. If the primary and secondary endpoints aren't responsive, the request falls back to Azure OpenAI in the United States. If the request fails on each primary and secondary model endpoint and the model endpoints in the United States, an error is returned. See the Disable Fallback to the United States section after the table.

Salesforce Org Region	Primary Model Endpoint Region	Secondary Model Endpoint Regions
Australia	Australia	None
Brazil	Brazil	None
Canada	Canada	None
France	France	Germany Sweden Switzerland
Germany	Germany	France Sweden Switzerland
India	India	None
Indonesia	Australia	None
Japan	Japan	None
South Korea	Japan	None
Singapore	Australia	None
Sweden	Sweden	France Germany Switzerland
Switzerland	Switzerland	France Germany Sweden
United Kingdom	United Kingdom	None
United States	United States	None

Disable Fallback to the United States

Before you can disable the final fallback to the United States, you must turn on Data 360. Data 360 is automatically provisioned as soon as a Data Cloud license is added to your Salesforce org. See Turn On Data 360.

To disable the final fallback to the United States, go to Einstein Setup and toggle Turn Off Model Provider Region Fallback. Instead of falling back to the United States, an error is returned. Requests are still tried on the primary and secondary model endpoints before an error is returned.

Batch Processing

To asynchronously generate large quantities of responses for prompt templates, you can use prompt template batch processing. These models support a global endpoint for native batch processing by Azure OpenAI.

GPT 4 Omni
GPT 4 Omni Mini
GPT 4.1

For these models with global endpoints, batch requests may be processed in regions outside your org’s region, even if non-batch LLM requests are restricted to specific in-region Azure endpoints. Batch processing doesn't follow geo-aware routing rules and doesn’t provide in-region enforcement.

For more information about batch processing, see Prompt Template Batch Processing in Apex in the Agentforce Developer Guide.

Geo-Aware Routing for OpenAI and Azure OpenAI

Required Editions

Azure OpenAI

Disable Fallback to the United States

Batch Processing

See Also

General Information

Required Cookies

Functional Cookies

Advertising Cookies

General Information

Required Cookies

Functional Cookies

Advertising Cookies

Cookie List

Product Area

Feature Impact

Edition

Experience

Geo-Aware Routing for OpenAI and Azure OpenAI

Required Editions

Azure OpenAI

Disable Fallback to the United States

Batch Processing

See Also