Feature Disruption - Service Cloud Voice

Feature degradation | Gmail Email delivery failure

Agentforce and Einstein Generative AI

Table of Contents

No results

Here are some search tips

Check the spelling of your keywords.
Use more general search terms.
Select fewer filters to broaden your search.

Search all of Salesforce Help

Agentforce and Einstein Generative AI

Harmful Content Notification in a Prompt Response

You are here:

Harmful Content Notification in a Prompt Response

Learn how Prompt Builder alerts you to potentially harmful content in a prompt template response.

Required Editions

Available in: Lightning Experience

Available in: Enterprise, Performance, and Unlimited Editions with the Einstein for Platform, or Einstein or Agentforce for Sales or Service add-on, or Agentforce Foundations

The Einstein Trust Layer scans the large language model (LLM) prompt responses for toxic language, including violent, abusive, hateful, or sexual language, imagery, or themes that can cause harm.

When toxic language is detected, the harmful content badge appears in the Response section. The badge displays a warning count and highlights the specific verbiage that triggered the alert.

If the Harmful Content badge appears, update and retest the prompt template to remove the toxic content. If the reassessed template no longer contains toxic content, the badge warning count returns to 0.

Did this article solve your issue?

Let us know so we can improve!