Loading
Feature Disruption - Service Cloud VoiceRead More
Feature degradation | Gmail Email delivery failureRead More
Agentforce and Einstein Generative AI
Table of Contents
Select Filters

          No results
          No results
          Here are some search tips

          Check the spelling of your keywords.
          Use more general search terms.
          Select fewer filters to broaden your search.

          Search all of Salesforce Help
          Harmful Content Notification in a Prompt Response

          Harmful Content Notification in a Prompt Response

          Learn how Prompt Builder alerts you to potentially harmful content in a prompt template response.

          Required Editions

          Available in: Lightning Experience
          Available in: Enterprise, Performance, and Unlimited Editions with the Einstein for Platform, or Einstein or Agentforce for Sales or Service add-on, or Agentforce Foundations

          The Einstein Trust Layer scans the large language model (LLM) prompt responses for toxic language, including violent, abusive, hateful, or sexual language, imagery, or themes that can cause harm.

          When toxic language is detected, the harmful content badge appears in the Response section. The badge displays a warning count and highlights the specific verbiage that triggered the alert.

          If the Harmful Content badge appears, update and retest the prompt template to remove the toxic content. If the reassessed template no longer contains toxic content, the badge warning count returns to 0.

           
          Loading
          Salesforce Help | Article