Accelerated Hands-Free Field Execution With Speech-to-Text Integration
Give your sales reps a hands-free experience by configuring voice-driven scenarios in
the Consumer Goods Cloud mobile app. Sales reps can use their voice to capture survey data, take
notes, and update orders while online.
Required Editions
Available in: Lightning Experience
Available in: Agentforce 1, Einstein 1, Enterprise, and
Unlimited editions with Consumer Goods
Cloud Retail Execution and Agentforce for Consumer Goods Cloud add-on
licenses.
Admins can create seamless voice-driven workflows by setting up APIs in the mobile app
customization layer using the Visual Studio Code based Modeler. By orchestrating these APIs,
you can design workflows that transcribe verbal requests into structured, AI-driven actions.
The Speech-to-Text API captures the user’s voice and converts it into text, which can then be
passed to the Prompt Template API for further processing.
For these configurations to function, the mobile device must be online, and the Integrate
Agentforce setting in the Consumer Goods Cloud mobile app sync settings must be enabled.
The key configuration components are:
Verify network: Use DeviceManager.isOnlineAsync() to
check connectivity. Because the Prompt Template API requires an
internet connection, use this check to disable voice features when the device is offline,
which prevents errors and keeps the user experience smooth.
Capture input (Speech-to-Text): Use launchSpeechToTextAsync() to
integrate the native speech-to-text interface. This function captures the user’s spoken
words and returns the transcribed text to the application logic, ready to be used as a
parameter.
Process intent (Prompt Template): Use Facade.executePromptTemplateAsync
to send the inputs to a specific prompt template in Salesforce and retrieve the AI-generated
response. The API handles the communication, but customizers should define
the business logic that consumes the response, for example, capturing data or
dynamically querying the system. This functionality is supported on Android, iOS, and the
Modeler simulator.
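The three components above can be chained in sequence: check connectivity, capture speech, then send the transcript to a prompt template. The following is a minimal sketch of that orchestration, not the product's actual contract. The article names the APIs but does not give their signatures, so the VoiceApis interface, the parameter and return shapes, the VoiceResult type, and the runVoiceScenario and makeStubApis helpers are all assumptions introduced for illustration.

```typescript
// Hypothetical stand-ins for the three APIs named in this article.
// Their real parameters and return shapes are not documented here; these are assumptions.
interface VoiceApis {
  // Stands in for DeviceManager.isOnlineAsync()
  isOnlineAsync(): Promise<boolean>;
  // Stands in for launchSpeechToTextAsync(); assumed to resolve to the transcript
  launchSpeechToTextAsync(): Promise<string>;
  // Stands in for Facade.executePromptTemplateAsync; argument shape is assumed
  executePromptTemplateAsync(
    templateName: string,
    inputs: Record<string, string>
  ): Promise<string>;
}

// Hypothetical result type so callers can branch on the outcome.
type VoiceResult =
  | { status: "offline" }
  | { status: "no-input" }
  | { status: "ok"; transcript: string; response: string };

async function runVoiceScenario(
  apis: VoiceApis,
  templateName: string
): Promise<VoiceResult> {
  // 1. Verify network: skip the voice flow entirely when the device is offline.
  if (!(await apis.isOnlineAsync())) {
    return { status: "offline" };
  }

  // 2. Capture input: obtain the transcribed text and ignore empty captures.
  const transcript = (await apis.launchSpeechToTextAsync()).trim();
  if (transcript.length === 0) {
    return { status: "no-input" };
  }

  // 3. Process intent: pass the transcript to the prompt template.
  const response = await apis.executePromptTemplateAsync(templateName, {
    transcript,
  });
  return { status: "ok", transcript, response };
}

// Stubbed APIs so the sketch can run outside the mobile app.
function makeStubApis(online: boolean, spoken: string): VoiceApis {
  return {
    isOnlineAsync: async () => online,
    launchSpeechToTextAsync: async () => spoken,
    executePromptTemplateAsync: async (templateName, inputs) =>
      `${templateName} <- ${inputs["transcript"]}`,
  };
}
```

In a real customization, the stubs would be replaced by calls to the actual APIs, and the "ok" branch is where the business logic that consumes the response would go, such as writing a survey answer or updating an order line.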
Considerations for Speech To Text Integration
Review these important considerations for setting up speech-to-text. These include changing session limits and silence detection, designing the user interface and theme, stopping sessions automatically, handling errors, and setting up offline language packs.