You are here:
Convert Text To Speech Action (Beta)
Add the Convert Text to Speech action to a flow to convert text to spoken audio.
Required Editions
| View supported editions. |
In Flow Builder, add an Action element to your flow. In the Action field, search for Convert Text to Speech, and select Convert Text To Speech.
Set Input Values
| Field | Description |
|---|---|
| Input Text | Required. The text to convert to spoken audio. |
| Voice Speed | Optional. The playback speed of the generated speech. |
| Voice Stability | Optional. The steadiness and consistency of the generated voice. Lower values can produce more expressive speech, while higher values produce more stable speech output. |
| Voice ID | Optional. The identifier of the voice used to generate the spoken audio. |
| File Output | Optional. Specifies whether the response returns audio as a file output instead of Base64-encoded audio. The default is false. |
Store Output Values
| Field | Description |
|---|---|
| Converted Audio | The spoken audio generated from the input text, returned as Base64-encoded output. |
Use this action in a Flow to convert text into spoken audio. By default, the output is returned in Base64-encoded format. You can optionally return audio as a file output by setting File Output to true, and use the generated audio in subsequent Flow elements or custom playback components.
Considerations
- The action consumes Flex Credits through the Digital Wallet based on the applicable rate card.
- The action is subject to invocation limits per minute. Requests that exceed the limit fail.
- The invocation limit is 100 requests per minute.
- The action returns a Base64-encoded audio string and doesn’t automatically play or store the generated audio.
- When file output is set to true, ensure that downstream elements handle file output appropriately.
- Processing time can vary based on input size and system load.
Did this article solve your issue?
Let us know so we can improve!

