Customers may encounter situations where Agentforce-generated content appears incomplete or cut off. Common examples include partially generated email bodies, truncated action responses, or agent replies ending abruptly.These issues are typically caused by system-enforced size limits applied during Large Language Model (LLM) processing. Truncation can occur at multiple stages of the agent execution flow, depending on the size of the data being generated or returned. Understanding where truncation occurs and the applicable limits helps ensure more reliable agent behavior and better end-user experiences.
Truncation may occur in one or more of the following areas during agent execution:
Action Input Parameters
When the LLM generates a large input for an action—such as a long email body or extensive structured text—the content may be cut off if it exceeds the completion token limit.
Action Outputs
If an invoked action returns a large payload (for example, a query with many records or verbose text fields), the result may be truncated before being passed back to the LLM.
Agent Responses
When the agent attempts to return a long response directly to the user, generation stops once the token limit is reached.
| Component | Limit | What This Means |
|---|---|---|
| LLM response generation (including action inputs) | ~2048 tokens | The agent can only generate content up to this size in a single response or action input |
| Action results returned to the LLM | ~65,000 characters | If an action returns more than this limit, only the first 65,000 characters are retained |
| Output Size | Truncation? | Impact |
|---|---|---|
| Less than 65,000 characters | No | The full action output is passed to the LLM |
| 65,000 characters or more | Yes | The output is truncated to the first 65,000 characters; the LLM only processes this portion |
Important:
When truncation occurs, the LLM has no visibility into the discarded content. Any reasoning, summarization, or follow-up actions are based solely on the retained portion of the data.
Agent output generation is subject to a completion limit of approximately 2048 tokens. Once this limit is reached, the model stops generating further output, resulting in truncated responses.
Using HTML formatting—particularly <table> tags—can significantly accelerate reaching this limit. HTML tables introduce substantial token overhead due to repetitive opening and closing tags, nested markup, and structural elements. As a result, even moderately sized datasets may cause truncation when rendered using HTML tables.
If HTML table formatting is required, use one or more of the following approaches to reduce the likelihood of truncation:
Limit the number of records returned per request
Constrain the dataset at the source so the generated output remains within token limits.
Explicitly constrain output size in the user request
Examples:
“Output the first 10 records and their key fields in a table format using <table> tags.”
“Output the first 20 records and selected fields in a table format using <table> tags.”
<table> tags and instead use a simpler instruction such as:005298756

We use three kinds of cookies on our websites: required, functional, and advertising. You can choose whether functional and advertising cookies apply. Click on the different cookie categories to find out more about each category and to change the default settings.
Privacy Statement
Required cookies are necessary for basic website functionality. Some examples include: session cookies needed to transmit the website, authentication cookies, and security cookies.
Functional cookies enhance functions, performance, and services on the website. Some examples include: cookies used to analyze site traffic, cookies used for market research, and cookies used to display advertising that is not directed to a particular individual.
Advertising cookies track activity across websites in order to understand a viewer’s interests, and direct them specific marketing. Some examples include: cookies used for remarketing, or interest-based advertising.