Mulesoft MongoDB Connector - Aggregation Pipeline Limits

Veröffentlichungsdatum: Dec 22, 2025

Beschreibung

When using the Mulesoft MongoDB connector Execute command using the Mongo Aggregation command while a pipeline has a blank value so it looks like [], the full result set is not being returned.

You may experience inconsistencies when trying to retrieve large datasets using the ExecuteCommand operation with data similar to the following:

output application/json

---

{

aggregate: ‘myCollection’,

pipeline: [],

cursor: {

batchSize: 1000000

}

output application/json

---

{

aggregate: ‘myCollection’’,

pipeline: [ {

"$project": {

"awesomeColumn": {

"_id": 1

}

}],

cursor: {

batchSize: 1000000

}

n results

y results (where y < n)

Lösung

MongoDB has a series of limitations when trying to manage large datasets and this information is helpful with the MuleSoft connector.

Result Size restrictions: Each document in the result set is subject to the 16 megabyte BSON Document Size limit.
Numbers of Stages Restrictions: MongoDB 5.0 limits the number of aggregation pipeline stages allowed in a single pipeline to 1000.
Memory restrictions: Each individual pipeline stage has a limit of 100 megabytes of RAM.

Some of these restrictions can be configured on the MongoDB instance (that is, from the server-side, outside of the connector scope). The MongoDB Aggregation Pipeline Limits article can be consulted for more information about this.
When any of these limits are reached:

MongoDB will automatically paginate the requests to avoid performance issues.
The batchSize will be ignored by the database if needed.
There will be a different amount of results depending on the data that has to be retrieved for each row (more data => fewer results per batch).

SOLUTION

In MuleSoft, when experiencing inconsistencies when trying to retrieve large datasets using the ExecuteCommand operation, user-side pagination will need to be completed:

The ExecuteCommand operation returns a cursor as a result that will allow the user to retrieve more information from the database. A common paginated result can look something like this:

{
    "cursor": {
        "firstBatch": [
            {
                /... Data .../
            },
		...
        ],
        "id": 5831430585665131451,
        "ns": "myDatabase.myCollection"
    },
    "ok": 1.0
}

When the cursor.id has a value different than 0, it means that there are more results. The user can retrieve more results using the getMore command like this:

{
"getMore": 5831430585665131451, <= cursor.id received as result from the first request
"collection": "myCollection"
}

The result will look like this:

{
    "cursor": {
        "nextBatch": [
            {
                /... Data .../
            },
		...
        ],
        "id": 0,
        "ns": "myDatabase.myCollection"
    },
    "ok": 1.0
}

The value of the cursor.idcan be either 0 or the same id used for the request. This id will appear repeated on every request until there are no more batches to retrieve.

Nummer des Knowledge-Artikels

001122165

Konnten Sie Ihr Problem mithilfe dieses Artikels lösen?

Geben Sie uns Feedback, damit wir uns verbessern können.

Mulesoft MongoDB Connector - Aggregation Pipeline Limits

SOLUTION

General Information

Required Cookies

Functional Cookies

Advertising Cookies

General Information

Required Cookies

Functional Cookies

Advertising Cookies

Cookie List