Structured Clustering (Beta)

Use clustering in Data 360 to uncover hidden patterns and discover natural groupings in structured data based on similarities in the fields that you select. Clustering identifies emerging customer profiles, recurring support patterns, and unexpected behavioral trends.

After training, the model assigns each record to a cluster ID and a system-generated cluster label. For example, a model trained on purchase data identifies groups such as "Loyal Customers," "Big Spenders," or "At-Risk Customers" automatically. The trained model maps new records consistently to existing clusters over time. It evaluates incoming data and assigns a similarity score that indicates how closely each record matches its cluster. Data 360 adds these output fields to your dataset. Apply the output fields to drive segmentation, analytics, and activation across Salesforce.

Note Clustering is a beta service that is subject to the Beta Services Terms at Agreements — Salesforce.com or a written Unified Beta Agreement if executed by Customer, and applicable terms in the Product Terms Directory. Use of this beta service is at the Customer's sole discretion.

Review these clustering goals.

Segment customers or records into meaningful groups automatically
Discover patterns and similarities across data attributes
Identify high-value, at-risk, or behavior-based segments
Enrich datasets with cluster IDs and labels for targeting, personalization, and analysis

Clustering Settings

To use clustering, turn on the Structured Clustering (Beta) from Feature Manager in Data Cloud Setup.

Feature Manager in the Data Cloud Setup with the Structured Clustering (Beta) feature settings.

Clustering Capabilities

The K-means algorithm groups records into a defined number of clusters based on similarity. The model defaults to 3 clusters, but you can define this setting manually. To cluster noisy or real-world data and group outliers into a separate catch-all cluster, use the HDBScan algorithm.

Review these clustering capabilities.

Model building and training with structured data model objects (DMOs) in the AI Models tab
Cluster quality evaluation with training metrics, such as cohesiveness score, distinctness score, and silhouette score
Cluster summaries and representative sample records that show which field values characterize each cluster
Custom cluster labels assigned to override system-generated labels
Clustering inference on new data via flows to assign records to existing clusters
Inference outputs, such as cluster ID, cluster label, similarity score, and top factors

Cluster Quality Metrics

To assess whether your clusters are meaningful, review these metrics after model training.

Cluster Metrics
METRIC	whAT IT MeASURES
Cohesiveness Score	How similar records within a cluster are to each other. Higher is better.
Distinctness Score	How different clusters are from each other. Lower values indicate more distinct clusters.
Silhouette Score	Combined measure of cohesion and separation, ranging from -1 to 1. Scores above 0.7 indicate well-defined clusters.

If scores are low, remove variables that create noise, adjust the number of clusters, or retrain the model.

When you run a flow on new data during inference, each record receives these outputs.

A cluster ID
A cluster label
A similarity score

You can assign these outputs to records, use existing flow components or the ingestion API.

Considerations

Review these considerations.

Results vary depending on the fields and parameters used.
Very small or highly similar datasets produce less distinct clusters.
Relevant and high-quality input fields provide better clustering results.
Periodic retraining keeps clusters aligned with changing data.

Did this article solve your issue?

Let us know so we can improve!

Structured Clustering (Beta)

Clustering Settings

Clustering Capabilities

Cluster Quality Metrics

Considerations

General Information

Required Cookies

Functional Cookies

Advertising Cookies

General Information

Required Cookies

Functional Cookies

Advertising Cookies

Cookie List

Product Area

Feature Impact

Edition

Experience

Structured Clustering (Beta)

Clustering Settings

Clustering Capabilities

Cluster Quality Metrics

Considerations