Synthetic

Cluster Analysis

Datasets used in the paper “A novel optimization approach towards improving separability of clusters“

SD1 Download

SD2 Download

2. Datasets used in the paper “Robust clustering algorithm: The use of soft trimming approach“

3. Datasets used in the paper “Absolute indices for determining compactness, separability and number of clusters“

DA1 Download

DA2 Download

DA3 Download

Semi-Supervised Clustering (SSC)

Must-link (ML) and cannot-link (CL) pairs: sample data generated randomly using Gaussian distribution: there are 70 data points with six ML and six CL pairs. “Data_SSC_points” has the data points, and “Data_SSC_ML” and “Data_SSC_CL” present the location of data points with ML and CL pairs, respectively. The figure below illustrates the data set: the data points are grouped in four clusters where points in each cluster are denoted using the same color: orange for the points in the first cluster; green for the points in the second cluster; purple for the points in the third cluster, and black for the points in the fourth cluster. The points with ML pairs are joined using the blue solid lines, and those with CL pairs are presented using red dashed lines.