Dataset Paper

First Y-Short Tandem Repeat Categorical Dataset for Clustering Applications

Table 4

The summary of the distributions of the dataset items.

CategoryDataset itemsNumber of objectsNumber of classesThe distribution of objects

117515E (24), G (20), L (200), J (32), and R (475)
22674L (92), J (6), N (141), and R (28)
32633G (37), Group N (68), and Group T (158)

ā€‰42364D (112), F (64), M (42) and W (18)
251128G2 (30), G4 (8), G5 (10), G8 (18), G10 (17), G16 (10), G17 (12), and G29 (7)
ā€‰611214G2 (9), G10 (17), G15 (6), G18 (6), G20 (7), G23 (8), G26 (8), G28 (8), G34 (7), G44 (6), G35 (7), G46 (7), G49 (10), and G91 (6)