Research Article

Does Determination of Initial Cluster Centroids Improve the Performance of -Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm, Minimum Spanning Tree, and Hierarchical Clustering in an Applied Study

Table 3

Comparison among the hybrid and ordinary -means clustering method based on eleven evaluation criteria.

IndexesSSESiRPTDunnRIARIACFHIVI

Leukemia dataset
_Means5116.20.47020.8800.14310.88090.76170.93750.88480.761970.6477
+H2116.20.46500.8800.14310.88090.76170.93750.88480.761970.6477
+MST1116.80.46750.87190.16790.90920.81830.95310.91150.81840.5357
+GA4116.20.47020.88010.14310.88090.76170.93750.88480.761970.6477

Prostate dataset
_Means660.30.26770.51490.09690.62980.25990.76670.62470.26021.51
+H158.10.39440.61410.15490.59540.19800.73370.63640.20691.21
+MST262.10.39350.74980.22390.49190.00190.56670.59150.00221.51
+GA458.70.27960.53850.14980.71260.42470.83330.70310.42471.29

Colon dataset
_Means4161.470.52480.96500.14310.86500.730.92790.86380.73000.7411
+H2161.470.52480.96500.14310.86500.730.92790.86380.73000.7411
+MST3161.470.52480.96500.14310.86500.730.92790.86380.73000.7411
+GA2161.470.52480.96500.14310.86500.730.92790.86380.73000.7411

Haberman dataset
_Means6698.80.24770.47870.0230.4991-0.0020.51960.5483-0.00151.83
+H4684.40.27330.52560.0350.50380.00830.55230.55230.00851.82
+MST4702.80.38880.74270.0730.61890.12840.74510.72700.74510.1405
+GA5682.10.27510.53050.0390.4997-0.0010.52610.5488-0.0031.83

Iris dataset
_Means71400.45890.84460.026370.83220.62010.83330.74520.62011.079
+H3141.10.45540.83590.077560.84310.64510.85330.76220.64521.072
+MST5191.70.47870.89170.053090.71970.42900.57320.65050.44881.19
+GA31400.45890.84460.026370.83220.62010.83330.74520.62011.079

Wine dataset
_Means81589.10.24690.47880.13570.69150.37570.60670.62370.39271.42
+H21270.20.29050.54810.23230.95430.89750.96630.93190.89760.39
+MST41270.20.28490.54810.23230.95430.89750.96630.93190.89760.39
+GA41270.20.28490.54810.23230.95430.89750.96630.93190.89760.39

Glass dataset
_Means13687.40.34110.63690.058040.68910.19660.43460.40730.19662.8
+H2679.90.34580.64330.049060.69260.20360.43950.41160.20362.73
+MST4790.20.30210.57540.066440.65310.19080.35980.43270.19542.60
+GA10678.60.34270.63900.045020.68790.19460.47660.40620.19462.84

: number of iteration: ARI: adjusted rand index, ; RI: rand index, ); VI: variation of information, . AC: accuracy, ; Si: silhouette, ); HI: Huber’s Γ index, ; RPT: robustness-performance trade-off, ; +H: hierarchical -means clustering; +MST: minimum spanning tree -means clustering; +GA: genetic -means clustering.