Research Article

Novel Automated K-means++ Algorithm for Financial Data Sets

Table 5

Evaluate the quality of clustering according to external validity indexes.

Dataset and algorithmSet matching measuresPair-counting measuresEntropy
PurityRecallPrecisionF-measureRand-indexJaccard-index

S_1 K-means0.870.680.620.650.840.480.62
S_1 K-means++0.880.780.780.780.890.600.46
S_1 SDK-means++0.980.960.980.970.920.950.06
S_2 K-means0.820.800.700.750.920.590.57
S_2 K-means++0.860.830.840.830.920.710.31
S_2 SDK-means++0.970.970.980.970.990.960.08
S_3 K-means0.870.260.830.400.650.250.49
S_3 K-means++0.880.280.860.420.660.260.47
S_3 SDK-means++0.900.330.880.480.690.320.38
S_4 K-means0.860.380.830.520.830.350.56
S_4 K-means++0.860.360.830.500.830.340.52
S_4 SDK-means++0.900.490.910.640.870.470.41
S_5 K-means0.850.290.770.420.800.270.53
S_5 K-means++0.880.290.800.430.800.270.49
S_5 SDK-means++0.890.320.880.470.820.310.43
RMSE & K-means0.0180.2170.0810.1340.0880.1290.043
RMSE & K-means++0.0090.2440.0280.1760.0910.1840.073
RMSE & SDK-means++0.0380.2920.0420.2230.1010.2930.165