Research Article

Identification of Dry Bean Varieties Based on Multiple Attributes Using CatBoost Machine Learning Algorithm

Table 2

Dataset statistics.

FeaturesAreaPerimeterMajorAxisLengthMinorAxisLengthAspectRationEccentricityConvexAreaEquivDiameterExtentSolidityRoundnessCompactnessShapeFactor1ShapeFactor2ShapeFactor3ShapeFactor4

Quantile statistics
Minimum20786.00534.72192.80122.511.0250.219021057.00162.680.57240.94460.4900.6410.00280.00060.4100.957
5-th percentile29398.35636.31229.51158.861.2130.565629732.30193.470.65540.97750.7640.6910.00340.00080.4780.986
Q140329.00743.56268.51184.641.4430.720940761.25226.600.72260.98480.8230.7580.00530.00100.5740.993
Median52981.50911.06350.62203.691.5750.772453858.50259.730.76370.98800.8660.7950.00630.00140.6320.996
Q375098.751070.86405.75242.331.7270.815476264.75309.220.78880.99000.9080.8310.00690.00200.6900.998
95-th percentile181172.551627.96615.47381.172.0780.8766183753.30480.290.81500.99210.9580.9070.00800.00270.8230.999
Maximum2546161985.37738.86460.202.4300.9114263261569.370.85840.99470.9880.9870.01050.00370.9751.000
Range2338301450.65546.06337.691.4050.6925242204406.690.28600.05010.4980.3470.00770.00310.5640.042
Interquartile range (IQR)34769.75327.30137.2457.700.2840.094535503.5082.620.06620.00520.0850.0730.00160.00100.1160.005
Descriptive statistics
Standard deviation46096.53296.60115.0066.840.2460.091546750.5085.160.04950.00490.06000.0610.0010.0010.0990.005
Coefficient of variation (CV)0.660.310.320.290.1540.12110.660.300.06580.00500.06950.0770.2290.4010.1550.005
Kurtosis2.010.560.461.090.0001.68402.020.890.77467.08530.4222−0.223−0.289−0.587−0.1046.656
Mean69718.99969.54362.43227.471.5970.755870681.22285.520.75250.98680.86370.7960.0060.0020.6380.994
Median absolute deviation (MAD)15923.50164.5370.3327.030.1410.046016298.0039.650.03010.00240.04240.0370.00100.0580.002
Skewness1.731.141.021.430.465−1.16751.731.32−0.9632−1.996−0.4420.156−0.4580.6530.358−2.035