Research Article

Important Neighbors: A Novel Approach to Binary Classification in High Dimensional Data

Table 1

Misclassification rate (MC) of KIN versus KNN in simulation study.

Number of featuresDegree of sparsity (%)Sample sizeRatio of the first class
KNNKINKNNKIN
MCMCTC (%)#FPMCMCTC (%)#FP

10098%0.543.230.766%2.740.027.569%2.5
0.821.419.171%1.421.617.965%1.5
0.540.227.682%2.537.225.485%2.1
0.820.117.879%1.320.216.683%1.1
95%0.539.628.076%2.435.021.084%2.1
0.821.119.164%1.721.016.383%1.7
0.536.823.887%3.030.817.988%2.6
0.820.016.587%1.919.214.292%1.3
90%0.539.133.968%2.130.820.589%1.5
0.821.622.059%1.620.816.589%1.5
0.536.928.087%2.728.118.592%2.1
0.820.018.685%1.918.414.394%1.6

30098%0.543.535.468%2.840.120.482%2.6
0.822.121.760%2.222.917.080%2.4
0.541.225.377%5.336.617.487%3.2
0.820.818.377%3.121.014.988%2.7
95%0.543.641.957%2.137.623.885%2.1
0.822.323.354%2.423.519.583%2.1
0.541.833.079%3.633.819.490%2.8
0.821.121.970%3.021.216.790%2.6
90%0.543.641.853%2.336.623.785%2.1
0.832.129.764%2.130.322.985%1.9
0.541.533.176%3.734.119.991%2.7
0.828.328.077%3.526.216.992%2.5

50098%0.544.741.358%2.539.921.682%2.5
0.822.522.551%2.323.117.178%2.5
0.543.529.075%4.937.917.287%4.0
0.820.518.969%3.321.314.888%3.2
95%0.545.744.844%6.238.829.374%2.3
0.823.225.340%2.326.622.574%2.2
0.544.540.569%3.335.822.388%3.3
0.821.124.666%3.121.918.188%3.0
90%0.545.745.146%2.539.326.083%2.0
0.828.232.250%2.329.725.477%2.1
0.546.545.258%2.836.929.586%3.3
0.826.130.359%3.221.823.286%2.8