Research Article

Metabolomic Biomarker Identification in Presence of Outliers and Missing Values

Table 1

Performance measurement of different methods using average RMSE between original and reconstructed data matrix for different percentage of missing values in absence and presence of outliers using simulated datasets.

Methods with different conditionsWithout outliers3% outliers5% outliers7% outliers10% outliers15% outliers

For 10% missing valuesZero1.561.922.102.352.482.69
kNN0.771.421.712.052.232.43
RF0.821.451.732.072.242.45
Proposed0.771.151.191.241.371.48

For 15% missing valuesZero2.192.422.562.722.913.22
kNN0.911.471.772.052.362.51
RF0.971.511.802.082.392.52
Proposed0.921.241.281.341.501.64

For 20% missing valuesZero2.512.712.882.963.143.37
kNN1.001.581.942.122.452.69
RF1.071.631.982.152.482.75
Proposed1.011.291.341.411.591.71