Research Article

Selecting Negative Samples for PPI Prediction Using Hierarchical Clustering Methodology

Figure 5

Comparison of accuracy obtained in negative datasets for the two trained models: the SVM model trained using the training set formed by the GSP set and the GSN set obtained using the proposed hierarchical clustering method (clustered) and the SVM model trained using the training set where the GSN set was randomly selected (Rand. RBF-SVM) and the balanced RBF-SVM is the SVM model trained using the training set formed by the GSP set and the GSN set obtained using the approach to create a β€œbalanced” negative. Please note that Rtest_1, 𝑅 𝑑 𝑒 𝑠 𝑑 _2, 𝑅 𝑑 𝑒 𝑠 𝑑 _3, 𝑅 𝑑 𝑒 𝑠 𝑑 _4, 𝑅 𝑑 𝑒 𝑠 𝑑 _5, 𝑅 𝑑 𝑒 𝑠 𝑑 _6, 𝑅 𝑑 𝑒 𝑠 𝑑 _7, 𝑅 𝑑 𝑒 𝑠 𝑑 _8, and 𝑅 𝑑 𝑒 𝑠 𝑑 _9 correspond to: 𝑅 t e s t _ 1 3 , 𝑅 t e s t _ 2 3 , 𝑅 t e s t _ 3 2 , 𝑅 t e s t _ 4 3 , 𝑅 t e s t _ 5 3 , 𝑅 t e s t _ 6 3 , 𝑅 t e s t _ 7 3 , 𝑅 t e s t _ 8 3 , and 𝑅 t e s t _ 9 3 . And the β€œbalanced” negative set is created using the approach by Yu et al.[38].
897289.fig.005