Research Article

Selecting Negative Samples for PPI Prediction Using Hierarchical Clustering Methodology

Table 2

New sizes of datasets after filtering process.

Datasets Size of filtering training set with GSN set obtained using the presented hierarchical clustering Size of filtering training set with randomly selected GSN set Size of filtering training set with β€œbalanced’’ GSN set obtained from the approach by Yu et al. [38]

Binary-GS 933 937 987
Ito-core 680 686 700
LC-multiple 2362 2380 2468
Uetz-screen 574 584 594
Random negative dataset 1 4893 4894 4894
Random negative dataset 2 4895 4894 4898
𝑅 t e s t _ 1 3 4735 4995 4992
𝑅 t e s t _ 2 3 4788 4995 4994
𝑅 t e s t _ 3 3 4814 4991 4991
𝑅 t e s t _ 4 3 4844 4987 4992
𝑅 t e s t _ 5 3 4854 4983 4986
𝑅 t e s t _ 6 3 4816 4991 4994
𝑅 t e s t _ 7 3 4837 4985 4990
𝑅 t e s t _ 8 3 4797 4994 4994
𝑅 t e s t _ 9 3 4873 4994 4996