Research Article

Predicting Interactions between Virus and Host Proteins Using Repeat Patterns and Composition of Amino Acids

Table 8

The number of host proteins (first block) or virus proteins (second block) shared by the training (TR) and test (TS) datasets used for assessing the applicability of the SVM model to new viruses and to new hosts.

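| Dataset | TR1 | TS1 | TR1 | TS2 | TR1 | TS3 | TR1 | TS4 | TR1 | TS5 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| #PPIs | 638 | 515 | 638 | 30 | 638 | 377 | 638 | 319 | 638 | 1578 |
| #Virus proteins | 25 | 11 | 25 | 12 | 25 | 10 | 25 | 11 | 25 | 46 |
| #Host proteins | 499 | 424 | 499 | 27 | 499 | 307 | 499 | 298 | 499 | 1056 |
| #Host proteins common to TR and TS | | 63 (14.9%) | | 5 (18.5%) | | 68 (22.1%) | | 22 (7.4%) | | 122 (11.6%) |

| Dataset | TR2 | TS6 | TR2 | TS7 | TR2 | TS8 | TR2 | TS9 | TR2 | TS10 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| #PPIs | 689 | 191 | 689 | 125 | 689 | 86 | 689 | 57 | 689 | 78 |
| #Virus proteins | 35 | 116 | 35 | 34 | 35 | 24 | 35 | 10 | 35 | 27 |
| #Host proteins | 522 | 141 | 522 | 87 | 522 | 79 | 522 | 38 | 522 | 64 |
| #Virus proteins common to TR and TS | | 9 (7.8%) | | 1 (2.9%) | | 4 (16.7%) | | 0 (0.0%) | | 0 (0.0%) |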

The numbers in parentheses represent the proportion of common proteins to proteins in test datasets.
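
The proportion described in this note can be illustrated with a minimal Python sketch. It assumes each dataset is available as a collection of protein identifiers. The function names `shared_proteins` and `percent_of_test` and the toy identifiers are hypothetical and are not taken from the article.

```python
def shared_proteins(train_ids, test_ids):
    """Return (count, percent) of distinct test proteins also present in training.

    The percentage is taken relative to the test dataset, matching the
    table note. Hypothetical helper, not the authors' code.
    """
    train, test = set(train_ids), set(test_ids)
    common = train & test
    percent = 100.0 * len(common) / len(test) if test else 0.0
    return len(common), round(percent, 1)


def percent_of_test(n_common, n_test):
    """Percentage of test proteins that are shared, from raw counts."""
    return round(100.0 * n_common / n_test, 1)


# Toy identifiers invented for illustration only.
train_hosts = ["P1", "P2", "P3", "P4"]
test_hosts = ["P3", "P4", "P5", "P6", "P7"]
count, percent = shared_proteins(train_hosts, test_hosts)
print(count, percent)  # 2 40.0

# Counts reported for TR1/TS1: 63 shared host proteins out of 424 in TS1.
print(percent_of_test(63, 424))  # 14.9
```

Applying `percent_of_test` to the reported counts reproduces the percentages in the table, for example 63 of 424 host proteins for TS1 and 9 of 116 virus proteins for TS6.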