Research Article

Predicting Interactions between Virus and Host Proteins Using Repeat Patterns and Composition of Amino Acids

Table 1

Results of 10-fold cross validation of SVM model on 1,072 PPIs between 36 RNA viruses and 812 human proteins with different ratios of positive to negative instances.

P : NDatasetSn (%)Sp (%)Acc (%)PPV (%)NPV (%)MCCAUC

1 : 1188.2497.3492.7997.0789.220.8590.963
281.0394.3687.793.4983.260.7610.931
377.7494.0485.8992.8880.860.7280.926
mean  SD82.34  4.3995.25  1.4988.79  2.9294.48  1.8584.45  3.510.78  0.060.94  0.02

1 : 2164.8997.3486.5292.4184.720.6930.893
258.3197.5784.4892.3182.40.6460.886
363.6496.0885.2789.0484.090.6610.891
mean  SD62.28  2.8597  0.6585.42  0.8491.25  1.5783.74  0.980.67  0.020.89  0.00

1 : 3146.2498.2885.2789.9484.580.580.850
246.8798.5985.6691.7284.770.590.863
349.3797.2885.3185.8385.220.5760.858
mean  SD47.49  1.3598.05  0.5685.41  0.1889.16  2.4784.86  0.270.58  0.010.86  0.01

Sn: sensitivity, Sp: specificity, Acc: accuracy, PPV: positive predictive value, NPV: negative predictive value, MCC: Matthews correlation coefficient, and AUC: the area under the ROC curve.