Research Article

Recognition of 27-Class Protein Folds by Adding the Interaction of Segments and Motif Information

Table 3

Prediction accuracies of different parameters in the testing set (%).

FoldAA + ACCA + ACC + MA + ACC + M + PA + ACC + M + P (5-fold cross validation)The results of Liu et al. [27]Ding and Dubchak’s dataset [11]
A + ACC + M + P

121.4371.4371.4371.4375.00 (0.0252)78.5100.00
210.0070.0070.0080.0095.00 (0.0000)90.0100.00
360.0090.0091.1191.1192.86 (0.0026)75.575.00
44.1783.3375.0075.0081.63 (0.0000)54.187.50
512.5025.0012.5025.0018.75 (0.0187)25.077.78
60.0060.8752.1752.1775.00 (0.0342)39.166.67
787.0691.7689.4190.5989.47 (0.0114)82.379.55
811.1127.7827.7838.8941.67 (0.0000)55.575.00
945.8350.0050.0058.3370.83 (0.0421)70.884.62
1023.5335.2947.0652.9457.14 (0.0255)47.066.67
1124.3956.1048.7858.5470.73 (0.0185)43.937.50
120.0046.4364.2960.7154.39 (0.0096)60.789.47
130.0030.0050.0060.0066.67 (0.0426)10.050.00
1437.5056.2562.5062.5081.82 (0.0000)75.025.00
1553.3340.0040.0046.6767.74 (0.0136)40.0100.00
1686.9695.6598.91100.0098.92 (0.0144)89.166.67
170.0020.0020.0020.0020.00 (0.0097)20.091.67
1811.1130.5647.2261.1168.49 (0.0894)16.638.46
1937.5081.25100.00100.00100.00 (0.0300)81.262.96
2026.0372.6090.4189.0491.84 (0.0398)87.641.67
2130.5650.0075.0072.2272.60 (0.0217)52.775.00
2222.5040.0062.5057.5065.82 (0.0113)50.041.67
2327.2745.4590.9190.9195.46 (0.0107)78.757.15
240.0016.6750.0066.6741.67 (0.0373)50.025.00
2512.8256.4161.5461.5469.23 (0.0233)30.712.50
2651.5288.8990.9192.9386.00 (0.0104)67.662.96
27100.0075.6187.8092.6892.68 (0.0122)1.00096.30
43.6668.8076.2578.3881.16 (0.0028)66.570.24

Note: A means amino acid composition (20 dimensions), A + ACC means amino acid composition and the interaction of segments (164 dimensions), A + ACC + M means amino acid composition, the interaction of segments, and motif frequency (290 dimensions), and A + ACC + M + P means amino acid composition, the interaction of segments, motif frequency, and predicted secondary structure information (296 dimensions); means the overall accuracy; the standard deviation values are in the parenthesis of the sixth column, the penultimate column is the results of Liu et al. [27] with the same dataset, and the last column is our results of the dataset built by Ding and Dubchak [11].