Research Article
Identifying Heat Shock Protein Families from Imbalanced Data by Using Combined Features
Table 4
The predictive results of HSPs by using the combined feature of SAAC+DC+CTF+PseACS with and without SMOTE.
| Features with and without SMOTE (Y/N) | HSP families | OA (%) | HSP20 | HSP40 | HSP60 | HSP70 | HSP90 | HSP100 |
| PseACS+DC+SAAC+CTF | Y | Sn (%) | 100 | 98.33 | 100 | 100 | 100 | 100 | 99.72 | Sp (%) | 99.92 | 100 | 99.92 | 99.82 | 100 | 100 | MCC | 1 | 0.99 | 1 | 0.99 | 1 | 1 | Acc (%) | 99.93 | 99.72 | 99.93 | 99.85 | 100 | 100 | PseACS+DC+SAAC+CTF | N | Sn (%) | 94.35 | 98.89 | 81.13 | 90.29 | 75 | 91.36 | 94.91 | Sp (%) | 98.58 | 94.26 | 99.6 | 98.84 | 100 | 99.9 | MCC | 0.92 | 0.94 | 0.87 | 0.90 | 0.86 | 0.94 | Acc (%) | 97.89 | 96.93 | 98.26 | 97.75 | 99.4 | 99.59 |
|
|