Research Article
SubRF_Seq: Identification of Sub-Golgi Protein Types with Random Forest with Partial Sequence Information
Table 4
Performance of imbalance of positive and negative samples of the data set on the classification effect.
| Cutting | Classifier | Encoding | SEV | 10-flod CV | Acc (%) | AUC | Acc (%) | AUC |
| 20 + 3 | RF | EAAC | 82.81 | 0.8544 | 78.13 | 0.7813 | 4 + 17 | 85.94 | 0.8492 | 67.95 | 0.6161 | 18 + 25 | 82.81 | 0.7828 | 79.69 | 0.4615 | 20 + 11 | 79.69 | 0.7828 | 77.75 | 0.7681 | 11 + 11 | 82.81 | 0.7730 | 80.06 | 0.5716 |
|
|