Research Article

Identifying Heat Shock Protein Families from Imbalanced Data by Using Combined Features

Figure 1

The flowchart of the proposed method. SAAC: split amino acid composition; DC: dipeptide composition; CTF: conjoint triad feature; PseACS: pseudoaverage chemical shift; SMOTE: synthetic minority oversampling technique.