Research Article

Identifying Heat Shock Protein Families from Imbalanced Data by Using Combined Features

Table 2

The number of sequences in the independent dataset.

Families    HGNC dataset    RICE dataset      RICE dataset
                            (Wang et al.)     (Sarkar et al.)

HSP20       11              14                -
HSP40       49              -                 -
HSP60       15              4                 -
HSP70       17              7                 24
HSP90       4               3                 -
HSP100      -               3                 -
Total       96              31                24
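
As a quick consistency check on the table above, the following minimal Python sketch recomputes the column totals and a simple imbalance ratio (largest family count divided by smallest) for each dataset. It assumes the rows are read as three columns (HGNC, RICE from Wang et al., RICE from Sarkar et al.) and that a missing entry means the family has no sequences in that dataset. The names `FAMILIES`, `COUNTS`, `column_total`, and `imbalance_ratio` are illustrative and do not come from the article.

```python
# Counts transcribed from Table 2 under the three-column reading (an assumption).
# None marks a family with no entry in that dataset.
FAMILIES = ["HSP20", "HSP40", "HSP60", "HSP70", "HSP90", "HSP100"]
COUNTS = {
    "HGNC": [11, 49, 15, 17, 4, None],
    "RICE (Wang et al.)": [14, None, 4, 7, 3, 3],
    "RICE (Sarkar et al.)": [None, None, None, 24, None, None],
}


def column_total(values):
    """Sum the counts of one dataset, skipping families with no entry."""
    return sum(v for v in values if v is not None)


def imbalance_ratio(values):
    """Ratio of the largest to the smallest family count that is present."""
    present = [v for v in values if v is not None]
    return max(present) / min(present)


totals = {name: column_total(vals) for name, vals in COUNTS.items()}

if __name__ == "__main__":
    for name, vals in COUNTS.items():
        print(f"{name}: total={totals[name]}, imbalance={imbalance_ratio(vals):.2f}")
```

Under this reading the recomputed totals agree with the Total row (96, 31, and 24), which is what makes the three-column split of the table plausible.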