Research Article
Identifying Heat Shock Protein Families from Imbalanced Data by Using Combined Features
Table 2
The number of sequences in the independent dataset.
| Families | HGNC dataset | RICE dataset | Wang et al. | Sarkar et al. |
| HSP20 | 11 | 14 | — | HSP40 | 49 | — | — | HSP60 | 15 | 4 | — | HSP70 | 17 | 7 | 24 | HSP90 | 4 | 3 | — | HSP100 | — | 3 | — | Total | 96 | 31 | 24 |
|
|