Research Article
Improved Feature-Selection Method Considering the Imbalance Problem in Text Categorization
Table 1
The term-to-category feature appearance matrix.
| Features | C1 (2369) | C2 (237) | C3 (578) | C4 (3964) | C5 (582) | C6 (478) | C7 (717) | C8 (286) | C9 (486) | C10 (283) |
| Billion | 345 | 60 | 251 | 1828 | 110 | 344 | 461 | 26 | 992 | 36 | Company | 2128 | 6 | 303 | 1515 | 22 | 6 | 14 | 42 | 24 | 9 | April | 622 | 113 | 121 | 1578 | 304 | 210 | 243 | 121 | 202 | 156 | Bank | 487 | 5 | 67 | 527 | 24 | 780 | 1138 | 19 | 141 | 2 | Oil | 271 | 17 | 2018 | 252 | 48 | 28 | 36 | 210 | 94 | 32 |
| Total | 3853 | 201 | 2760 | 5700 | 508 | 1368 | 1892 | 418 | 1453 | 235 |
|
|