Research Article

Improved Feature-Selection Method Considering the Imbalance Problem in Text Categorization

Table 1

The term-to-category feature appearance matrix.

FeaturesC1
(2369)
C2
(237)
C3
(578)
C4
(3964)
C5
(582)
C6
(478)
C7
(717)
C8
(286)
C9
(486)
C10
(283)

Billion3456025118281103444612699236
Company2128630315152261442249
April6221131211578304210243121202156
Bank487567527247801138191412
Oil2711720182524828362109432

Total385320127605700508136818924181453235