Research Article

Feature Reduction Based on Genetic Algorithm and Hybrid Model for Opinion Mining

Table 17

Confusion matrix of sentiment classification of the two-class reviews using GA algorithm based bagging technique.

Class/attribute weightAttribute weight ≥ 0.510Attribute weight ≥ 0.400Attribute weight ≥ 0.300Attribute weight ≥ 0.200Attribute weight ≥ 0.100
Actual neg.Actual pos.Actual neg.Actual pos.Actual neg.Actual pos.Actual neg.Actual pos.Actual neg.Actual pos.

Book
Predicted neg. (type II error)43 08 (8.79)6105 (5.49)6809 (9.89)7812 (13.18)8209 (9.89)
Predicted pos. (type I error)48 (48.00)9230 (30.00)9523 (23.00)9113 (13.00)8809 (9.00)91
Overall error rate (%)29.3129.3116.7513.0809.42
Average accuracy (%)70.7670.7683.2986.9290.63

DVD
Predicted neg. (type II error)7109 (9.00)7007 (7.00)7207 (7.00)8013 (13.00)8711 (11.00)
Predicted pos. (type I error)29 (29.29)9030 (30.30)9228 (28.28)9220 (20.20)8613 (13.13)88
Overall error rate (%)19.0918.5917.5816.5812.06
Average accuracy (%)80.9581.4282.4583.4788.00

Electronics
Predicted neg. (type II error)4805 (5.00)6805 (5.00)7302 (2.00)8708 (8.00)9007 (7.00)
Predicted pos. (type I error)52 (52.00)9532 (32.00)9527 (27.00)9813 (13.00)9210 (10.00)93
Overall error rate (%)28.5018.5014.0010.5008.50
Average accuracy (%)71.5081.5085.5089.5091.05

Kitchen
Predicted neg. (type II error)7945 (49.45)7731 (34.06)8125 (27.47)8021 (23.07)7210 (10.98)
Predicted pos. (type I error)12 (12.12)5414 (14.14)6810 (10.10)7411 (11.11)7819 (19.19)89
Overall error rate (%)30.0023.6818.4216.8415.26
Average accuracy (%)70.0076.3281.5883.1684.74

Movie
Predicted neg. (type II error)6220 (20.00)7616 (16.00)919 (9.00)944 (4.00)973 (3.00)
Predicted pos. (type I error)38 (38.00)8024 (24.00)849 (9.00)916 (6.00)963 (3.00)97
Overall error rate (%)29.0020.0007.0005.0003.00
Average accuracy (%)71.0080.0091.0095.0097.00