Research Article

A High Accurate Multiple Classifier System for Entity Resolution Using Resampling and Ensemble Selection

Table 1

Accuracy comparison.

DatasetBaggingDREPGentle AdaBoostREARES

Synthetic databreast cancer0.8996 ± 0.00020.9088 ± 0.00020.9215 ± 0.00030.9045 ± 0.00010.9327 ± 0.0002
innosphere0.9647 ± 0.00010.9732 ± 00.9946 ± 0.00010.9832 ± 00.9911 ± 0
dermatology0.9541 ± 0.00030.9676 ± 0.00020.9649 ± 0.00020.9588 ± 0.00010.9825 ± 0.0001
ILPD0.9619 ± 0.00010.9721 ± 00.9964 ± 00.9579 ± 0.00010.9929 ± 0.0002
seismic0.9779 ± 00.9756 ± 00.9984 ± 00.9878 ± 00.9942 ± 0
abalone0.9812 ± 00.9812 ± 0.00010.9880 ± 00.9833 ± 00.9940 ± 0
vote0.7592 ± 0.00010.7495 ± 0.00050.9515 ± 0.00010.8155 ± 0.00040.8447 ± 0.0003
biodeg0.8036 ± 0.00010.7643 ± 0.00020.9643 ± 0.00010.8155 ± 0.00040.8214 ± 0.0002
glass0.9255 ± 0.00020.9275 ± 0.00040.9804 ± 0.00010.9216 ± 0.00060.9412 ± 0.0003
diabets0.9429 ± 0.00020.9457 ± 0.00080.9837 ± 00.8891 ± 0.00050.9457 ± 0

Real dataabt_buy0.9417 ± 00.9444 ± 00.9372 ± 0.00010.9465 ± 00.9535 ± 0
amazon_gp0.9669 ± 00.9640 ± 00.9525 ± 0.00010.9729 ± 00.9796 ± 0
dblp_acm0.9938 ± 00.9975 ± 00.9994 ± 00.9966 ± 00.9966 ± 0
dblp_scholar0.9809 ± 00.9750 ± 00.9783 ± 00.9817 ± 00.9848 ± 0
cora0.9407 ± 0.00010.9650 ± 00.9550 ± 00.9465 ± 00.9600 ± 0

win/tie/loss9/2/412/0/38/3/414/1/0