Research Article

Distant Supervision with Transductive Learning for Adverse Drug Reaction Identification from Electronic Medical Records

Table 5

The effectiveness comparison on fivefold cross-validation of text transformation across three types of document representation using MIL-iEM with soft decision making (MIL-iEM-S) and hard decision making (MIL-iEM-H).

ModelsSoft decision makingHard decision making
BTFTFIDFBTFTFIDF
PRF1PRF1PRF1PRF1PRF1PRF1

MIL-iEM-S-TMIL-iEM-H-T
S–P0.8580.3080.4540.8580.3080.4540.8570.3070.4520.8560.3270.4730.8560.3270.4730.8580.3300.477
S–W0.8790.5990.7120.8730.6000.7110.8710.5890.7030.8710.6510.7450.8630.6420.7360.8730.6500.745
L–P0.8900.4600.6060.8900.4600.6060.8820.4510.5970.8870.5000.6390.8870.5000.6390.8810.4980.636
L–W0.8680.6090.7160.8630.6110.7160.8730.6040.7140.8660.6590.7480.8630.6530.7440.8680.6620.752
BOW0.7550.6240.6830.7800.6280.6960.7650.6240.6870.7440.6420.6900.7300.6460.6850.7260.6420.682
MIL-iEM-S-TMIL-iEM-H-T
S–P0.8450.8360.8400.8440.8380.8410.8460.8300.8380.6200.9850.7610.6200.9850.7610.6210.9850.762
S–W0.7830.7920.7880.7840.8010.7920.7870.7430.7640.6860.9710.8040.6830.9690.8020.6910.9670.806
L–P0.8400.8160.8280.8400.8160.8280.8360.7990.8170.6510.9740.7810.6520.9740.7810.6510.9730.780
L–W0.7850.7970.7910.7960.7990.7980.7770.7140.7440.6930.9620.8050.6890.9600.8020.6960.9600.807
BOW0.6920.9270.7930.7490.8500.7970.7350.8610.7930.6320.9730.7660.6460.9620.7730.6490.9600.774
MIL-iEM-S-TMIL-iEM-H-T
S–P0.8400.8360.8380.8420.8340.8380.8410.8190.8300.6450.7550.6960.6440.7540.6950.6300.7570.688
S–W0.7730.7900.7820.7820.7920.7870.7820.7320.7560.6660.8250.7370.6490.8340.7300.6400.8560.732
L–P0.8330.8300.8320.8330.8280.8310.8350.8010.8180.6680.8140.7340.6680.8140.7340.6560.8410.737
L–W0.7830.7960.7890.7990.8050.8020.7780.7100.7420.6570.8270.7320.6420.8230.7220.6410.8630.736
BOW0.6910.9000.7820.7480.8340.7890.7340.8410.7840.6550.7460.6980.6660.8010.7270.6750.7940.730

B: binary frequency; TF: term frequency; TFIDF: term frequency-inverse document frequency.