Research Article

Distant Supervision with Transductive Learning for Adverse Drug Reaction Identification from Electronic Medical Records

Table 1

A list of previous studies on ADR identification from unstructured text.

Data sourceLiteratureYearSizeLabel numberLabeling methodNERMethod

Supervised learning
EMRAramaki et al. [10]20103012 notesA, O (2)HCRFPattern-based
Sohn et al. [11]2011237 notesA, O (2)HcTAKESPattern-based, DT C4.5
Henriksson et al. [26]2015400 notesA, I, O (3)HCRFWord embedding, RF
Casillas et al. [12]2016n/aA, O (2)HFreeLing-MedPattern-based, SVM, RF
LiteraturePeng et al. [16]201618,410 abstractsA, O (2)H, DSDictionary, tmChem, DNormFeature-based, SVM
Social mediaSegura-Bedmar et al. [33]201584,000 messagesA, I (2)DSGATEShallow linguistic kernel, distant supervision
Nikfarjam et al. [17]20158800 blog sentences, 3200 tweetsA, I, O (3)HCRFWord embedding, CRF
Jenhani et al. [18]201680,000 tweetsA, O (2)R, ODINDictionary, Stanford CoreNLPRule-base, feature-based, DT, SVM, LR, NB
Liu et al. [34]20161800 blog sentences, 500 tweetsA, O (2)HMetaMapFeature-based, tree kernel-based, ensemble method

Semisupervised learning
EMRTaewijit and Theeramunkong [13]20161.5 M sentencesA, I (2)DSMetaMapDistant supervision, OpenIE [35], pattern-based
LiteratureKang et al. [36]20141644 abstractsA, O (2)HPeregrineHierarchical graph-based, shortest path
Social mediaLiu and Chen [37]2015400 sentencesA, I, O (3)HMetaMapDependency tree, TSVM [38]

Unsupervised learning
EMRWang et al. [39]200925,074 notesNoneNoneMedLEECo-occurrence
LiteratureXu and Wang [14]2014119 M sentencesNoneNoneParse treePattern-based, ranking
Social mediaFeldman et al. [15]20150.1~1 M messagesNoneNoneDictionary, patternHPSG-based parser, postprocessing of relation merging

Labels: A = ADR; I = IND; O = other (ADR cause, ADR outcome, non-ADR, negated ADR, others); labeling method: DS = distant supervision, H = human; R = rule-based.