Review Article

Biomedical Relation Extraction: From Binary to Complex

Table 6

Features sets and classifiers employed in machine learning-based approaches for event extraction.

References Feature sets Classifier

[63] (1) N-grams (merging the attributes of 2 to 4 consecutive tokens); (2) individual component features for each token and edge in a path; (3) semantic node features (the attributes of the two terminal event/entity nodes of the potential event argument edge); (4) frequency features (the length of the shortest path and the number of named entities and event nodes, per type, in the sentence) Multiclass SVM

[83](1) Trigger type; (2) argument terms; (3) argument type; (4) argument supertype; (5) trigger and argument; (6) trigger and argument POS; (7) parse tree path; (8) voice of sentence (active or passive); (9) trigger and argument partial paths; (10) trigger subcategorization SVM

[73] (1) Words and POS in a window around the trigger; (2) distances between the trigger and the two nearest annotated proteins (left and right) and the theme candidate C4.5 decision tree

[55] (1) Three stemmed consecutive words from the subsentence spanning the event; (2) lexical and syntactic information of triggers; (3) size of the subgraph; (4) bag of words; (5) length of the subsentence; (6) extra features for regulation events; (7) vertex walks which consist of two vertices and their connecting edge SVM