Review Article

Applications of Natural Language Processing in Biodiversity Science

Table 3

Existing IE systems for biology [17–26].

| System | Approach | Structure of Text | Knowledge in | Application domain | Reference |
| --- | --- | --- | --- | --- | --- |
| AkanePPI | shallow parsing | sentence-split, tokenized, and annotated | | protein interactions | [17] |
| EMPathIE | pattern matching | text | EMP database | enzymes | [18] |
| PASTA | pattern matching | text | biological lexicons | protein structure | [19] |
| BioIE | pattern matching | xml | dictionary of terms | biomedicine | [20] |
| BioRAT | pattern matching, sub-language driven | could be xml, html, text or asn.1, can do full-length pdf papers (converts to text) | dictionary for protein and gene names, dictionary for interactions, and synonyms; text pattern template | biomedicine | [21] |
| Chilibot | shallow parsing | not sure what was used in paper, but could be xml, html, text or asn.1 | nomenclature dictionary | biomedicine | [22] |
| Dragon Toolkit | mixed syntactic semantic | text | domain ontologies | genomics | [23] |
| EBIMed | pattern matching | xml | dictionary of terms | biomedicine | [24] |
| iProLINK | shallow parsing | text | protein name dictionary, ontology, and annotated corpora | proteins | [25] |
| LitMiner | mixed syntactic semantic | web documents | | Drosophila research | [26] |