Research Article

Evaluating Word Representation Features in Biomedical Named Entity Recognition Tasks

Table 1

Counts of different types of entities in two corpora used in this study.

Corpus BioCreAtIvE II GMJNLPBA
Gene/proteinTotalProteinDNARNACell lineCell typeTotal

Training18,26518,26530,2699,5349513,8306,71851,301
Test6,3316,3315,0671,0561185001,9218,662