Research Article

Phenonizer: A Fine-Grained Phenotypic Named Entity Recognizer for Chinese Clinical Texts

Table 9

The symptom extraction performance of models on heterogenous data (TCM-HB).

Training datasetTCM-HBTCM-HN
ModelsPrecisionRecallF1-scorePrecisionRecallF1-score

BiLSTM-CRF0.76820.78650.77720.65120.58650.6171
GloVeWiki-BiLSTM-CRF0.77010.78700.77850.65100.60970.6297
GloVeMedical-BiLSTM-CRF0.77050.79570.78290.65750.61040.6331
W2VWiki-BiLSTM-CRF0.76860.79640.78220.64360.62610.6347
W2VMedical-BiLSTM-CRF0.77340.79960.78630.66230.61390.6372
BERT-CRF0.77190.81790.79430.65660.61980.6377
BERT-BiLSTM0.76880.81450.79100.64000.64060.6403
Phenonizer0.77270.81890.79520.64380.64460.6442