Research Article
Phenonizer: A Fine-Grained Phenotypic Named Entity Recognizer for Chinese Clinical Texts
Table 3
The annotation results of each dataset and the proportion of machine annotations.
| Datasets | No. of texts | No. of entities | No. of entities annotated manually | No. of entities annotated by machine | Machine annotation proportion (%) |
| TCM-HN | 29,636 | 318,337 | 51,925 | 266,412 | 83.69 | COVID-19 | 6,105 | 201,567 | 39,796 | 161,771 | 80.25 | TCM-HB | 18,555 | 247,291 | 52,797 | 194,494 | 78.65 |
|
|