| Datasets | Domain | Annotated | No. of texts | No. of sentences | No. of entities | No. of words | No. of vocabularies |
| --- | --- | --- | ---: | ---: | ---: | ---: | ---: |
| WikipediaChinese | General | False | – | 3,745,841 | – | 337,063,331 | 14,261 |
| TCM-HN | Clinical | True | 29,636 | 155,566 | 318,337 | 14,009,494 | 3,008 |
| COVID-19 | Clinical | True | 6,105 | 29,663 | 201,567 | 1,726,665 | 2,248 |
| TCM-HB | Clinical | True | 18,555 | 105,075 | 247,291 | 6,394,902 | 2,778 |