Research Article

Natural Language Processing Algorithms for Normalizing Expressions of Synonymous Symptoms in Traditional Chinese Medicine

Table 5

Model performance on TDS test data sets.

ModelAccuracyPrecisionRecallF1-score

Encoder (Char)-Decoder (Char)0.7980 ± 0.0078a,b0.7876 ± 0.0147a,b0.7678 ± 0.0088a,b0.7775 ± 0.0106a,b
Encoder (Char)-Decoder (Label)0.7974 ± 0.0050a,b0.8060 ± 0.0081a,b0.7690 ± 0.0062a,b0.7870 ± 0.0056a,b
Encoder (Word)-Decoder (Label)0.7892 ± 0.0069a,b0.7979 ± 0.0100a,b0.7595 ± 0.0066a,b0.7782 ± 0.0074a,b
Encoder (Word)-Decoder (Word)0.7904 ± 0.0079a,b0.7805 ± 0.0122a,b0.7594 ± 0.0078a,b0.7698 ± 0.0092a,b
Encoder (Char)-Classification0.7559 ± 0.0056c0.8560 ± 0.0125c0.7278 ± 0.0057c0.7866 ± 0.0058c
Encoder (Word)-Classification0.7652 ± 0.0042c0.8557 ± 0.0065c0.7364 ± 0.0038c0.7915 ± 0.0028c
BERT-UniLM (Char)0.8274 ± 0.0087c0.8152 ± 0.0115c0.8043 ± 0.0082c0.8097 ± 0.0094c
BERT-UniLM (Label)0.8248 ± 0.0045c0.8230 ± 0.0066c0.7970 ± 0.0056c0.8098 ± 0.0037c
BERT-Classification0.8568±0.00290.8870±0.00390.8298±0.00370.8574±0.0026

Note. The results are expressed as mean ± SD, and the threshold value of the sigmoid function was 0.1. a, compared with BERT-UniLM (Char); b, compared with BERT-UniLM (Label). c, compared with BERT-Classification.