Research Article

Training Method and Device of Chemical Industry Chinese Language Model Based on Knowledge Distillation

Table 3

Experimental results for the model when different layers are omitted.

Distillation details   Layer                 Acc (%)   F1 (%)

BiLSTM-KD              All layers            93.13     91.07
                       No embedding layer    87.44     85.69
                       No hidden layer       74.97     71.57
                       No prediction layer   84.22     81.26
BiLSTM                 -                     72.39     70.06
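
To make the size of each effect easier to read, the sketch below computes how far each setting falls below the full BiLSTM-KD model, in percentage points. The numbers are copied from Table 3; the dictionary keys and the helper name drops_vs_full are illustrative labels chosen here, not identifiers from the paper.

```python
# Values copied from Table 3 as (accuracy %, F1 %).
# The keys are illustrative labels for the table rows.
results = {
    "All layers": (93.13, 91.07),
    "No embedding layer": (87.44, 85.69),
    "No hidden layer": (74.97, 71.57),
    "No prediction layer": (84.22, 81.26),
    "BiLSTM (no distillation)": (72.39, 70.06),
}


def drops_vs_full(table, baseline="All layers"):
    """Return {setting: (acc_drop, f1_drop)} in percentage points,
    measured against the baseline row."""
    base_acc, base_f1 = table[baseline]
    return {
        name: (round(base_acc - acc, 2), round(base_f1 - f1, 2))
        for name, (acc, f1) in table.items()
        if name != baseline
    }


drops = drops_vs_full(results)
for name, (d_acc, d_f1) in drops.items():
    print(f"{name}: -{d_acc:.2f} Acc, -{d_f1:.2f} F1")
```

Among the three ablations, omitting the hidden layer costs the most (18.16 points of accuracy), which leaves that variant only 2.58 points above the undistilled BiLSTM.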