Research Article

Improving Loanword Identification in Low-Resource Language with Data Augmentation and Multiple Feature Fusion

Table 5

Loanword identification experimental results on different methods.

DonorModelLoanword identification results (%)
RF1F1 (+)

RussianRule (+)72.0472.8969.3170.1870.6571.28
CRF (+)71.6372.4567.2868.1569.3970.23
BLSTM-CNN (+)71.4572.2670.5071.3170.9771.78
ClEmbedding (+)73.1273.9471.8472.6272.4773.27
Ours (+)74.8075.6273.6474.2074.2274.90

ArabicRule (+)69.0569.8468.1769.0268.6169.43
CRF (+)69.8370.6567.4268.2968.6069.45
BLSTM-CNN (+)68.7069.5269.8570.6769.2770.09
ClEmbedding (+)72.9573.7672.0372.8572.4973.30
Ours (+)73.9174.6272.3573.0673.1273.83

TurkishRule (+)72.0272.8669.8770.5070.9371.66
CRF (+)71.4672.2969.0269.9570.2271.10
BLSTM-CNN (+)71.2572.0470.4371.1870.8471.61
ClEmbedding (+)72.9673.6473.0873.8573.0273.74
Ours (+)75.2476.0974.3675.1474.8075.61

ChineseRule (+)70.3271.1369.7770.5870.0470.85
CRF (+)70.8571.6469.2470.0570.0470.84
BLSTM-CNN (+)70.5871.3469.9870.7970.2871.06
ClEmbedding (+)71.6772.4871.3572.1471.5172.31
Ours (+)74.3075.0772.8873.9573.5874.51