Research Article

A Novel Deep Learning Method for Obtaining Bilingual Corpus from Multilingual Website

Table 3

Statistics of the size and precision of parallel sentences extracted from multilingual websites.

Model            Training corpus    #Sentences    Precision

Bitextor&LSTM    30,000             117,900       0.70
Bitextor&LSTM    40,000             124,200       0.70
Ours&LSTM        30,000             120,200       0.81
Ours&LSTM        40,000             127,900       0.82