Research Article

A Novel Deep Learning Method for Obtaining Bilingual Corpus from Multilingual Website

Table 1

Experiment set statistics.

Websiteslanguages#webpages#sentences

TianShanChinese249,2383,839,000
Uyghur48,907427,000
RenMinChinese451,9725,500,000
Uyghur99,578590,000
KunLunChinese44,046641,000
Uyghur27,419324,000