Research Article
A Novel Deep Learning Method for Obtaining Bilingual Corpus from Multilingual Website
Table 1
Experiment set statistics.
| Websites | languages | #webpages | #sentences |
| TianShan | Chinese | 249,238 | 3,839,000 | Uyghur | 48,907 | 427,000 | RenMin | Chinese | 451,972 | 5,500,000 | Uyghur | 99,578 | 590,000 | KunLun | Chinese | 44,046 | 641,000 | Uyghur | 27,419 | 324,000 |
|
|