Research Article

A Novel Deep Learning Method for Obtaining Bilingual Corpus from Multilingual Website

Table 2

The size and accuracy of obtaining parallel sentences in different number of training corpus.

ModelThe number of training parallel sentences
2,0005,00010,00020,00040,000

LSTMsize13,00033,00065,00092,000126,000
accuracy0.60.710.780.810.82
C-BiRNNsize14,00028,00058,00086,000121,000
accuracy0.580.630.680.700.72