Research Article

Heavyweight Statistical Alignment to Guide Neural Translation

Table 1. Some basic statistics of the datasets.

| English-Vietnamese | Training | Development | Testing |
|---|---|---|---|
| Sentence pairs | 42026 | 1482 | 1527 |
| Average lengths | 19.2–26.2 | 17.8–24.5 | 20.6–28.3 |
| Words | 806456–1099205 | 26315–36276 | 31513–43286 |
| Dictionaries | 36672–16441 | 4981–2720 | 6211–3462 |