Research Article

Sublemma-Based Neural Machine Translation

Table 4

Statistical summary of the datasets.

Russian/VietnameseTrainingDevelopmentTesting

Average sentence length16.1/18.116.2/21.216.2/21.3
Unique tokens73205/259397202/26467120/2692
All tokens766446/86617524257/3174124363/31948