Research Article
Sublemma-Based Neural Machine Translation
Table 4
Statistical summary of the datasets.
| Russian/Vietnamese | Training | Development | Testing |
| --- | --- | --- | --- |
| Average sentence length | 16.1/18.1 | 16.2/21.2 | 16.2/21.3 |
| Unique tokens | 73205/25939 | 7202/2646 | 7120/2692 |
| All tokens | 766446/866175 | 24257/31741 | 24363/31948 |
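
The three statistics reported in Table 4 can be computed per language side along the lines of the minimal sketch below. It assumes plain whitespace tokenization and one sentence per line; the source does not state which tokenizer was used, so the exact counts may differ. The function name `corpus_stats` and its output keys are hypothetical and chosen only for illustration.

```python
from typing import Dict, Iterable, Union


def corpus_stats(sentences: Iterable[str]) -> Dict[str, Union[int, float]]:
    """Summarize one side of a parallel corpus.

    Assumption: tokens are separated by whitespace and each item is one sentence.
    Empty lines are skipped and do not count as sentences.
    """
    n_sentences = 0
    n_tokens = 0
    vocabulary = set()

    for line in sentences:
        tokens = line.split()  # whitespace tokenization (an assumption)
        if not tokens:
            continue
        n_sentences += 1
        n_tokens += len(tokens)
        vocabulary.update(tokens)

    # Average sentence length, rounded to one decimal place as in the table.
    average = round(n_tokens / n_sentences, 1) if n_sentences else 0.0

    return {
        "average_sentence_length": average,
        "unique_tokens": len(vocabulary),
        "all_tokens": n_tokens,
    }
```

Running the function once on the Russian side and once on the Vietnamese side of each split would give the paired values shown in each cell, written as Russian/Vietnamese.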