Research Article

Heavyweight Statistical Alignment to Guide Neural Translation

Figure 1

The cost of training the baseline Transformer-L1 model.