Research Article
An Improved Transformer-Based Neural Machine Translation Strategy: Interacting-Head Attention
Table 7
Training time on the WMT17 EN-DE training dataset.
Note. The units h, m, and s stand for hour, minute, and second, respectively.