Review Article

Extracting Parallel Sentences from Nonparallel Corpora Using Parallel Hierarchical Attention Network

Table 3

The precision (P), recall (R), and F1 scores of extracting parallel sentences.

DataEn-FrEn-DeEn-Zh
SMTNMTSMTNMTSMTNMT

Baseline23.7122.3221.6221.3521.117.32
Top20K24.84 (+1.13)25.42 (+3.1)23.38 (+1.76)25.06 (+3.71)23.21 (+2.11)24.56 (+7.24)
Top50K26.16 (+2.45)26.35 (+4.8)24.63 (+3.01)26.42 (+5.07)24.66 (+3.56)25.89 (+8.57)
Top100K28.31 (+3.6)27.48 (+5.03)25.72 (+4.1)27.67 (+6.32)25.78 (+4.68)27.02 (+9.7)
Top200K29.37 (+4.66)29.51 (+6.06)26.76 (+5.14)28.73 (+7.38)26.86 (+5.76)28.13 (+10.81)
Top300K30.39 (+5.68)30.55 (+8.10)27.79 (+6.17)29.80 (+8.45)27.91 (+6.81)29.18 (+11.86)
Top400K30.41 (+6.70)30.57 (+9.12)28.83 (+7.21)30.82 (+9.47)28.92 (+7.82)30.21 (+12.89)
Top500K31.56 (+7.85)31.58 (+10.13)30.14 (+8.52)31.85 (+10.50)29.93 (+8.83)31.22 (+13.9)