Research Article

A Systematic Comparison of Data Selection Criteria for SMT Domain Adaptation

Table 1

Proportions of domains of general corpus.

DomainSent. number%

News279,96224.60
Novel304,93226.79
Law48,7544.28
Miscellaneous504,39644.33

Total1,138,044100.00