Research Article
A Systematic Comparison of Data Selection Criteria for SMT Domain Adaptation
Table 1
Proportions of domains of general corpus.
| Domain | Sent. number | % |
| News | 279,962 | 24.60 | Novel | 304,932 | 26.79 | Law | 48,754 | 4.28 | Miscellaneous | 504,396 | 44.33 |
| Total | 1,138,044 | 100.00 |
|
|