Table of Contents Author Guidelines Submit a Manuscript
The Scientific World Journal
Volume 2014, Article ID 760301, 12 pages
http://dx.doi.org/10.1155/2014/760301
Research Article

Unsupervised Quality Estimation Model for English to German Translation and Its Application in Extensive Supervised Evaluation

Natural Language Processing & Portuguese-Chinese Machine Translation Laboratory, Department of Computer and Information Science, University of Macau, Macau

Received 30 August 2013; Accepted 2 December 2013; Published 28 April 2014

Academic Editors: J. Shu and F. Yu

Copyright © 2014 Aaron L.-F. Han et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. W. Weaver, “Translation,” in Machine Translation of Languages: Fourteen Essays, W. Locke and A. Donald Booth, Eds., pp. 15–23, 1955. View at Google Scholar
  2. F. J. Och, “Minimum error rate training for statistical machine translation,” in Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, pp. 160–167, ACL Press, July 2003.
  3. J. B. Marino, R. E. Banchs, J. M. Crego et al., “N-gram-based machine translation,” Computational Linguistics, vol. 32, no. 4, pp. 527–549, 2006. View at Publisher · View at Google Scholar · View at Scopus
  4. G. Lembersky, N. Ordan, and S. Wintner, “Language models for machine translation-original vs. translated texts,” Journal of Computational Linguistics, vol. 38, no. 4, 2012. View at Google Scholar
  5. K. Papineni, S. Roukos, T. Ward, and W. J. Zhu, “BLEU: a method for automatic evaluation of machine translation,” in Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 311–318, ACL Press, 2002.
  6. S. Banerjee and A. Lavie, “METE-OR: an Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments,” in Proceedings of the 43th Annual Meeting of the Association of Computational Linguistics, pp. 65–72, ACL Press, 2005.
  7. M. Snover, B. Dorr, R. Schwartz, L. Micciulla, and J. Makhoul, “A study of translation edit rate with targeted human annotation,” in Proceedings of the 7th Conference of the Association for Machine Translation in the Americas (AMTA '06), pp. 223–231, AMTA Press, August 2006. View at Scopus
  8. B. Chen and R. Kuhn, “Amber: a modi-fied bleu, enhanced ranking metric,” in Proceedings of the 6th Workshop on Statistical Machine Translation of the Association for Computational Linguistics, pp. 71–77, ACL Press, 2011.
  9. Y. Zhang, S. Vogel, and A. Waibel, “Interpreting BLEU/NIST scores: how much improvement do we need to have a better system?” in Proceedings of the Language Resources and Evaluation (LREC '04), ELRA Press, Lisbon, Portugal, 2004.
  10. M. Denkowski and A. Lavie, “Meteor 1.3: automatic metric for reliable optimization and evaluation of machine translation systems,” in Proceedings of the 6th Workshop on Statistical Machine translation of the Association for Computational Linguistics (ACL-WMT '11), pp. 85–91, ACL Press, 2011.
  11. B. Chen, R. Kuhn, and G. Foster, “Improving AMBER, an MT evaluation metric,” in Proceedings of the 7th Workshop on Statistical Machine translation of the Association for Computational Linguistics (ACL-WMT '12), pp. 59–63, ACL Press, 2012.
  12. G. Doddington, “Automatic evaluation of machine translation quality using n-gram co-occurrence statistics,” in Proceedings of the 2nd International Conference on Human Language Technology Research (HLT '02), pp. 138–145, Morgan Kaufmann, 2002.
  13. B. Wong and C. Kit, “ATEC: automatic evaluation of machine translation via word choice and word order,” Machine Translation, vol. 23, no. 2-3, pp. 141–155, 2009. View at Publisher · View at Google Scholar · View at Scopus
  14. A. L. F. Han, D. F. Wong, and L. S. Chao, “LEPOR: a robust evaluation metric for machine translation with augmented factors,” in Proceedings of the 24th International Conference on Computational Linguistics, pp. 441–450, International Committee of Computational Linguistics Press, December 2012.
  15. A. L. F. Han, D. F. Wong, L. S. Chao et al., “Language-independent model for machine translation evaluation with reinforced factors,” in Proceedings of the 14th International Conference of Machine Translation Summit, pp. 215–222, International Association for Machine Translation Press, September 2013.
  16. A. L. F. Han, D. F. Wong, L. S. Chao, L. He, S. Li, and L. Zhu, “Phrase tagset mapping for French and English treebanks and its application in machine translation evaluation,” in Language Processing and Knowledge in the Web, vol. 8105 of Lecture Notes in Computer Science, pp. 119–131, 2013. View at Google Scholar
  17. A. L. F. Han, Y. Lu, D. F. Wong, L. S. Chao, L. He, and J. Xing, “Quality estimation for machine translation using the joint method of evaluation criteria and statistical modeling,” in Proceedings of the 8th Workshop on Statistical Machine Translation (ACL-WMT '13), pp. 365–372, Association for Computational Linguistics Press, August 2013.
  18. M. Gamon, A. Aue, and M. Smets, “Sentence-level MT evaluation without reference translations: beyond language modeling,” in Proceedings of the 10th Annual Conference on European Association for Machine Translation (EAMT '05), pp. 103–111, EAMT Press, May 2005. View at Scopus
  19. J. S. Albrecht and R. Hwa, “Regression for sentence-level MT evaluation with pseudo references,” in Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL '07), pp. 296–303, ACL Press, June 2007. View at Scopus
  20. L. Specia and J. Giménez, “Combining confidence estimation and reference-based metrics for segment-level MT evaluation,” in Proceedings of the 9th Biennial Conference of the Association for Machine Translation in the Americas (AMTA '10), AMTA Press, November 2010. View at Scopus
  21. L. Specia, D. Raj, and M. Turchi, “Machine translation evaluation versus quality estimation,” Machine Translation, vol. 24, no. 1, pp. 39–50, 2010. View at Publisher · View at Google Scholar · View at Scopus
  22. M. Popović, D. Vilar, E. Avramidis, and A. Burchardt, “Evaluation without references: IBM1 scores as evaluation metrics,” in Proceedings of the 6th Workshop on Statistical Machine Translation, Association for Computational Linguistics (ACL-WMT '11), pp. 99–103, ACL Press, 2011.
  23. Y. Mehdad, M. Negri, and M. Federico, “Match without a referee: evaluating MT adequacy without reference translations,” in Proceedings of the 7th Workshop on Statistical Machine Translation, ACL Press, 2012.
  24. E. Avramidis, “Comparative quality estimation: automatic sentence-level ranking of multiple machine translation outputs,” in Proceedings of 24th International Conference on Computational Linguistics (COLING '12), pp. 115–132, ACL Press, 2012.
  25. C. Callison-Burch, P. Koehn, C. Monz, M. Post, R. Soricut, and L. Specia, “Findings of the 2012 workshop on statistical machine translation,” in Proceedings of the 7th Workshop on Statistical Machine Translation, pp. 10–51, ACL Press, 2012.
  26. M. Felice and L. Specia, “Linguistic features for quality estimation,” in Proceedings of the 7th Workshop on Statistical Machine Translation, pp. 96–103, ACL Press, 2012.
  27. S. Petrov, D. Das, and R. McDonald, “A universal part-of-speech tagset,” in Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC '12), European Language Resources Association (ELRA) Press, 2012.
  28. M. Marcus, B. Santorini, and M. A. Marcinkiewicz, “Building a large annotated corpus of English: the Penn Treebank,” Computational Linguistics, vol. 19, no. 2, pp. 313–330, 1993. View at Google Scholar
  29. W. Skut, B. Krenn, T. Brants, and H. Uszkoreit, “An annotation scheme for free word order languages,” in Proceedings of the Applied Natural Language Processing, ACL Press, 1997.
  30. S. Petrov, L. Barrett, R. Thibaux, and D. Klein, “Learning accurate, compact, and interpretable tree annotation,” in Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (COLING/ACL '06), pp. 433–440, ACL Press, July 2006. View at Scopus
  31. M. Macháček and O. Bojar, “Results of the WMT13 metrics shared task,” in Proceedings of the 8th Workshop on Statistical Machine Translation, pp. 45–51, ACL Press, 2013.
  32. C. Callison-Burch, P. Koehn, C. Monz, and O. F. Zaidan, “Findings of the 2011 workshop on statistical machine translation,” in Proceedings of the 6th Workshop on Statistical Machine Translation of the Association for Computational Linguistics (ACL-WMT '11), pp. 22–64, ACL Press, 2011.