Table of Contents Author Guidelines Submit a Manuscript
Computational Intelligence and Neuroscience
Volume 2016 (2016), Article ID 9821608, 11 pages
http://dx.doi.org/10.1155/2016/9821608
Research Article

A Character Level Based and Word Level Based Approach for Chinese-Vietnamese Machine Translation

1Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City 700000, Vietnam
2Faculty of Information Technology, VNU-HCM University of Science, Ho Chi Minh City 700000, Vietnam

Received 11 March 2016; Accepted 8 May 2016

Academic Editor: Stefan Haufe

Copyright © 2016 Phuoc Tran et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. Y. M. Oh, F. Pellegrino, E. Marsico, and C. Coupé, “A quantitative and typological approach to correlating linguistic complexity,” in Proceedings of the 5th Conference on Quantitative Investigations in Theoretical Linguistics, Leuven, Belgium, September 2013.
  2. L. Dinh Khan, Vietnamese Vocabulary Having Chinese Origin, National University of HCMC Press, 2002 (Vietnamese).
  3. Y.-S. Lee, “Morphological analysis for statistical machine translation,” in Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL '04), pp. 57–62, 2004.
  4. S. Goldwater and D. McClosky, “Improving statistical MT through morphological analysis,” in Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT '05), pp. 676–683, October 2005. View at Scopus
  5. J. Xu, R. Jens, and H. Ney, “Do we need Chinese word segmentation for statistical machine translation?” in Proceedings of the 3rd SIGHAN Workshop on Chinese Language Learning, pp. 122–128, Association for Computational Linguistics, Stroudsburg, Pa, USA, 2004.
  6. Y. Ma and A. Way, “Bilingually motivated domain-adapted word segmentation for statistical machine translation,” in Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL '09), pp. 549–557, April 2009. View at Scopus
  7. M. Paul, A. Finch, and E. Sumita, “Integration of multiple bilingually-learned segmentation schemes into statistical machine translation,” in Proceedings of the Joint 5th Workshop on Statistical Machine Translation and MetricsMATR (WMT '10), pp. 400–408, Uppsala, Sweden, 2010.
  8. M.-H. Bai, K.-J. Chen, and J. S. Chang, “Improving word alignment by adjusting Chinese word segmentation,” in Proceedings of the 3rd International Joint Conference on Natural Language Processing, vol. 1, pp. 249–256, 2008.
  9. X. Wang, M. Utiyama, A. Finch, and E. Sumita, “Refining Word Segmentation Using a Manually Aligned Corpus for Statistical Machine Translation,” in Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP '14), pp. 1654–1664, Doha, Qatar, October 2014. View at Publisher · View at Google Scholar
  10. C. Chu, T. Nakazawa, D. Kawahara, and S. Kurohashi, “Exploiting shared Chinese characters in Chinese word segmentation optimization for Chinese-Japanese machine translation,” in Proceeding of the 16th Annual Conference of the European Association for Machine Translation, pp. 35–42, 2012.
  11. J. Xu, E. Matusov, R. Zens, and H. Ney, “Integrated Chinese word segmentation in statistical machine translation,” in Proceeding of the International Workshop on Spoken Language Translation, pp. 131–137, 2005.
  12. H. Zhao, T. Yin, and J. Zhang, “Vietnamese to Chinese machine translation via Chinese character as pivot,” in Proceedings of the 27th Pacific Asia Conference on Language, Information, and Computation (PACLIC '13), pp. 250–259, Taipei, Taiwan, November 2013. View at Scopus
  13. H. Zhao, M. Utiyama, E. Sumita, and B.-L. Lu, “An empirical study on word segmentation for Chinese machine translation,” in Computational Linguistics and Intelligent Text Processing, vol. 7817 of Lecture Notes in Computer Science, pp. 248–263, Springer, Berlin, Germany, 2013. View at Google Scholar
  14. J. Xu, J. Gao, K. Toutanova, and H. Ney, “Bayesian semi-supervised Chinese word segmentation for statistical machine translation,” in Proceedings of the 22nd International Conference on Computational Linguistics (Coling '08), pp. 1017–1024, August 2008. View at Scopus
  15. T. L. Nguyen, S. Vogel, and N. A. Smith, “Nonparametric word segmentation for machine translation,” in Proceedings of the 23rd International Conference on Computational Linguistics (COLING '10), pp. 815–823, August 2010. View at Scopus
  16. X. Wang, M. Utiyama, A. Finch, and E. Sumita, “Empirical study of unsupervised Chinese word segmentation methods for SMT on large-scale corpora,” in Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL '14), pp. 752–758, June 2014. View at Scopus
  17. X. Zeng, L. S. Chao, D. F. Wong, I. Trancoso, and L. Tian, “Toward better Chinese word segmentation for SMT via bilingual constraints,” in Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, pp. 1360–1369, Association for Computational Linguistics, June 2014. View at Scopus
  18. P. F. Brown, V. J. Della Pietra, S. A. Della Pietra, and R. L. Mercer, “The mathematics of statistical machine translation: parameter estimation,” Computational Linguistics, vol. 19, no. 2, pp. 263–311, 1993. View at Google Scholar
  19. P. Tran and D. Dinh, “Retranslating number expression unknown word in Chinese-Vietnamese statistical machine translation,” Journal of Computer Science and Cybernetics, vol. 30, no. 2, pp. 127–138, 2014 (Vietnamese).
  20. P. Tran, D. Dinh, and L. Tran, “Resolving named entity unknown word in Chinese-Vietnamese machine translation,” in Proceedings of the Fifth International Conference on Knowledge and Systems Engineering (KSE '13), pp. 273–285, 2013.
  21. P. Tran, D. Dinh, T. Le, and T. Nguyen, “Handling organization name unknown word in Chinese-Vietnamese machine translation,” in Proceedings of the IEEE RIVF International Conference on Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF '13), pp. 242–247, IEEE, Hanoi, Vietnam, November 2013. View at Publisher · View at Google Scholar · View at Scopus