Computational Intelligence and Neuroscience
Volume 2015 (2015), Article ID 434153, 13 pages
http://dx.doi.org/10.1155/2015/434153
Research Article

Evaluating a Pivot-Based Approach for Bilingual Lexicon Extraction

Department of Computer Engineering, Korea Maritime and Ocean University, 727 Taejong-ro, Yeongdo-gu, Busan 606-791, Republic of Korea

Received 29 December 2014; Revised 26 March 2015; Accepted 9 April 2015

Academic Editor: Francesco Camastra

Copyright © 2015 Jae-Hoon Kim et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. D. Andrade, T. Matsuzaki, and J. Tsujii, “Statistical extraction and comparison of pivot words for bilingual lexicon extension,” ACM Transactions on Asian Language Information Processing, vol. 11, no. 2, article 6, 31 pages, 2012.
  2. P. Fung, “Compiling bilingual lexicon entries from a non-parallel English-Chinese corpus,” in Proceedings of the 3rd Workshop on Very Large Corpora (VLC '95), pp. 173–183, June 1995.
  3. R. Rapp, “Identifying word translations in nonparallel texts,” in Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics (ACL '95), pp. 320–322, 1995.
  4. M. Sahlgren and J. Karlgren, “Automatic bilingual lexicon acquisition using random indexing of parallel corpora,” Natural Language Engineering, vol. 11, no. 3, pp. 327–341, 2005.
  5. J. Klavans and E. Tzoukermann, “Combining corpus and machine-readable dictionary data for building bilingual lexicons,” Machine Translation, vol. 10, pp. 1–34, 1996.
  6. P. Fung, “A statistical view on bilingual lexicon extraction: from parallel corpora to non-parallel corpora,” in Proceedings of the Parallel Text Processing, pp. 1–17, 1998.
  7. P. F. Brown, J. Cocke, S. D. Pietra et al., “A statistical approach to machine translation,” Computational Linguistics, vol. 16, no. 2, pp. 79–85, 1990.
  8. F. J. Och and H. Ney, “Improved statistical alignment models,” in Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics (ACL '00), pp. 440–447, October 2000.
  9. A. B. Ismail, Minimally supervised techniques for bilingual lexicon extraction [Ph.D. dissertation], Department of Computer Science, The University of York, 2012.
  10. R. Rapp, “Automatic identification of word translations from unrelated English and German corpora,” in Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics (ACL '99), pp. 519–526, College Park, Md, USA, June 1999.
  11. P. Koehn and K. Knight, “Learning a translation lexicon from monolingual corpora,” in Proceedings of the Association for Computational Linguistics on Unsupervised Lexical Acquisition (ULA '02), pp. 9–16, July 2002.
  12. H. Kaji, S. Tanamura, and D. Erdenebat, “Automatic construction of a Japanese-Chinese dictionary via English,” in Proceedings of the International Conference on Language Resources and Evaluation (LREC '08), pp. 699–706, Marrakech, Morocco, May 2008.
  13. J. H. Kim, H. W. Seo, and H. S. Kwon, “Bilingual lexicon induction through a pivot language,” Journal of the Korean Society of Marine Engineering, vol. 37, no. 3, pp. 300–306, 2013.
  14. H. S. Kwon, H. W. Seo, and J. H. Kim, “Bilingual lexicon extraction via pivot language and word alignment tool,” in Proceedings of the 6th Workshop on Building and Using Comparable Corpora, pp. 11–15, 2013.
  15. K. Tanaka and K. Umemura, “Construction of a bilingual dictionary intermediated by a third language,” in Proceedings of the International Conference on Computational Linguistics (COLING '94), pp. 297–303, Kyoto, Japan, August 1994.
  16. T. Tsunakawa, N. Okazaki, and J. Tsujii, “Building a bilingual lexicon using phrase-based statistical machine translation via a pivot language,” in Proceedings of the International Conference on Computational Linguistics (COLING '08), pp. 127–130, 2008.
  17. G. S. Mann and D. Yarowsky, “Multipath translation lexicon induction via bridge languages,” in Proceedings of the 2nd Meeting of the North American Chapter of the Association for Computational Linguistics on Language Technologies (NAACL '01), pp. 1–8, Pittsburgh, Pa, USA, June 2001.
  18. D. Bouamor, N. Semmar, and P. Zweigenbaum, “Towards a generic approach for bilingual lexicon extraction from comparable corpora,” in Proceedings of the 14th Machine Translation Summit, pp. 143–150, 2013.
  19. P. Fung and K. McKeown, “Finding terminology translation from non-parallel corpora,” in Proceedings of the 5th Annual Workshop on Very Large Corpora (VLC '97), pp. 192–202, Hong Kong, 1997.
  20. I. Vulić and M. F. Moens, “A study on bootstrapping bilingual vector spaces from non-parallel data (and nothing else),” in Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP '13), pp. 1613–1624, Seattle, Wash, USA, October 2013.
  21. Z. S. Harris, “Distributional structure,” Word, vol. 10, no. 2-3, pp. 146–162, 1954.
  22. M. Apidianaki, N. Ljubešić, and D. Fišer, “Disambiguating vectors for bilingual lexicon extraction from comparable corpora,” in Proceedings of the 8th Language Technologies Conference, pp. 10–15, Ljubljana, Slovenia, October 2012.
  23. A. Lardilleux, Y. Lepage, and F. Yvon, “The contribution of low frequencies to multilingual sub-sentential alignment: a differential associative approach,” International Journal of Advanced Intelligence, vol. 3, no. 2, pp. 189–217, 2011.
  24. P. Fung and K. W. Church, “K-vec: a new approach for aligning parallel texts,” in Proceedings of the 15th Conference on Computational Linguistics, pp. 1096–1102, Kyoto, Japan, August 1994.
  25. S. Evert, The statistics of word co-occurrences: word pairs and collocations [Ph.D. thesis], Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart, Stuttgart, Germany, 2013.
  26. P. Pecina, “Lexical association measures and collocation extraction,” Language Resources and Evaluation, vol. 44, no. 1-2, pp. 137–158, 2010.
  27. C. D. Manning and H. Schütze, Foundations of Statistical Natural Language Processing, MIT Press, Cambridge, Mass, USA, 1999.
  28. H. Schmid, “Probabilistic part-of-speech tagging using decision trees,” in Proceedings of the International Conference on New Methods in Language Processing, pp. 44–49, Manchester, UK, 1994.
  29. W. Gale and K. W. Church, “Identifying word correspondences in parallel texts,” in Proceedings of the 4th DARPA Speech and Natural Language Workshop, pp. 152–157, 1991.
  30. H. W. Seo, H. Kim, H. Y. Cho, J. H. Kim, and S. W. Yang, “Automatically constructing English-Korean parallel corpus from web documents,” in Proceedings of the 26th Korea Information Processing Society Fall Conference, vol. 13, pp. 161–164, 2006.
  31. P. Koehn, “Europarl: a parallel corpus for statistical machine translation,” in Proceedings of the 10th Conference on the Machine Translation Summit, pp. 79–86, 2005.
  32. W. Lee, S. Kim, G. Kim, and K. Choi, “Implementation of modularized morphological analyzer,” in Proceedings of the 11th Annual Conference on Human and Cognitive Language Technology (HCLT '99), pp. 123–136, 1999.
  33. D. Chatterjee, S. Sarkar, and A. Mishra, “Co-occurrence graph based iterative bilingual lexicon extraction from comparable corpora,” in Proceedings of the 4th International Workshop on Cross Lingual Information Access (COLING '10), pp. 35–42, 2010.
  34. E. M. Voorhees, “TREC-8 question answering track report,” in Proceedings of the 8th Text Retrieval Conference, pp. 77–82, Gaithersburg, Md, USA, November 1999.
  35. A. Haghighi, P. Liang, T. Berg-Kirkpatrick, and D. Klein, “Learning bilingual lexicons from monolingual corpora,” in Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL '08), pp. 771–779, June 2008.
  36. H. W. Seo, H. S. Kwon, and J. H. Kim, “Rated recall: evaluation method for constructing bilingual lexicons,” in Proceedings of the 25th Annual Conference on Human and Cognitive Language Technology (HCLT '13), pp. 146–151, October 2013.
  37. E. Gaussier, J. M. Renders, I. Matveeva, C. Goutte, and H. Déjean, “A geometric view on bilingual lexicon extraction from comparable corpora,” in Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL '04), pp. 526–533, Barcelona, Spain, 2004.