Table of Contents Author Guidelines Submit a Manuscript
The Scientific World Journal
Volume 2015, Article ID 136172, 19 pages
http://dx.doi.org/10.1155/2015/136172
Research Article

Exploiting Semantic Annotations and -Learning for Constructing an Efficient Hierarchy/Graph Texts Organization

Department of Computers and Systems, Faculty of Engineering, Mansoura University, Mansoura, Egypt

Received 15 July 2014; Revised 10 December 2014; Accepted 10 December 2014

Academic Editor: Miguel-Angel Sicilia

Copyright © 2015 Asmaa M. El-Said et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. I. T. Fatudimu, A. G. Musa, C. K. Ayo, and A. B. Sofoluwe, “Knowledge discovery in online repositories: a text mining approach,” European Journal of Scientific Research, vol. 22, no. 2, pp. 241–250, 2008. View at Google Scholar · View at Scopus
  2. A. Hotho, A. Nürnberger, and G. Paaß, “A brief survey of text mining,” Journal for Computational Linguistics and Language Technology, vol. 20, 2005. View at Google Scholar
  3. M. Bernotas, K. Karklius, R. Laurutis, and A. Slotkiene, “The peculiarities of the text document representation, using ontology and tagging-based clustering technique,” Journal of Information Technology and Control, vol. 36, pp. 217–220, 2007. View at Google Scholar
  4. Y. Li, Z. A. Bandar, and D. McLean, “An approach for measuring semantic similarity between words using multiple information sources,” IEEE Transactions on Knowledge and Data Engineering, vol. 15, no. 4, pp. 871–882, 2003. View at Publisher · View at Google Scholar · View at Scopus
  5. K. Shaban, O. Basir, and M. Kamel, “Document mining based on semantic understanding of text,” in Progress in Pattern Recognition, Image Analysis and Applications, vol. 4225 of Lecture Notes in Computer Science, pp. 834–843, Springer, Berlin, Germany, 2006. View at Google Scholar
  6. A. Culotta, A. McCallum, and J. Betz, “Integrating probabilistic extraction models and data mining to discover relations and patterns in text,” in Proceedings of the Human Language Technology Conference—North American Chapter of the Association for Computational Linguistics Annual Meeting (HLT-NAACL '06), pp. 296–303, June 2006. View at Scopus
  7. A. Islam and D. Inkpen, “Semantic text similarity using corpus-based word similarity and string similarity,” ACM Transactions on Knowledge Discovery from Data, vol. 2, no. 2, article 10, 2008. View at Publisher · View at Google Scholar · View at Scopus
  8. R. Feldman and J. Sanger, The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data, Cambridge University Press, 2006.
  9. R. Navigli, “Word sense disambiguation: a survey,” ACM Computing Surveys, vol. 41, no. 2, article 10, 2009. View at Publisher · View at Google Scholar · View at Scopus
  10. W. Mao and W. W. Chu, “The phrase-based vector space model for automatic retrieval of free-text medical documents,” Data and Knowledge Engineering, vol. 61, no. 1, pp. 76–92, 2007. View at Publisher · View at Google Scholar · View at Scopus
  11. C. Siefkes and P. Siniakov, “An overview and classification of adaptive approaches to information extraction,” in Journal on Data Semantics IV, vol. 3730 of Lecture Notes in Computer Science, pp. 172–212, Springer, New York, NY, USA, 2005. View at Publisher · View at Google Scholar
  12. http://wordnet.princeton.edu/.
  13. OpenNLP, http://opennlp.apache.org/.
  14. http://www.alchemyapi.com/.
  15. W. V. Siricharoen, “Ontologies and object models in object oriented software engineering,” International Association of Engineers (IAENG) Journal of Computer Science (JICS), vol. 33, no. 1, pp. 1151–1162, 2006. View at Google Scholar
  16. S. Shehata, “A WordNet-based semantic model for enhancing text clustering,” in Proceedings of the IEEE International Conference on Data Mining Workshops (ICDMW '09), pp. 477–482, Miami, Fla, USA, December 2009. View at Publisher · View at Google Scholar · View at Scopus
  17. S. Shehata, F. Karray, and M. Kamel, “An efficient concept-based mining model for enhancing text clustering,” IEEE Transactions on Knowledge and Data Engineering, vol. 22, no. 10, pp. 1360–1371, 2010. View at Publisher · View at Google Scholar · View at Scopus
  18. E. Zemmouri, H. Behja, A. Marzak, and B. Trousse, “Ontology-based knowledge model for multi-view KDD process,” International Journal of Mobile Computing and Multimedia Communications, vol. 4, no. 3, pp. 21–33, 2012. View at Publisher · View at Google Scholar
  19. C. Marinica and F. Guillet, “Knowledge-based interactive postmining of association rules using ontologies,” IEEE Transactions on Knowledge and Data Engineering, vol. 22, no. 6, pp. 784–797, 2010. View at Google Scholar
  20. B. Choudhary and P. Bhattacharyya, “Text clustering using Universal Networking Language representation,” in Proceedings of the 11th International World Wide Web Conference, 2003.
  21. G. Pirró and J. Euzenat, “A feature and information theoretic framework for semantic similarity and relatedness,” in Proceedings of the 9th International Semantic Web Conference on the Semantic Web (ISWC '10), vol. 6496, pp. 615–630, 2010.
  22. Z. Lin, M. R. Lyu, and I. King, “MatchSim: a novel similarity measure based on maximum neighborhood matching,” Knowledge and Information Systems, vol. 32, no. 1, pp. 141–166, 2012. View at Publisher · View at Google Scholar · View at Scopus
  23. D. Lin, “An information-theoretic definition of similarity,” in Proceedings of the 15th International Conference on Machine Learning, Madison, Wis, USA, July 1998.
  24. D. Inkpen, “A statistical model for near-synonym choice,” ACM Transactions on Speech and Language Processing, vol. 4, no. 1, article 2, 2007. View at Publisher · View at Google Scholar · View at Scopus
  25. L. Han, T. Finin, P. McNamee, A. Joshi, and Y. Yesha, “Improving word similarity by augmenting PMI with estimates of word polysemy,” IEEE Transactions on Knowledge and Data Engineering, vol. 25, no. 6, pp. 1307–1322, 2013. View at Publisher · View at Google Scholar · View at Scopus
  26. J. O’Shea, Z. Bandar, K. Crockett, and D. McLean, “A comparative study of two short text semantic similarity measures,” in Agent and Multi-Agent Systems: Technologies and Applications, vol. 4953 of Lecture Notes in Artificial Intelligence, pp. 172–181, Springer, Berlin, Germany, 2008. View at Google Scholar
  27. J. Oliva, J. I. Serrano, M. D. Del Castillo, and Á. Iglesias, “SyMSS: a syntax-based measure for short-text semantic similarity,” Data and Knowledge Engineering, vol. 70, no. 4, pp. 390–405, 2011. View at Publisher · View at Google Scholar · View at Scopus
  28. I. Yoo and X. Hu, “Clustering large collection of biomedical literature based on ontology-enriched bipartite graph representation and mutual refinement strategy,” in Proceedings of the 10th PAKDD, pp. 9–12, 2006.
  29. S.-T. Yuan and Y.-C. Chen, “Semantic ideation learning for agent-based e-brainstorming,” IEEE Transactions on Knowledge and Data Engineering, vol. 20, no. 2, pp. 261–275, 2008. View at Publisher · View at Google Scholar · View at Scopus
  30. D. Liu, Z. Liu, and Q. Dong, “A dependency grammar and WordNet based sentence similarity measure,” Journal of Computational Information Systems, vol. 8, no. 3, pp. 1027–1035, 2012. View at Google Scholar · View at Scopus
  31. A. Zouaq, M. Gagnon, and O. Benoit, “Semantic analysis using dependency-based grammars and upper-level ontologies,” International Journal of Computational Linguistics and Applications, vol. 1, no. 1-2, pp. 85–101, 2010. View at Google Scholar
  32. C. Ramakrishnan, K. J. Kochut, and A. P. Sheth, “A framework for schema-driven relationship discovery from unstructured text,” in The Semantic Web—ISWC 2006, vol. 4273 of Lecture Notes in Computer Science, pp. 583–596, Springer, Berlin, Germany, 2006. View at Google Scholar
  33. C. J. C. H. Watkins and P. Dayan, “Technical note: Q-learning,” Machine Learning, vol. 8, no. 3-4, pp. 279–292, 1992. View at Publisher · View at Google Scholar
  34. M. Lan, C. L. Tan, J. Su, and Y. Lu, “Supervised and traditional term weighting methods for automatic text categorization,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 31, no. 4, pp. 721–735, 2009. View at Publisher · View at Google Scholar · View at PubMed · View at Scopus
  35. L. Zhang, C. Li, J. Liu, and H. Wang, “Graph-based text similarity measurement by exploiting wikipedia as background knowledge,” World Academy of Science, Engineering and Technology, vol. 59, pp. 1548–1553, 2011. View at Google Scholar
  36. M. Kuramochi and G. Karypis, “An efficient algorithm for discovering frequent subgraphs,” IEEE Transactions on Knowledge and Data Engineering, vol. 16, no. 9, pp. 1038–1051, 2004. View at Publisher · View at Google Scholar · View at Scopus
  37. L.-C. Yu, H.-M. Shih, Y.-L. Lai, J.-F. Yeh, and C.-H. Wu, “Discriminative training for near-synonym substitution,” in Proceedings of the 23rd International Conference on Computational Linguistics (COLING '10), pp. 1254–1262, 2010.
  38. L.-C. Yu and W.-N. Chien, “Independent component analysis for near-synonym choice,” Decision Support Systems, vol. 55, no. 1, pp. 146–155, 2013. View at Publisher · View at Google Scholar
  39. L. Yao, S. Riedel, and A. Mccallum, “Collective cross-document relation extraction without labelled data,” in Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 1013–1023, MIT, Cambridge, Mass, USA, October 2010.
  40. M. Shen, D.-R. Liu, and Y.-S. Huang, “Extracting semantic relations to enrich domain ontologies,” Journal of Intelligent Information Systems, vol. 39, no. 3, pp. 749–761, 2012. View at Publisher · View at Google Scholar · View at Scopus
  41. H. Chim and X. Deng, “Efficient phrase-based document similarity for clustering,” IEEE Transactions on Knowledge and Data Engineering, vol. 20, no. 9, pp. 1217–1229, 2008. View at Publisher · View at Google Scholar · View at Scopus
  42. G. A. Miller and W. G. Charles, “Contextual correlates of semantic similarity,” Language and Cognitive Processes, vol. 6, no. 1, pp. 1–28, 1991. View at Publisher · View at Google Scholar
  43. M. Steinbach, G. Karypis, and V. Kumar, “A comparison of document clustering techniques,” in Proceedings of the Knowledge Discovery and Data Mining (KDD) Workshop Text Mining, August 2000.