Table of Contents Author Guidelines Submit a Manuscript
Mathematical Problems in Engineering
Volume 2015 (2015), Article ID 723469, 9 pages
http://dx.doi.org/10.1155/2015/723469
Research Article

Text Matching and Categorization: Mining Implicit Semantic Knowledge from Tree-Shape Structures

Lin Guo,1,2 Wanli Zuo,1,2 Tao Peng,1,2 and Lin Yue1,2

1College of Computer Science and Technology, Jilin University, Jilin 130000, China
2Symbol Computation and Knowledge Engineer of Ministry of Education, Jilin University, Jilin 130000, China

Received 31 March 2015; Accepted 9 June 2015

Academic Editor: Chaudry Masood Khalique

Copyright © 2015 Lin Guo et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. J. Manyika, M. Chui, and B. Brown, “Big data: the next frontier for innovation, competition, and productivity,” Tech. Rep., McKinsey Global Institute (MGI), 2011. View at Google Scholar
  2. G. Costa and R. Ortale, “On effective XML clustering by path commonality: an efficient and scalable algorithm,” in Proceedings of the IEEE 24th International Conference on Tools with Artificial Intelligence (ICTAI '12), pp. 389–396, IEEE, Athens, Greece, November 2012. View at Publisher · View at Google Scholar · View at Scopus
  3. P. Antonellis, C. Makris, and N. Tsirakis, “XEdge: clustering homogeneous and heterogeneous XML documents using edge summaries,” in Proceedings of the 23rd Annual ACM Symposium on Applied Computing (SAC '08), pp. 1081–1088, March 2008. View at Publisher · View at Google Scholar · View at Scopus
  4. M. J. Zaki and C. C. Aggarwal, “XRules: an effective algorithm for structural classification of XML data,” Machine Learning, vol. 62, no. 1-2, pp. 137–170, 2006. View at Publisher · View at Google Scholar · View at Scopus
  5. S. Tan, “An effective refinement strategy for KNN text classifier,” Expert Systems with Applications, vol. 30, no. 2, pp. 290–298, 2006. View at Publisher · View at Google Scholar · View at Scopus
  6. G. Costa, G. Manco, R. Ortale, and E. Ritacco, “Hierarchical clustering of XML documents focused on structural components,” Data and Knowledge Engineering, vol. 84, pp. 26–46, 2013. View at Publisher · View at Google Scholar · View at Scopus
  7. J.-B. Gao, B.-W. Zhang, and X.-H. Chen, “A WordNet-based semantic similarity measurement combining edge-counting and information content theory,” Engineering Applications of Artificial Intelligence, vol. 39, pp. 80–88, 2015. View at Publisher · View at Google Scholar · View at Scopus
  8. S. Joshi, N. Agrawal, R. Krishnapuram, and S. Negi, “A bag of paths model for measuring structural similarity in Web documents,” in Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '03), pp. 577–582, August 2003. View at Publisher · View at Google Scholar · View at Scopus
  9. K. Robles, A. Fraga, J. Morato, and J. Llorens, “Towards an ontology-based retrieval of UML class diagrams,” Information and Software Technology, vol. 54, no. 1, pp. 72–86, 2012. View at Publisher · View at Google Scholar · View at Scopus
  10. H.-C. Chu, M.-Y. Chen, and Y.-M. Chen, “A semantic-based approach to content abstraction and annotation for content management,” Expert Systems with Applications, vol. 36, no. 2, pp. 2360–2376, 2009. View at Publisher · View at Google Scholar · View at Scopus
  11. D. Sánchez and D. Isern, “Automatic extraction of acronym definitions from the Web,” Applied Intelligence, vol. 34, no. 2, pp. 311–327, 2011. View at Publisher · View at Google Scholar · View at Scopus
  12. J. Marés and V. Torra, “On the protection of social networks user's information,” Knowledge-Based Systems, vol. 49, pp. 134–144, 2013. View at Publisher · View at Google Scholar · View at Scopus
  13. M. Batet, A. Erola, D. Sánchez, and J. Castellà-Roca, “Utility preserving query log anonymization via semantic microaggregation,” Information Sciences, vol. 242, pp. 49–63, 2013. View at Publisher · View at Google Scholar · View at Scopus
  14. M. Liu, W. M. Shen, Q. Hao, and J. W. Yan, “An weighted ontology-based semantic similarity algorithm for web service,” Expert Systems with Applications, vol. 36, no. 10, pp. 12480–12490, 2009. View at Publisher · View at Google Scholar · View at Scopus
  15. P. Resnik, “Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural language,” Journal of Artificial Intelligence Research, vol. 11, pp. 95–130, 1999. View at Google Scholar · View at Scopus
  16. A. Tagarelli, “Exploring dictionary-based semantic relatedness in labeled tree data,” Information Sciences, vol. 220, pp. 244–268, 2013. View at Publisher · View at Google Scholar · View at Scopus
  17. W. De Smet and M.-F. Moens, “Representations for multi-document event clustering,” Data Mining and Knowledge Discovery, vol. 26, no. 3, pp. 533–558, 2013. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  18. Y. Guo, Z. Shao, and N. Hua, “Automatic text categorization based on content analysis with cognitive situation models,” Information Sciences, vol. 180, no. 5, pp. 613–630, 2010. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  19. D. M. Blei, A. Y. Ng, and M. I. Jordan, “Latent Dirichlet allocation,” Journal of Machine Learning Research, vol. 3, no. 4-5, pp. 993–1022, 2003. View at Google Scholar · View at Zentralblatt MATH · View at Scopus
  20. Y. Xin, J. Yang, Z.-Q. Xie, and J.-P. Zhang, “An overlapping semantic community detection algorithm base on the ARTs multiple sampling models,” Expert Systems with Applications, vol. 42, no. 7, pp. 3420–3432, 2015. View at Publisher · View at Google Scholar · View at Scopus
  21. T. Zesch and I. Gurevych, “Wisdom of crowds versus wisdom of linguists—measuring the semantic relatedness of words,” Natural Language Engineering, vol. 16, no. 1, pp. 25–59, 2010. View at Publisher · View at Google Scholar · View at Scopus
  22. G. Salton, Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer, Addison-Wesley, 1989.
  23. L. Tang, S. Rajan, and V. K. Narayanan, “Large scale multi-label classification via MetaLabeler,” in Proceedings of the 18th International World Wide Web Conference (WWW '09), pp. 211–220, New York, NY, USA, April 2009. View at Publisher · View at Google Scholar · View at Scopus