Table of Contents Author Guidelines Submit a Manuscript
Mathematical Problems in Engineering
Volume 2013, Article ID 641927, 10 pages
http://dx.doi.org/10.1155/2013/641927
Research Article

Active Semisupervised Clustering Algorithm with Label Propagation for Imbalanced and Multidensity Datasets

1School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
2Gansu Computing Center, Lanzhou 730000, China

Received 27 August 2013; Revised 11 October 2013; Accepted 13 October 2013

Academic Editor: Gelan Yang

Copyright © 2013 Mingwei Leng et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. S. Basu, A. Banerjee, and R. J. Mooney, “Semi-supervised clustering by seeding,” in Proceedings of the 19th International Conference on Machine Learning (ICML '02), pp. 27–34, 2002.
  2. M. Bilenko, S. Basu, and R. J. Mooney, “A probabilistic framework for semi-supervised clustering,” in Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '04), pp. 59–68, August 2004. View at Scopus
  3. M. Leng, X. Chen, and L. Li, “K-means clustering algorithm based on semi-supervised learning,” Journal of Computational Information Systems, vol. 4, no. 5, pp. 2007–2013, 2008. View at Google Scholar · View at Scopus
  4. Y. Dang, Z. Xuan, L. Rong, and M. Liu, “A novel initialization method for semi-supervised clustering,” in Proceedings of the 4th International Conference on Knowledge Science, Engineering and Management, pp. 317–328, 2010.
  5. S. Zhong, “Semi-supervised model-based document clustering: a comparative study,” Machine Learning, vol. 65, no. 1, pp. 3–29, 2006. View at Publisher · View at Google Scholar · View at Scopus
  6. L. Lelis and J. Sander, “Semi-supervised density-based clustering,” in Proceedings of the 9th IEEE International Conference on Data Mining (ICDM '09), pp. 842–847, December 2009. View at Publisher · View at Google Scholar · View at Scopus
  7. C. Böhm and C. Plant, “HISSCLU: a hierarchical density-based method for semi-supervised clustering,” in Proceedings of the 11th International Conference on Extending Database Technology (EDBT '08), pp. 440–451, March 2008. View at Publisher · View at Google Scholar · View at Scopus
  8. R. Guan, X. Shi, M. Marchese, C. Yang, and Y. Liang, “Text clustering with seeds affinity propagation,” IEEE Transactions on Knowledge and Data Engineering, vol. 23, no. 4, pp. 627–637, 2011. View at Publisher · View at Google Scholar · View at Scopus
  9. M. Shiga and H. Mamitsuka, “Efficient semi-supervised learning on locally informative multiple graphs,” Pattern Recognition, vol. 45, no. 3, pp. 1035–1049, 2012. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at Scopus
  10. K. Wagstaff, C. Cardie, S. Rogers, and S. Schroedel, “Constrained k-means clustering with background knowledge,” in Proceedings of the 18th International Conference on Machine Learning (ICML '01), pp. 77–584, 2001.
  11. D. Klein, S. Kamvar, and C. Manning, “From instance-level constraints to space- level constraints: making the most of prior knowledge in data clustering,” in Proceedings of the 19th International Conference on Machine Learning (ICML '02), pp. 307–314, 2002.
  12. C. Ruiz, M. Spiliopoulou, and E. Menasalvas, “Density-based semi-supervised clustering,” Data Mining and Knowledge Discovery, vol. 21, no. 3, pp. 345–370, 2010. View at Publisher · View at Google Scholar · View at MathSciNet
  13. X. Wang and I. Davidson, “Flexible constrained spectral clustering,” in Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '10), pp. 563–572, July 2010. View at Publisher · View at Google Scholar · View at Scopus
  14. I. S. Dhillon, Y. Guan, and B. Kulis, “Weighted graph cuts without eigenvectors a multilevel approach,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 29, no. 11, pp. 1944–1957, 2007. View at Publisher · View at Google Scholar · View at Scopus
  15. Z. Lu and M. Á. Carreira-Perpiñán, “Constrained spectral clustering through affinity propagation,” in Proceedings of the 26th IEEE Conference on Computer Vision and Pattern Recognition (CVPR '08), pp. 1–8, June 2008. View at Publisher · View at Google Scholar · View at Scopus
  16. C. Alzate and J. A. K. Suykens, “A regularized formulation for spectral clustering with pairwise constraints,” in Proceedings of the International Joint Conference on Neural Networks (IJCNN '09), pp. 141–148, June 2009. View at Publisher · View at Google Scholar · View at Scopus
  17. W. Pedrycz and J. Waletzky, “Fuzzy clustering with partial supervision,” IEEE Transactions on Systems, Man, and Cybernetics B, vol. 27, no. 5, pp. 787–795, 1997. View at Publisher · View at Google Scholar · View at Scopus
  18. A. Bouchachia and W. Pedrycz, “A semi-supervised clustering algorithm for data exploration,” in Proceedings of the 10th International Fuzzy Systems Association World Congress Conference on Fuzzy Sets and Systems, pp. 328–337, 2003.
  19. J. Gao, P.-N. Tan, and H. Cheng, “Semi-supervised clustering with partial background information,” in Proceedings of the 6th SIAM International Conference on Data Mining (SDM '06), pp. 489–493, 2006. View at MathSciNet
  20. N. Grira, M. Crucianu, and N. Boujemaa, “Semi-supervised fuzzy clustering with pairwise-constrained competitive agglomeration,” in Proceedings of the 14th IEEE International Conference on Fuzzy Systems, pp. 867–872, May 2005. View at Scopus
  21. W. Pedrycz, V. Loia, and S. Senatore, “P-FCM: a proximity-based fuzzy clustering,” Fuzzy Sets and Systems, vol. 148, no. 1, pp. 21–41, 2004. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  22. Y. Yan, L. Chen, and W.-C. Tjhi, “Fuzzy semi-supervised co-clustering for text documents,” Fuzzy Sets and Systems, vol. 215, pp. 74–89, 2013. View at Publisher · View at Google Scholar · View at MathSciNet
  23. D. Lewis and W. Gale, “A sequential algorithm for training text classifiers,” in Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3–12, 1994.
  24. A. Culotta and A. McCallum, “Reducing labeling effort for structured prediction tasks,” in Proceedings of the 20th National Conference on Artificial Intelligence and the 17th Innovative Applications of Artificial Intelligence Conference (AAAI '05), pp. 746–751, July 2005. View at Scopus
  25. B. Settles and M. Craven, “An analysis of active learning strategies for sequence labeling tasks,” in Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP '08), pp. 1069–1078, 2008.
  26. T. Scheffer, C. Decomain, and S. Wrobel, “Active hidden markov models for information extraction,” in Proceedings of the International Conference on Advances in Intelligent Data Analysis (CAIDA '01), pp. 309–318, 2001.
  27. R. Hwa, “Sample selection for statistical parsing,” Computational Linguistics, vol. 30, no. 3, pp. 253–276, 2004. View at Publisher · View at Google Scholar · View at MathSciNet
  28. T. Scheffer and S. Wrobel, “Active learning of partially hidden markov models,” in Proceedings of ECML/PKDD Workshop on Instance Selection, 2001. View at Publisher · View at Google Scholar
  29. P. K. Mallapragada, R. Jin, and A. K. Jain, “Active query selection for semi-supervised clustering,” in Proceedings of the 19th International Conference on Pattern Recognition (ICPR '08), pp. 1–4, December 2008.
  30. H. T. Nguyen and A. Smeulders, “Active learning using pre-clustering,” in Proceedingsof the 21st International Conference on Machine Learning (ICML '04), pp. 623–630, July 2004. View at Scopus
  31. V.-V. Vu, N. Labroche, and B. Bouchon-Meunier, “Active learning for semi-supervised K-means clustering,” in Proceedings of the 22nd International Conference on Tools with Artificial Intelligence (ICTAI '10), pp. 12–15, October 2010. View at Publisher · View at Google Scholar · View at Scopus
  32. W. Zhao, Q. He, H. Ma, and Z. Shi, “Effective semi-supervised document clustering via active learning with instance-level constraints,” Knowledge and Information Systems, vol. 30, no. 3, pp. 569–587, 2012. View at Publisher · View at Google Scholar · View at Scopus
  33. N. Grira, M. Crucianu, and N. Boujemaa, “Active semi-supervised fuzzy clustering,” Pattern Recognition, vol. 41, no. 5, pp. 1834–1844, 2008. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at Scopus
  34. X. Wang and I. Davidson, “Active spectral clustering,” in Proceedings of the 10th IEEE International Conference on Data Mining (ICDM '10), pp. 561–568, December 2010. View at Publisher · View at Google Scholar · View at Scopus
  35. R. Huang, W. Lam, and Z. Zhang, “Active learning of constraints for semi-supervised text clustering,” in Proceedings of the 7th SIAM International Conference on Data Mining (SDM '07), pp. 113–124, April 2007. View at Scopus
  36. C. T. Zahn, “Graph-theoretical methods for detecting and describing gestalt clusters,” IEEE Transactions on Computers, vol. 20, no. 1, pp. 68–86, 1971. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at Scopus
  37. Uci datasets, http://archive.ics.uci.edu/ml/datasets.html.