Table of Contents Author Guidelines Submit a Manuscript
Mathematical Problems in Engineering
Volume 2016, Article ID 6394253, 14 pages
http://dx.doi.org/10.1155/2016/6394253
Research Article

An Improved Semisupervised Outlier Detection Algorithm Based on Adaptive Feature Weighted Clustering

1College of Science, Harbin Engineering University, Harbin 150001, China
2College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China

Received 13 September 2016; Revised 2 November 2016; Accepted 20 November 2016

Academic Editor: Filippo Ubertini

Copyright © 2016 Tingquan Deng and Jinhong Yang. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques, Elsevier, 2011.
  2. D. M. Hawkins, Identification of Outliers, Chapman & Hall, London, UK, 1980. View at MathSciNet
  3. V. Bamnett and T. Lewis, Outliers in Statistical Data, John Wiley & Sons, Chichester, UK, 1994.
  4. V. J. Hodge and J. Austin, “A survey of outlier detection methodologies,” Artificial Intelligence Review, vol. 22, no. 2, pp. 85–126, 2004. View at Publisher · View at Google Scholar
  5. B. Sheng, Q. Li, W. Mao, and W. Jin, “Outlier detection in sensor networks,” in Proceedings of the 8th ACM International Symposium on Mobile Ad Hoc Networking and Computing (MobiHoc '07), pp. 219–228, Montréal, Canada, September 2007.
  6. A. P. James and S. Dimitrijev, “Inter-image outliers and their application to image classification,” Pattern Recognition, vol. 43, no. 12, pp. 4101–4112, 2010. View at Publisher · View at Google Scholar
  7. J. Huang, Q. Zhu, L. Yang, and J. Feng, “A non-parameter outlier detection algorithm based on Natural Neighbor,” Knowledge-Based Systems, vol. 92, pp. 71–77, 2016. View at Publisher · View at Google Scholar · View at Scopus
  8. J. M. Shepherd and S. J. Burian, “Detection of urban-induced rainfall anomalies in a major coastal city,” Earth Interactions, vol. 7, no. 4, pp. 1–17, 2003. View at Publisher · View at Google Scholar
  9. O. Alan and C. Catal, “Thresholds based outlier detection approach for mining class outliers: an empirical case study on software measurement datasets,” Expert Systems with Applications, vol. 38, no. 4, pp. 3440–3445, 2011. View at Publisher · View at Google Scholar · View at Scopus
  10. B. Schölkopf, J. C. Platt, J. Shawe-Taylor, A. J. Smola, and R. C. Williamson, “Estimating the support of a high-dimensional distribution,” Neural Computation, vol. 13, no. 7, pp. 1443–1471, 2001. View at Publisher · View at Google Scholar · View at Scopus
  11. D. Tax, A. Ypma, and R. Duin, “Support vector data description applied to machine vibration analysis,” in Proceedings of the 5th Annual Conference of the Advanced School for Computing and Imaging, pp. 398–405, Heijen, The Netherlands, June 1999.
  12. D. M. J. Tax and R. P. W. Duin, “Support vector data description,” Machine Learning, vol. 54, no. 1, pp. 45–66, 2004. View at Publisher · View at Google Scholar · View at Scopus
  13. S. Ramaswamy, R. Rastogi, and K. Shim, “Efficient algorithms for mining outliers from large data sets,” SIGMOD Record (ACM Special Interest Group on Management of Data), vol. 29, no. 2, pp. 427–438, 2000. View at Google Scholar · View at Scopus
  14. E. M. Knox and R. T. Ng, “Algorithms for mining distance based outliers in large dataset,” in Proceedings of the International Conference on Very Large Data Bases, pp. 392–403, Citeseer, New York, NY, USA, 1998.
  15. M. M. Breunig, H.-P. Kriegel, R. T. Ng, and J. Sander, “LOF: identifying density-based local outliers,” ACM SIGMOD Record, vol. 29, no. 2, pp. 93–104, 2000. View at Publisher · View at Google Scholar
  16. J. Ha, S. Seok, and J.-S. Lee, “A precise ranking method for outlier detection,” Information Sciences, vol. 324, pp. 88–107, 2015. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  17. R. N. Dave, “Characterization and detection of noise in clustering,” Pattern Recognition Letters, vol. 12, no. 11, pp. 657–664, 1991. View at Publisher · View at Google Scholar · View at Scopus
  18. R. Smith, A. Bivens, M. Embrechts, C. Palagiri, and B. Szymanski, “Clustering approaches for anomaly based intrusion detection,” Proceedings of Intelligent Engineering Systems Through Artificial Neural Networks, pp. 579–584, 2002. View at Google Scholar
  19. Y. Shi and L. Zhang, “COID: a cluster-outlier iterative detection approach to multi-dimensional data analysis,” Knowledge and Information Systems, vol. 28, no. 3, pp. 709–733, 2011. View at Publisher · View at Google Scholar · View at Scopus
  20. J. Zhao, K. Liu, W. Wang, and Y. Liu, “Adaptive fuzzy clustering based anomaly data detection in energy system of steel industry,” Information Sciences, vol. 259, pp. 335–345, 2014. View at Publisher · View at Google Scholar · View at Scopus
  21. F. Angiulli and C. Pizzuti, “Fast outlier detection in high dimensional spaces,” in Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery (PKDD '02), vol. 2, pp. 15–26, August 2002.
  22. M. Radovanović, A. Nanopoulos, and M. Ivanović, “Reverse nearest neighbors in unsupervised distance-based outlier detection,” IEEE Transactions on Knowledge and Data Engineering, vol. 27, no. 5, pp. 1369–1382, 2015. View at Publisher · View at Google Scholar · View at Scopus
  23. A. Daneshpazhouh and A. Sami, “Entropy-based outlier detection using semi-supervised approach with few positive examples,” Pattern Recognition Letters, vol. 49, pp. 77–84, 2014. View at Publisher · View at Google Scholar · View at Scopus
  24. J. Gao, H. Cheng, and P.-N. Tan, “Semi-supervised outlier detection,” in Proceedings of the ACM Symposium on Applied Computing, pp. 635–636, ACM, Dijon, France, April 2006.
  25. Z. Xue, Y. Shang, and A. Feng, “Semi-supervised outlier detection based on fuzzy rough C-means clustering,” Mathematics and Computers in Simulation, vol. 80, no. 9, pp. 1911–1921, 2010. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  26. J. Z. Huang, M. K. Ng, H. Rong, and Z. Li, “Automated variable weighting in k-means type clustering,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, no. 5, pp. 657–668, 2005. View at Publisher · View at Google Scholar · View at Scopus
  27. H. Friguiand and O. Nasraou, “Unsupervised learning of prototypes and attribute weights,” Pattern Recognition, vol. 37, no. 3, pp. 567–581, 2004. View at Publisher · View at Google Scholar
  28. J. Zhou, L. Chen, C. P. Chen, Y. Zhang, and H. Li, “Fuzzy clustering with the entropy of attribute weights,” Neurocomputing, vol. 198, pp. 125–134, 2016. View at Publisher · View at Google Scholar
  29. M. Hassan, A. Chaudhry, A. Khan, and M. A. Iftikhar, “Robust information gain based fuzzy c-means clustering and classification of carotid artery ultrasound images,” Computer Methods and Programs in Biomedicine, vol. 113, no. 2, pp. 593–609, 2014. View at Publisher · View at Google Scholar · View at Scopus
  30. R. Krishnapuram and J. M. Keller, “A possibilistic approach to clustering,” IEEE Transactions on Fuzzy Systems, vol. 1, no. 2, pp. 98–110, 1993. View at Publisher · View at Google Scholar · View at Scopus
  31. M.-S. Yang and K.-L. Wu, “Unsupervised possibilistic clustering,” Pattern Recognition, vol. 39, no. 1, pp. 5–21, 2006. View at Publisher · View at Google Scholar · View at Scopus
  32. S. M. Guo, L. C. Chen, and J. S. H. Tsai, “A boundary method for outlier detection based on support vector domain description,” Pattern Recognition, vol. 42, no. 1, pp. 77–83, 2009. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at Scopus
  33. T. Fawcett, “Roc graphs: notes and practical considerations for researchers,” Machine Learning, vol. 31, no. 1, pp. 1–38, 2004. View at Google Scholar
  34. C. Blake and C. J. Merz, “UCI repository of machine learning databases,” 1998.
  35. C. C. Aggarwal and P. S. Yu, “Outlier detection for high dimensional data,” in Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD '01), Santa Barbara, Calif, USA, May 2001. View at Publisher · View at Google Scholar