Table of Contents Author Guidelines Submit a Manuscript
Journal of Sensors
Volume 2017, Article ID 8513949, 25 pages
https://doi.org/10.1155/2017/8513949
Research Article

Optimum Pipeline for Visual Terrain Classification Using Improved Bag of Visual Words and Fusion Methods

1Institute of Medical Equipment, Academy of Military Medical Science, Tianjin 300161, China
2The State Key Laboratory of Intelligent Technology and System, Computer Science and Technology School, Tsinghua University, Beijing 100084, China

Correspondence should be addressed to Jinggong Sun; moc.anis.piv@gjnus

Received 5 June 2016; Revised 17 October 2016; Accepted 19 January 2017; Published 29 March 2017

Academic Editor: Raymond Swartz

Copyright © 2017 Hang Wu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. P. Papadakis, “Terrain traversability analysis methods for unmanned ground vehicles: a survey,” Engineering Applications of Artificial Intelligence, vol. 26, no. 4, pp. 1373–1385, 2013. View at Publisher · View at Google Scholar · View at Scopus
  2. B. H. Wilcox, “Non-geometric hazard detection for a mars microrover,” in Proceedings of the NASA Conference Publication, pp. 675–675, NASA, 1994.
  3. M. Bajracharya, A. Howard, L. H. Matthies, B. Tang, and M. Turmon, “Autonomous off-road navigation with end-to-end learning for the LAGR program,” Journal of Field Robotics, vol. 26, no. 1, pp. 3–25, 2009. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at Scopus
  4. F. L. Garcia Bermudez, R. C. Julian, D. W. Haldane, P. Abbeel, and R. S. Fearing, “Performance analysis and terrain classification for a legged robot over rough terrain,” in Proceedings of the 25th IEEE/RSJ International Conference on Robotics and Intelligent Systems (IROS '12), pp. 513–519, October 2012. View at Publisher · View at Google Scholar · View at Scopus
  5. A. Angelova, L. Matthies, D. Helmick, and P. Perona, “Fast terrain classification using variable-length representation for autonomous navigation,” in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '07), 8, p. 1, June 2007. View at Publisher · View at Google Scholar · View at Scopus
  6. Y. N. Khan, Visual Terrain Classification for Outdoor Mobile Robots, Universität Tübingen, 2013.
  7. A. Howard, H. Seraji, and E. Tunstel, “A rule-based Fuzzy Traversability Index for mobile robot navigation,” in Proceedings of the IEEE International Conference on Robotics and Automation, vol. 4, pp. 3067–3071, Seoul, South Korea, May 2001. View at Publisher · View at Google Scholar · View at Scopus
  8. A. Howard and H. Seraji, “Vision-based terrain characterization and traversability assessment,” Journal of Robotic Systems, vol. 18, no. 10, pp. 577–587, 2001. View at Publisher · View at Google Scholar · View at Scopus
  9. Y. Guo, A. Song, J. Bao, and H. Zhang, “Optimal path planning in field based on traversability prediction for mobile robot,” in Proceedings of the International Conference on Electric Information and Control Engineering (ICEICE '11), pp. 563–566, IEEE, Wuhan, China, April 2011. View at Publisher · View at Google Scholar · View at Scopus
  10. D. Kim, S. M. Oh, and J. M. Rehg, “Traversability classification for UGV navigation: a comparison of patch and superpixel representations,” in Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS '07), pp. 3166–3173, November 2007. View at Publisher · View at Google Scholar · View at Scopus
  11. Y. N. Khan, P. Komma, and A. Zell, “High resolution visual terrain classification for outdoor robots,” in Proceedings of the IEEE International Conference on Computer Vision Workshops (ICCV Workshops '11), pp. 1014–1021, November 2011. View at Publisher · View at Google Scholar · View at Scopus
  12. J.-S. Lee, M. R. Grunes, E. Pottier, and L. Ferro-Famil, “Unsupervised terrain classification preserving polarimetric scattering characteristics,” IEEE Transactions on Geoscience and Remote Sensing, vol. 42, no. 4, pp. 722–731, 2004. View at Publisher · View at Google Scholar · View at Scopus
  13. K. Walas, “Terrain classification and negotiation with a walking robot,” Journal of Intelligent & Robotic Systems, vol. 78, no. 3-4, pp. 401–423, 2015. View at Publisher · View at Google Scholar · View at Scopus
  14. N. Anantrasirichai, J. Burn, and D. Bull, “Terrain classification from body-mounted cameras during human locomotion,” IEEE transactions on cybernetics, vol. 45, no. 10, pp. 2249–2260, 2015. View at Publisher · View at Google Scholar · View at Scopus
  15. I. Gheorghe, W. Li, T. Popham, A. Gaszczak, and K. J. Burnham, “Key learning features as means for terrain classification,” in Advances in Systems Science, pp. 273–282, Springer, 2014. View at Publisher · View at Google Scholar
  16. S. Zenker, E. E. Aksoy, D. Goldschmidt, F. Worgotter, and P. Manoonpong, “Visual terrain classification for selecting energy efficient gaits of a hexapod robot,” in Proceedings of the IEEE/ASME International Conference on Advanced Intelligent Mechatronics: Mechatronics for Human Wellbeing (AIM '13), pp. 577–584, IEEE, Wollongong, Australia, July 2013. View at Publisher · View at Google Scholar · View at Scopus
  17. P. Filitchkin and K. Byl, “Feature-based terrain classification for LittleDog,” in Proceedings of the 25th IEEE/RSJ International Conference on Robotics and Intelligent Systems (IROS '12), pp. 1387–1392, October 2012. View at Publisher · View at Google Scholar · View at Scopus
  18. Y. Zou, W. Chen, L. Xie, and X. Wu, “Comparison of different approaches to visual terrain classification for outdoor mobile robots,” Pattern Recognition Letters, vol. 38, no. 1, pp. 54–62, 2014. View at Publisher · View at Google Scholar · View at Scopus
  19. Y. Yang and S. Newsam, “Bag-of-visual-words and spatial extensions for land-use classification,” in Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems (GIS '10), pp. 270–279, ACM, San Jose, Calif, USA, November 2010. View at Publisher · View at Google Scholar · View at Scopus
  20. A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ImageNet classification with deep convolutional neural networks,” in Proceedings of the 26th Annual Conference on Neural Information Processing Systems (NIPS '12), pp. 1097–1105, December 2012. View at Scopus
  21. G. E. Hinton and R. R. Salakhutdinov, “Reducing the dimensionality of data with neural networks,” Science, vol. 313, no. 5786, pp. 504–507, 2006. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at Scopus
  22. Y. Bengio, P. Lamblin, D. Popovici, and H. Larochelle, “Greedy layer-wise training of deep networks,” in Proceedings of the 20th Annual Conference on Neural Information Processing Systems (NIPS '06), pp. 153–160, December 2006. View at Scopus
  23. H. Wu, B. Liu, W. Su, W. Zhang, and J. Sun, “Deep filter banks for land-use scene classification,” IEEE Geoscience and Remote Sensing Letters, vol. 13, pp. 1895–1899, 2016. View at Google Scholar
  24. P. Filitchkin, Visual terrain classification for legged robots [M.S. thesis], University of California, Santa Barbara, Calif, USA, 2011.
  25. H. Wu, B. Liu, W. Su, W. Zhang, and J. Sun, “Hierarchical coding vectors for scene level land-use classification,” Remote Sensing, vol. 8, no. 5, article 436, 2016. View at Publisher · View at Google Scholar · View at Scopus
  26. K. Chatfield, V. Lempitsky, A. Vedaldi, and A. Zisserman, “The devil is in the details: an evaluation of recent feature encoding methods,” in Proceedings of the 22nd British Machine Vision Conference (BMVC '11), September 2011. View at Publisher · View at Google Scholar · View at Scopus
  27. P. Gehler and S. Nowozin, “On feature combination for multiclass object classification,” in Proceedings of the IEEE 12th International Conference on Computer Vision, pp. 221–228, IEEE, September-October 2009.
  28. A. Vedaldi, V. Gulshan, M. Varma, and A. Zisserman, “Multiple Kernels for object detection,” in Proceedings of the 12th International Conference on Computer Vision (ICCV '09), pp. 606–613, IEEE, Kyoto, Japan, October 2009. View at Publisher · View at Google Scholar · View at Scopus
  29. H. Wu, B. Z. Liu, W. H. Su, W. C. Zhang, and J. G. Sun, “Bag of words for visual terrain classification: a comprehensive study,” Journal of Image and Graphics, vol. 21, no. 10, pp. 1276–1288, 2016. View at Google Scholar
  30. Y. Huang, Z. Wu, L. Wang, and T. Tan, “Feature coding in image classification: a comprehensive study,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 36, no. 3, pp. 493–506, 2014. View at Publisher · View at Google Scholar · View at Scopus
  31. D. G. Lowe, “Distinctive image features from scale-invariant keypoints,” International Journal of Computer Vision, vol. 60, no. 2, pp. 91–110, 2004. View at Publisher · View at Google Scholar · View at Scopus
  32. H. Bay, A. Ess, T. Tuytelaars, and L. Van Gool, “Speeded-up robust features (SURF),” Computer Vision and Image Understanding, vol. 110, no. 3, pp. 346–359, 2008. View at Publisher · View at Google Scholar · View at Scopus
  33. R. Arandjelović and A. Zisserman, “Three things everyone should know to improve object retrieval,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR '12), pp. 2911–2918, IEEE, Providence, RI, USA, June 2012. View at Publisher · View at Google Scholar · View at Scopus
  34. X. Peng, L. Wang, X. Wang, and Y. Qiao, “Bag of visual words and fusion methods for action recognition: Comprehensive Study and Good Practice,” Computer Vision and Image Understanding, vol. 150, 2016. View at Publisher · View at Google Scholar · View at Scopus
  35. H. Abdi and L. J. Williams, “Principal component analysis,” Wiley Interdisciplinary Reviews: Computational Statistics, vol. 2, no. 4, pp. 433–459, 2010. View at Publisher · View at Google Scholar · View at Scopus
  36. J. C. Van Gemert, C. J. Veenman, A. W. M. Smeulders, and J.-M. Geusebroek, “Visual word ambiguity,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 32, no. 7, pp. 1271–1283, 2010. View at Publisher · View at Google Scholar · View at Scopus
  37. D. Arthur and S. Vassilvitskii, “K-means++: the advantages of careful seeding,” in Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '07), pp. 1027–1035, SIAM, January 2007. View at Scopus
  38. G. McLachlan and D. Peel, Finite Mixture Models, John Wiley & Sons, New York, NY, USA, 2004.
  39. E. Hartuv and R. Shamir, “Clustering algorithm based on graph connectivity,” Information Processing Letters, vol. 76, no. 4-6, pp. 175–181, 2000. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at Scopus
  40. M. Ester, H.-P. Kriegel, J. Sander, and X. Xu, “A density-based algorithm for discovering clusters in large spatial databases with noise,” in Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining (KDD '96), pp. 226–231, Portland, Ore, USA, August 1996.
  41. F. Murtagh, “Complexities of hierarchic clustering algorithms: state of the art,” Computational Statistics Quarterly, vol. 1, pp. 101–113, 1984. View at Google Scholar
  42. L. Liu, L. Wang, and X. Liu, “In defense of soft-assignment coding,” in Proceedings of the IEEE International Conference on Computer Vision (ICCV '11), pp. 2486–2493, IEEE, Barcelona, Spain, November 2011. View at Publisher · View at Google Scholar · View at Scopus
  43. J. Yang, K. Yu, Y. Gong, and T. Huang, “Linear spatial pyramid matching using sparse coding for image classification,” in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPR '09), pp. 1794–1801, June 2009. View at Publisher · View at Google Scholar · View at Scopus
  44. K. Yu, T. Zhang, and Y. Gong, “Nonlinear learning using local coordinate coding,” in Proceedings of the 22nd International Conference on Neural Information Processing Systems (NIPS '09), pp. 2223–2231, British Columbia, Canada, December 2009.
  45. J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, and Y. Gong, “Locality-constrained linear coding for image classification,” in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '10), pp. 3360–3367, June 2010. View at Publisher · View at Google Scholar · View at Scopus
  46. J. Sánchez, F. Perronnin, T. Mensink, and J. Verbeek, “Image classification with the fisher vector: theory and practice,” International Journal of Computer Vision, vol. 105, no. 3, pp. 222–245, 2013. View at Publisher · View at Google Scholar · View at Scopus
  47. H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez, and C. Schmid, “Aggregating local image descriptors into compact codes,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, no. 9, pp. 1704–1716, 2012. View at Publisher · View at Google Scholar · View at Scopus
  48. K. Yu and T. Zhang, “Improved local coordinate coding using local tangents,” in Proceedings of the 27th International Conference on Machine Learning (ICML '10), pp. 1215–1222, IMLS, Haifa, Israel, June 2010. View at Scopus
  49. X. Zhou, K. Yu, T. Zhang, and T. S. Huang, “Image classification using super-vector coding of local image descriptors,” in Proceedings of the 11th European Conference on Computer Vision (ECCV '10), pp. 141–154, Springer, Crete, Greece, September 2010.
  50. P. Koniusz, F. Yan, and K. Mikolajczyk, “Comparison of mid-level feature coding approaches and pooling strategies in visual concept detection,” Computer Vision and Image Understanding, vol. 117, no. 5, pp. 479–492, 2013. View at Publisher · View at Google Scholar · View at Scopus
  51. K. Simonyan, O. M. Parkhi, A. Vedaldi, and A. Zisserman, “Fisher vector faces in the wild,” in Proceedings of the 24th British Machine Vision Conference (BMVC '13), September 2013. View at Publisher · View at Google Scholar · View at Scopus
  52. Y. Lin, F. Lv, S. Zhu et al., “Large-scale image classification: fast feature extraction and SVM training,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR '11), pp. 1689–1696, June 2011. View at Publisher · View at Google Scholar · View at Scopus
  53. Y.-L. Boureau, J. Ponce, and Y. LeCun, “A theoretical analysis of feature pooling in visual recognition,” in Proceedings of the 27th International Conference on Machine Learning (ICML '10), pp. 111–118, Haifa, Israel, June 2010.
  54. Y.-L. Boureau, Learning Hierarchical Feature Extractors for Image Recognition, New York University, New York, NY, USA, 2014.
  55. N. Murray and F. Perronnin, “Generalized max pooling,” in Proceedings of the 27th IEEE Conference on Computer Vision and Pattern Recognition (CVPR '14), pp. 2473–2480, June 2014. View at Publisher · View at Google Scholar · View at Scopus
  56. G. Chowdhury, Introduction to Modern Information Retrieval, Facet Publishing, London, UK, 2010.
  57. F. Perronnin, J. Sánchez, and T. Mensink, “Improving the fisher kernel for large-scale image classification,” in Computer Vision—ECCV 2010, pp. 143–156, Springer, Berlin, Germany, 2010. View at Google Scholar
  58. R. Arandjelovic and A. Zisserman, “All about VLAD,” in Proceedings of the 26th IEEE Conference on Computer Vision and Pattern Recognition (CVPR '13), pp. 1578–1585, June 2013. View at Publisher · View at Google Scholar · View at Scopus
  59. C. J. Lin, C.-W. Hsu, and C.-C. Chang, “A practical guide to support vector classification,” National Taiwan University, 2003 http://www.csie.ntu.edu.tw/~cjlin/papers/guide/guide.pdf.
  60. R. Hadsell, A. Erkan, P. Sermanet, M. Scoffier, U. Muller, and Y. LeCun, “Deep belief net learning in a long-range vision system for autonomous off-road driving,” in Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS '08), pp. 628–633, September 2008. View at Publisher · View at Google Scholar · View at Scopus
  61. R. Hadsell, P. Sermanet, J. Ben et al., “Learning long-range vision for autonomous off-road driving,” Journal of Field Robotics, vol. 26, no. 2, pp. 120–144, 2009. View at Publisher · View at Google Scholar · View at Scopus
  62. A. Oliva and A. Torralba, “Modeling the shape of the scene: a holistic representation of the spatial envelope,” International Journal of Computer Vision, vol. 42, no. 3, pp. 145–175, 2001. View at Publisher · View at Google Scholar · View at Scopus
  63. R. Jain, The Art of Computer Systems Performance Analysis, John Wiley & Sons, 2008.
  64. S. Avila, N. Thome, M. Cord, and A. D. A. Araújo, “Pooling in image representation: the visual codeword point of view,” Computer Vision and Image Understanding, vol. 117, no. 5, pp. 453–465, 2013. View at Publisher · View at Google Scholar · View at Scopus