Table of Contents Author Guidelines Submit a Manuscript
Mobile Information Systems
Volume 2017, Article ID 8104386, 14 pages
https://doi.org/10.1155/2017/8104386
Research Article

Efficient ConvNet Feature Extraction with Multiple RoI Pooling for Landmark-Based Visual Localization of Autonomous Vehicles

1College of Electronic Science and Engineering, National University of Defense Technology, Changsha, Hunan, China
2Department of Computing Science, University of Alberta, Edmonton, AB, Canada T6G 2E8

Correspondence should be addressed to Yi Hou; moc.liamg@eiwohuohiy

Received 12 May 2017; Revised 28 July 2017; Accepted 11 October 2017; Published 9 November 2017

Academic Editor: Paolo Bellavista

Copyright © 2017 Yi Hou et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. S. Lowry, N. Süenderhauf, P. Newman et al., “Visual place recognition: a survey,” IEEE Transactions on Robotics, vol. 32, no. 1, pp. 1–19, 2016. View at Google Scholar
  2. M. Cummins and P. Newman, “FAB-MAP: probabilistic localization and mapping in the space of appearance,” International Journal of Robotics Research, vol. 27, no. 6, pp. 647–665, 2008. View at Publisher · View at Google Scholar · View at Scopus
  3. G. Singh and J. Kosecka, “Visual loop closing using gist descriptors in Manhattan world,” in Proceedings of the in IEEE International Conference on Robotics and Automation (ICRA) Omnidirectional Robot Vision Workshop, 2010.
  4. H. Zhang, “BoRF: Loop-closure detection with scale invariant visual features,” in Proceedings of the 2011 IEEE International Conference on Robotics and Automation, ICRA 2011, pp. 3125–3130, May 2011. View at Publisher · View at Google Scholar · View at Scopus
  5. N. Sünderhauf and P. Protzel, “BRIEF-Gist - Closing the loop by simple means,” in Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems: Celebrating 50 Years of Robotics, IROS'11, pp. 1234–1241, September 2011. View at Publisher · View at Google Scholar · View at Scopus
  6. Y. Liu and H. Zhang, “Visual loop closure detection with a compact image descriptor,” in Proceedings of the 25th IEEE/RSJ International Conference on Robotics and Intelligent Systems, IROS 2012, pp. 1051–1056, October 2012. View at Publisher · View at Google Scholar · View at Scopus
  7. M. J. Milford and G. F. Wyeth, “SeqSLAM: Visual route-based navigation for sunny summer days and stormy winter nights,” in Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), pp. 1643–1649, 2012. View at Publisher · View at Google Scholar · View at Scopus
  8. Y. Liu and H. Zhang, “Towards improving the efficiency of sequence-based SLAM,” in Proceedings of the 10th IEEE International Conference on Mechatronics and Automation (ICMA '13), pp. 1261–1266, Takamatsu, Japan, August 2013. View at Publisher · View at Google Scholar · View at Scopus
  9. M. Milford, “Vision-based place recognition: How low can you go?” International Journal of Robotics Research, vol. 32, no. 7, pp. 766–789, 2013. View at Publisher · View at Google Scholar · View at Scopus
  10. E. Pepperell, P. I. Corke, and M. J. Milford, “All-environment visual place recognition with SMART,” in Proceedings of the 2014 IEEE International Conference on Robotics and Automation, ICRA 2014, pp. 1612–1618, June 2014. View at Publisher · View at Google Scholar · View at Scopus
  11. T. Naseer, L. Spinello, W. Burgard, and C. Stachniss, “Robust visual robot localization across seasons using network flows,” in Proceedings of the AAAI Conference on Artificial Intelligence, pp. 2564–2570, July 2014. View at Scopus
  12. Z. Chen, O. Lam, A. Jacobson, and M. Milford, “Convolutional neural network-based place recognition,” in Proceedings of the Australasian Conference on Robotics and Automation, ACRA 2014, pp. 2–4, December 2014. View at Scopus
  13. N. Sünderhauf, S. Shirazi, A. Jacobson et al., “Place recognition with convnet landmarks: Viewpoint-robust, condition-robust, training-free,” in Proceedings of the 2015 Robotics: Science and Systems Conference, RSS 2015, Rome, Italy, July 2015. View at Publisher · View at Google Scholar · View at Scopus
  14. P. Neubert and P. Protzel, “Local region detector + CNN based landmarks for practical place recognition in changing environments,” in Proceedings of the European Conference on Mobile Robots, ECMR 2015, pp. 1–6, September 2015. View at Publisher · View at Google Scholar · View at Scopus
  15. P. Neubert and P. Protzel, “Beyond Holistic Descriptors, Keypoints, and Fixed Patches: Multiscale Superpixel Grids for Place Recognition in Changing Environments,” IEEE Robotics and Automation Letters, vol. 1, no. 1, pp. 484–491, 2016. View at Publisher · View at Google Scholar
  16. A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classification with deep convolutional neural networks,” in Proceedings of the 26th Annual Conference on Neural Information Processing Systems (NIPS '12), pp. 1097–1105, December 2012. View at Scopus
  17. K. Chatfield, K. Simonyan, A. Vedaldi, and A. Zisserman, “Return of the devil in the details: delving deep into convolutional nets,” in Proceedings of the 25th British Machine Vision Conference (BMVC '14), Nottingham, UK, September 2014. View at Scopus
  18. K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” in Proceedings of the in International Conference on Learning Representations (ICLR), 2015.
  19. R. Girshick, J. Donahue, T. Darrell, and J. Malik, “Rich feature hierarchies for accurate object detection and semantic segmentation,” in Proceedings of the 27th IEEE Conference on Computer Vision and Pattern Recognition (CVPR '14), pp. 580–587, Columbus, Ohio, USA, June 2014. View at Publisher · View at Google Scholar · View at Scopus
  20. R. Girshick, “Fast R-CNN,” in Proceedings of the 15th IEEE International Conference on Computer Vision (ICCV '15), pp. 1440–1448, December 2015. View at Publisher · View at Google Scholar · View at Scopus
  21. S. Ren, K. He, R. Girshick, and J. Sun, “Faster R-CNN: towards real-time object detection with region proposal networks,” in Advances in Neural Information Processing Systems, pp. 91–99, 2015. View at Google Scholar · View at Scopus
  22. M. Labbe and F. Michaud, “Appearance-based loop closure detection for online large-scale and long-term operation,” IEEE Transactions on Robotics, vol. 29, no. 3, pp. 734–745, 2013. View at Publisher · View at Google Scholar · View at Scopus
  23. M. Labbe and F. Michaud, “Online global loop closure detection for large-scale multi-session graph-based SLAM,” in Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2014, pp. 2661–2666, September 2014. View at Publisher · View at Google Scholar · View at Scopus
  24. D. Gálvez-López and J. D. Tardós, “Bags of binary words for fast place recognition in image sequences,” IEEE Transactions on Robotics, vol. 28, no. 5, pp. 1188–1197, 2012. View at Publisher · View at Google Scholar · View at Scopus
  25. R. Mur-Artal, J. M. M. Montiel, and J. D. Tardos, “ORB-SLAM: a versatile and accurate monocular SLAM system,” IEEE Transactions on Robotics, vol. 31, no. 5, pp. 1147–1163, 2015. View at Publisher · View at Google Scholar · View at Scopus
  26. Y. Hou, H. Zhang, and S. Zhou, “Convolutional neural network-based image representation for visual loop closure detection,” in Proceedings of the IEEE International Conference on Information and Automation(ICIA), pp. 2238–2245, August 2015. View at Publisher · View at Google Scholar · View at Scopus
  27. N. Sünderhauf, S. Shirazi, F. Dayoub, B. Upcroft, and M. Milford, “On the performance of ConvNet features for place recognition,” in Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2015, pp. 4297–4304, October 2015. View at Publisher · View at Google Scholar · View at Scopus
  28. A. Oliva and A. Torralba, “Modeling the shape of the scene: a holistic representation of the spatial envelope,” International Journal of Computer Vision, vol. 42, no. 3, pp. 145–175, 2001. View at Publisher · View at Google Scholar · View at Scopus
  29. J. Sivic and A. Zisserman, “Video google: a text retrieval approach to object matching in videos,” in Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV '03), pp. 1470–1477, October 2003. View at Scopus
  30. F. Perronnin and C. Dance, “Fisher kernels on visual vocabularies for image categorization,” in Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR'07, pp. 1–8, June 2007. View at Publisher · View at Google Scholar · View at Scopus
  31. H. Jégou, M. Douze, C. Schmid, and P. Pérez, “Aggregating local descriptors into a compact image representation,” in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '10), pp. 3304–3311, June 2010. View at Publisher · View at Google Scholar · View at Scopus
  32. T. Naseer, M. Ruhnke, C. Stachniss, L. Spinello, and W. Burgard, “Robust visual SLAM across seasons,” in Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2015, pp. 2529–2535, October 2015. View at Publisher · View at Google Scholar · View at Scopus
  33. M. Shahid, T. Naseer, and W. Burgard, “DTLC: Deeply trained loop closure detections for lifelong visual SLAM,” in Proceedings of the RSS Workshop on Visual Place Recognition: What is it Good For? 2016.
  34. R. Arroyo, P. F. Alcantarilla, L. M. Bergasa, and E. Romera, “Fusion and binarization of CNN features for robust topological localization across seasons,” in Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2016, pp. 4656–4663, October 2016. View at Publisher · View at Google Scholar · View at Scopus
  35. Y. Liu, R. Feng, and H. Zhang, “Keypoint matching by outlier pruning with consensus constraint,” in Proceedings of the 2015 IEEE International Conference on Robotics and Automation, ICRA 2015, pp. 5481–5486, May 2015. View at Publisher · View at Google Scholar · View at Scopus
  36. A. J. Glover, W. P. Maddern, M. J. Milford, and G. F. Wyeth, “Fab-map + ratslam: appearance-based slam for multiple times of day,” in Proceedings of the IEEE International Conference on Robotics and Automation (ICRA '10), pp. 3507–3512, May 2010. View at Publisher · View at Google Scholar · View at Scopus
  37. P. Neubert, N. Sünderhauf, and P. Protzel, “Superpixel-based appearance change prediction for long-term navigation across seasons,” Robotics and Autonomous Systems, vol. 69, no. 1, pp. 15–27, 2015. View at Publisher · View at Google Scholar · View at Scopus
  38. Y. Hou, H. Zhang, and S. Zhou, “Tree-based indexing for real-time ConvNet landmark-based visual place recognition,” International Journal of Advanced Robotic Systems, vol. 14, no. 1, 2017. View at Publisher · View at Google Scholar · View at Scopus
  39. Mapillary, 2016, https://www.mapillary.com.
  40. M.-M. Cheng, Z. Zhang, W.-Y. Lin, and P. Torr, “BING: Binarized normed gradients for objectness estimation at 300 fps,” in Proceedings of the 27th IEEE Conference on Computer Vision and Pattern Recognition (CVPR '14), pp. 3286–3293, June 2014. View at Publisher · View at Google Scholar · View at Scopus
  41. C. L. Zitnick and P. Dollár, “Edge boxes: locating object proposals from edges,” in Proceedings of the European Conference on Computer Vision (ECCV '14), pp. 391–405, Springer, 2014. View at Publisher · View at Google Scholar
  42. J. Hosang, R. Benenson, P. Dollar, and B. Schiele, “What makes for effective detection proposals?” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 38, no. 4, pp. 814–830, 2016. View at Publisher · View at Google Scholar · View at Scopus
  43. S. Dasgupta, “Experiments with random projection,” in Proceedings of the Conference on Uncertainty in Artificial Intelligence, pp. 143–151, 2000.
  44. E. Bingham and H. Mannila, “Random projection in dimensionality reduction: applications to image and text data,” in Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '01), pp. 245–250, August 2001. View at Scopus
  45. Y. Jia, E. Shelhamer, J. Donahue et al., “Caffe: convolutional architecture for fast feature embedding,” in Proceedings of the ACM International Conference on Multimedia, pp. 675–678, ACM, Orlando, Fla, USA, November 2014. View at Publisher · View at Google Scholar
  46. W. Su, Y. Yuan, and M. Zhu, “A relationship between the average precision and the area under the ROC curve,” in Proceedings of the 5th ACM SIGIR International Conference on the Theory of Information Retrieval, ICTIR 2015, pp. 349–352, September 2015. View at Publisher · View at Google Scholar · View at Scopus