Table of Contents Author Guidelines Submit a Manuscript
Mathematical Problems in Engineering
Volume 2018, Article ID 3598316, 7 pages
https://doi.org/10.1155/2018/3598316
Research Article

Object Detection Based on Fast/Faster RCNN Employing Fully Convolutional Architectures

1ATR National Lab, National University of Defense Technology, Changsha 410073, China
2State Key Lab of Complex Electromagnetic Environment Effects on Electronics and Information System, National University of Defense Technology, Changsha 410073, China

Correspondence should be addressed to Yun Ren; moc.361@tdun_nuyner

Received 23 October 2017; Accepted 12 December 2017; Published 9 January 2018

Academic Editor: Francesco Marotti de Sciarra

Copyright © 2018 Yun Ren et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. J. Huang, V. Rathod, C. Sun et al., “Speed/accuracy trade-offs for modern convolutional object detectors,” in Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR '17), pp. 3296-3297, Honolulu, HI, USA, July 2017. View at Publisher · View at Google Scholar
  2. S. Ren, K. He, R. Girshick, X. Zhang, and J. Sun, “Object detection networks on convolutional feature maps,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 7, pp. 1476–1481, 2017. View at Publisher · View at Google Scholar
  3. D. G. Lowe, “Distinctive image features from scale-invariant keypoints,” International Journal of Computer Vision, vol. 60, no. 2, pp. 91–110, 2004. View at Publisher · View at Google Scholar · View at Scopus
  4. N. Dalal and B. Triggs, “Histograms of oriented gradients for human detection,” in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '05), vol. 1, pp. 886–893, June 2005. View at Publisher · View at Google Scholar · View at Scopus
  5. P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan, “Object detection with discriminatively trained part-based models,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 32, no. 9, pp. 1627–1645, 2010. View at Publisher · View at Google Scholar · View at Scopus
  6. X. Wang, M. Yang, S. Zhu, and Y. Lin, “Regionlets for generic object detection,” in Proceedings of the 2013 14th IEEE International Conference on Computer Vision (ICCV '13), pp. 17–24, December 2013. View at Publisher · View at Google Scholar · View at Scopus
  7. J. R. R. Uijlings, K. E. A. Van De Sande, T. Gevers, and A. W. M. Smeulders, “Selective search for object recognition,” International Journal of Computer Vision, vol. 104, no. 2, pp. 154–171, 2013. View at Publisher · View at Google Scholar · View at Scopus
  8. S. Yan, Y. Xia, J. S. Smith, W. Lu, and B. Zhang, “Multiscale Convolutional Neural Networks for Hand Detection,” Applied Computational Intelligence and Soft Computing, vol. 2017, pp. 1–13, 2017. View at Publisher · View at Google Scholar
  9. Y. Sun, L. Zhu, G. Wang, and F. Zhao, “Multi-input convolutional neural network for flower grading,” Journal of Electrical & Computer Engineering, vol. 2017, Article ID 9240407, 8 pages, 2017. View at Publisher · View at Google Scholar
  10. Y. Xu, G. Yu, Y. Wang, X. Wu, and Y. Ma, “Car detection from low-altitude UAV imagery with the faster R-CNN,” Journal of Advanced Transportation, vol. 2017, Article ID 2823617, 11 pages, 2017. View at Publisher · View at Google Scholar
  11. L. Zhou, Q. Li, G. Huo, and Y. Zhou, “Image classification using biomimetic pattern recognition with convolutional neural networks features,” Computational Intelligence and Neuroscience, vol. 2017, Article ID 3792805, 12 pages, 2017. View at Publisher · View at Google Scholar · View at Scopus
  12. R. Girshick, J. Donahue, T. Darrell, and J. Malik, “Rich feature hierarchies for accurate object detection and semantic segmentation,” in Proceedings of the 27th IEEE Conference on Computer Vision and Pattern Recognition (CVPR '14), pp. 580–587, Columbus, Ohio, USA, June 2014. View at Publisher · View at Google Scholar · View at Scopus
  13. K. He, X. Zhang, S. Ren, and J. Sun, “Spatial pyramid pooling in deep convolutional networks for visual recognition,” in Proceedings of the European Conference on Computer Vision, vol. 8691, pp. 346–361, Springer, 2014. View at Publisher · View at Google Scholar · View at Scopus
  14. R. Girshick, “Fast R-CNN,” in Proceedings of the 15th IEEE International Conference on Computer Vision (ICCV '15), pp. 1440–1448, December 2015. View at Publisher · View at Google Scholar · View at Scopus
  15. S. Ren, K. He, R. Girshick, and J. Sun, “Faster R-CNN: towards real-time object detection with region proposal networks,” in Advances in Neural Information Processing Systems, pp. 91–99, 2015. View at Google Scholar · View at Scopus
  16. J. Deng, W. Dong, and R. Socher, “ImageNet: a large-scale hierarchical image database,” in Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR' 09), pp. 248–255, IEEE, Miami, FL, USA, June 2009. View at Publisher · View at Google Scholar
  17. K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR '16), pp. 770–778, July 2016. View at Scopus
  18. C. Szegedy, W. Liu, Y. Jia et al., “Going deeper with convolutions,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR '15), pp. 1–9, Boston, MA, USA, June 2015. View at Publisher · View at Google Scholar
  19. C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna, “Rethinking the inception architecture for computer vision,” in Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR '16), pp. 2818–2826, July 2016. View at Scopus
  20. K. Kim H, S. Hong, B. Roh et al., “PVANET: deep but lightweight neural networks for real-time object detection,” Computer Vision and Pattern Recognition, 2016, https://arxiv.org/abs/1608.08021. View at Google Scholar
  21. T.-Y. Lin, P. Dollár, R. Girshick, K. He, B. Hariharan, and S. Belongie, “Feature pyramid networks for object detection,” Computer Vision and Pattern Recognition, 2016, https://arxiv.org/abs/1612.03144. View at Google Scholar
  22. Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-based learning applied to document recognition,” Proceedings of the IEEE, vol. 86, no. 11, pp. 2278–2324, 1998. View at Publisher · View at Google Scholar · View at Scopus
  23. Y. LeCun, C. Cortes, and C. J. C. Burges, “MNIST handwritten digit database,” AT & T Labs, p. 2, 2010, http://yann.lecun.com/exdb/mnist. View at Google Scholar
  24. A. Krizhevsky, V. Nair, and G. Hinton, “The CIFAR-10 dataset,” http://www.cs.toronto.edu/kriz/cifar.html.
  25. N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, “Dropout: a simple way to prevent neural networks from overfitting,” Journal of Machine Learning Research, vol. 15, no. 1, pp. 1929–1958, 2014. View at Google Scholar · View at MathSciNet
  26. S. Ioffe and C. Szegedy, “Batch normalization: Accelerating deep network training by reducing internal covariate shift,” in Proceedings of the 32nd International Conference on Machine Learning (ICML '15), pp. 448–456, July 2015. View at Scopus
  27. M. D. Zeiler and R. Fergus, “Visualizing and understanding convolutional networks,” in Proceedings of the European conference on computer vision, vol. 8689 of Lecture Notes in Computer Science, pp. 818–833, Springer, 2014. View at Publisher · View at Google Scholar
  28. K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” Computer Vision and Pattern Recognition, 2014, https://arxiv.org/abs/1409.1556. View at Google Scholar
  29. Y. Jia, E. Shelhamer, J. Donahue et al., “Caffe: convolutional architecture for fast feature embedding,” in Proceedings of the ACM International Conference on Multimedia, pp. 675–678, ACM, Orlando, FL, USA, November 2014. View at Publisher · View at Google Scholar
  30. F. Yu and V. Koltun, “Multi-scale context aggregation by dilated convolutions,” Computer Vision and Pattern Recognition, 2015, https://arxiv.org/abs/1511.07122. View at Google Scholar