Table of Contents Author Guidelines Submit a Manuscript
Advances in Multimedia
Volume 2018, Article ID 6153607, 11 pages
https://doi.org/10.1155/2018/6153607
Research Article

Combining Convolutional Neural Network and Markov Random Field for Semantic Image Retrieval

1School of Information Technology in Education, South China Normal University, Guangzhou, China
2Guangdong Engineering Research Center for Smart Learning, South China Normal University, Guangzhou, China
3School of Computing and Mathematics, Charles Sturt University, Albury, NSW, Australia
4School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China

Correspondence should be addressed to Changqin Huang; nc.ude.uncs@gnauhqc

Received 4 May 2018; Accepted 12 June 2018; Published 1 August 2018

Academic Editor: Yong Luo

Copyright © 2018 Haijiao Xu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. M. Kawanabe, A. Binder, C. Müller, and W. Wojcikiewicz, “Multi-modal visual concept classification of images via Markov random walk over tags,” in Proceedings of the IEEE Workshop on Applications of Computer Vision, pp. 396–401, 2011. View at Scopus
  2. S. Zhang, J. Huang, H. Li, and D. N. Metaxas, “Automatic image annotation and retrieval using group sparsity,” IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, vol. 42, no. 3, pp. 838–849, 2012. View at Publisher · View at Google Scholar · View at Scopus
  3. S. Tang, Y.-D. Zhang, Z.-X. Xu, H.-J. Li, Y.-T. Zheng, and J.-T. Li, “An efficient concept detection system via sparse ensemble learning,” Neurocomputing, vol. 169, pp. 124–133, 2015. View at Publisher · View at Google Scholar · View at Scopus
  4. Z. Guan, L. Zhang, J. Peng, and J. Fan, “Multi-View Concept Learning for Data Representation,” IEEE Transactions on Knowledge and Data Engineering, vol. 27, no. 11, pp. 3016–3028, 2015. View at Publisher · View at Google Scholar · View at Scopus
  5. D. Grangier and S. Bengio, “A discriminative kernel-based approach to rank images from text queries,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 30, no. 8, pp. 1371–1384, 2008. View at Publisher · View at Google Scholar · View at Scopus
  6. M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid, “TagProp: discriminative metric learning in nearest neighbor models for image auto-annotation,” in Proceedings of the International Conference on Computer Vision, pp. 309–316, 2009. View at Scopus
  7. C. Szegedy, W. Liu, Y. Jia et al., “Going deeper with convolutions,” in Proceedings of the International Conference on Computer Vision and Pattern Recognition, pp. 1–9, 2015. View at Scopus
  8. F. Radenović, G. Tolias, and O. Chum, “CNN image retrieval learns from BoW: unsupervised fine-tuning with hard examples,” in Proceedings of the European Conference on Computer Vision, pp. 3–20, 2016. View at Google Scholar
  9. G. Hu, X. Peng, Y. Yang, T. M. Hospedales, and J. Verbeek, “Frankenstein: learning deep face representations using small data,” IEEE Transactions on Image Processing, vol. 27, no. 1, pp. 293–303, 2018. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  10. Z. Liu, X. Li, P. Luo, C.-C. Loy, and X. Tang, “Semantic image segmentation via deep parsing network,” in Proceedings of the International Conference on Computer Vision, pp. 1377–1385, 2015. View at Scopus
  11. C. Dong, C. C. Loy, K. He, and X. Tang, “Image super-resolution using deep convolutional networks,” IEEE Trans. Pattern Anal. Mach. Intell, vol. 38, pp. 295–307, 2016. View at Publisher · View at Google Scholar
  12. S. Nowak, K. Nagel, and J. Liebetrau, “The CLEF 2011 photo annotation and concept-based retrieval tasks,” in Proceedings of the CLEF Conference and Labs of the Evaluation Forum, pp. 1–25, 2011. View at Scopus
  13. T.-S. Chua, J. Tang, R. Hong, H. Li, Z. Luo, and Y. Zheng, “NUS-WIDE: a real-world web image database from National University of Singapore,” in Proceedings of International Conference on Image and Video Retrieval, pp. 48–56, 2009.
  14. D. Wang and M. Li, “Stochastic Configuration Networks: Fundamentals and Algorithms,” IEEE Transactions on Cybernetics, vol. 47, no. 10, pp. 3466–3479, 2017. View at Publisher · View at Google Scholar · View at Scopus
  15. C.-T. Nguyen, N. Kaothanthong, T. Tokuyama, and X.-H. Phan, “A feature-word-topic model for image annotation and retrieval,” ACM Transactions on the Web (TWEB), vol. 7, no. 3, pp. 1–24, 2013. View at Google Scholar · View at Scopus
  16. R. L. Cilibrasi and P. M. B. Vitányi, “The google similarity distance,” IEEE Transactions on Knowledge and Data Engineering, vol. 19, no. 3, pp. 370–383, 2007. View at Publisher · View at Google Scholar · View at Scopus
  17. S. Zhu, C.-W. Ngo, and Y.-G. Jiang, “Sampling and ontologically pooling web images for visual concept learning,” IEEE Transactions on Multimedia, vol. 14, no. 4, pp. 1068–1078, 2012. View at Publisher · View at Google Scholar · View at Scopus
  18. J.-M. Laferte, P. Perez, and F. Heitz, “Discrete Markov image modeling and inference on the quadtree,” IEEE Transactions on Image Processing, vol. 9, no. 3, pp. 390–404, 2000. View at Publisher · View at Google Scholar · View at MathSciNet
  19. D. Metzler and W. B. Croft, “Latent concept expansion using Markov random fields,” in Proceedings of the ACM International SIGIR Conference, pp. 311–318, 2007. View at Scopus
  20. Y. Xiang, X. Zhou, T.-S. Chua, and C.-W. Ngo, “A revisit of generative model for automatic image annotation using markov random fields,” in Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops 2009, pp. 1153–1160, June 2009. View at Scopus
  21. Z. Lu and H. H. S. Ip, “Spatial Markov kernels for image categorization and annotation,” IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, vol. 41, no. 4, pp. 976–989, 2011. View at Publisher · View at Google Scholar · View at Scopus
  22. X. Dong, J. Shen, L. Shao, and L. Van Gool, “Sub-Markov random walk for image segmentation,” IEEE Transactions on Image Processing, vol. 25, no. 2, pp. 516–527, 2016. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  23. K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” https://arxiv.org/abs/1409.1556v6.
  24. T. Hoang, T.-T. Do, D.-K. Le Tan, and N.-M. Cheung, “Selective deep convolutional features for image retrieval,” in Proceedings of the 25th ACM International Conference on Multimedia, MM 2017, pp. 1600–1608, October 2017. View at Scopus
  25. S. Ren, K. He, R. Girshick, and J. Sun, “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 6, pp. 1137–1149, 2017. View at Publisher · View at Google Scholar
  26. C. Xu, C. Lu, X. Liang et al., “Multi-loss Regularized Deep Neural Network,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 26, no. 12, pp. 2273–2283, 2016. View at Publisher · View at Google Scholar · View at Scopus
  27. K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, pp. 770–778, July 2016. View at Scopus
  28. D. Wang and C. Cui, “Stochastic configuration networks ensemble with heterogeneous features for large-scale data analytics,” Information Sciences, vol. 417, pp. 55–71, 2017. View at Publisher · View at Google Scholar · View at Scopus
  29. G. Huang, Z. Liu, L. v. Maaten, and K. Q. Weinberger, “Densely Connected Convolutional Networks,” in Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2261–2269, Honolulu, Hawaii, USA, July 2017. View at Publisher · View at Google Scholar
  30. Y. Gong, Y. Jia, T. Leung, A. Toshev, and S. Ioffe, “Deep convolutional ranking for multilabel image annotation,” https://arxiv.org/abs/1312.4894v2.
  31. C. Wang, N. Komodakis, and N. Paragios, “Markov Random Field modeling, inference and learning in computer vision and image understanding: a survey,” Computer Vision and Image Understanding, vol. 117, no. 11, pp. 1610–1627, 2013. View at Publisher · View at Google Scholar · View at Scopus
  32. X. Li, “Preconditioned Stochastic Gradient Descent,” IEEE Transactions on Neural Networks and Learning Systems, vol. 29, no. 5, pp. 1454–1466, 2018. View at Publisher · View at Google Scholar
  33. J. Verbeek, M. Guillaumin, T. Mensink, and C. Schmid, “Image annotation with tagprop on the MIRFLICKR set,” in Proceedings of the 2010 ACM SIGMM International Conference on Multimedia Information Retrieval, MIR 2010, pp. 537–546, USA, March 2010. View at Scopus
  34. M. Chen, A. Zheng, and K. Weinberger, “Fast image tagging,” in Proceedings of the International Conference on Machine Learning, pp. 1274–1282, 2013.
  35. N. Srivastava and R. Salakhutdinov, “Multimodal learning with deep Boltzmann machines,” Journal of Machine Learning Research, vol. 15, pp. 2949–2980, 2014. View at Google Scholar · View at MathSciNet
  36. G. Haixiang, L. Yijing, J. Shang, G. Mingyun, H. Yuanyue, and G. Bing, “Learning from class-imbalanced data: Review of methods and applications,” Expert Systems with Applications, vol. 73, pp. 220–239, 2017. View at Publisher · View at Google Scholar · View at Scopus