Table of Contents Author Guidelines Submit a Manuscript
Mathematical Problems in Engineering
Volume 2017, Article ID 6727105, 11 pages
https://doi.org/10.1155/2017/6727105
Research Article

Kernel-Based Multiview Joint Sparse Coding for Image Annotation

1School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, No. 10, Xitucheng Road, Haidian District, Beijing 100876, China
2School of Electronics and Information Engineering, North China University of Technology, No. 5, Jinyuanzhang Road, Shijingshan District, Beijing 100144, China
3School of Computer Science, North China University of Technology, No. 5, Jinyuanzhang Road, Shijingshan District, Beijing 100144, China

Correspondence should be addressed to Miao Zang; nc.ude.tucn@mgnaz

Received 28 October 2016; Accepted 11 January 2017; Published 19 March 2017

Academic Editor: Wanquan Liu

Copyright © 2017 Miao Zang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. D. Zhang, M. M. Islam, and G. Lu, “A review on automatic image annotation techniques,” Pattern Recognition, vol. 45, no. 1, pp. 346–362, 2012. View at Publisher · View at Google Scholar · View at Scopus
  2. X. Li, T. Uricchio, L. Ballan, M. Bertini, C. G. M. Snoek, and A. Del Bimbo, “Socializing the semantic gap: a comparative survey on image tag assignment, refinement, and retrieval,” ACM Computing Surveys, vol. 49, no. 1, article 14, 2016. View at Publisher · View at Google Scholar · View at Scopus
  3. G. Carneiro, A. B. Chan, P. J. Moreno, and N. Vasconcelos, “Supervised learning of semantic classes for image annotation and retrieval,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 29, no. 3, pp. 394–410, 2007. View at Publisher · View at Google Scholar · View at Scopus
  4. Y. Verma and C. V. Jawahar, “Exploring SVM for image annotation in presence of confusing labels,” in Proceedings of the 24th British Machine Vision Conference, pp. 1–11, 2013.
  5. M.-L. Zhang and L. Wu, “LIFT: multi-label learning with label-specific features,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, no. 1, pp. 107–120, 2015. View at Publisher · View at Google Scholar · View at Scopus
  6. O. Yakhnenko and V. Honavar, “Annotating images and image objects using a hierarchical dirichlet process model,” in Proceedings of the 9th International Workshop on Multimedia Data Mining, pp. 1–7, Las Vegas, Nev, USA, August 2008.
  7. D. Putthividhya, H. T. Attias, and S. S. Nagarajan, “Topic regression multi-modal Latent Dirichlet Allocation for image annotation,” in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '10), pp. 3408–3415, IEEE, San Francisco, Calif, USA, June 2010. View at Publisher · View at Google Scholar · View at Scopus
  8. A. Makadia, V. Pavlovic, and S. Kumar, “A new baseline for image annotation,” in Proceedings of the 10th European Conference on Computer Vision, pp. 316–329, Marseille, France, October 2008.
  9. M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid, “TagProp: discriminative metric learning in nearest neighbor models for image auto-annotation,” in Proceedings of the IEEE 12th International Conference on Computer Vision (ICCV '09), pp. 309–316, IEEE, Kyoto, Japan, September-October 2009. View at Publisher · View at Google Scholar · View at Scopus
  10. Y. Verma and C. V. Jawahar, “Image annotation by propagating labels from semantic neighbourhoods,” International Journal of Computer Vision, vol. 121, no. 1, pp. 126–148, 2017. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  11. X. Li, C. G. M. Snoek, and M. Worring, “Learning social tag relevance by neighbor voting,” IEEE Transactions on Multimedia, vol. 11, no. 7, pp. 1310–1322, 2009. View at Publisher · View at Google Scholar · View at Scopus
  12. Z. Chen, J. Cao, T. Xia, Y. Song, Y. Zhang, and J. Li, “Web video retagging,” Multimedia Tools and Applications, vol. 55, no. 1, pp. 53–82, 2011. View at Publisher · View at Google Scholar · View at Scopus
  13. C. Wang, S. Yan, L. Zhang, and H.-J. Zhang, “Multi-label sparse coding for automatic image annotation,” in Proceedings of the IEEE 22nd International Conference on Computer Vision and Pattern Recognition (CVPR '09), pp. 1643–1650, IEEE, Miami, Fla, USA, June 2009. View at Publisher · View at Google Scholar · View at Scopus
  14. S. Gao, L.-T. Chia, I. W.-H. Tsang, and Z. Ren, “Concurrent single-label image classification and annotation via efficient multi-layer group sparse coding,” IEEE Transactions on Multimedia, vol. 16, no. 3, pp. 762–771, 2014. View at Publisher · View at Google Scholar · View at Scopus
  15. X. Cao, H. Zhang, X. Guo, S. Liu, and D. Meng, “SLED: semantic label embedding dictionary representation for multilabel image annotation,” IEEE Transactions on Image Processing, vol. 24, no. 9, pp. 2746–2759, 2015. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  16. Z. Lu, P. Han, L. Wang, and J.-R. Wen, “Semantic sparse recoding of visual content for image applications,” IEEE Transactions on Image Processing, vol. 24, no. 1, pp. 176–188, 2015. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  17. X.-Y. Jing, F. Wu, Z. Li, R. Hu, and D. Zhang, “Multi-label dictionary learning for image annotation,” IEEE Transactions on Image Processing, vol. 25, no. 6, pp. 2712–2725, 2016. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  18. S. Moran and V. Lavrenko, “Sparse kernel learning for image annotation,” in Proceedings of the 4th ACM International Conference on Multimedia Retrieval (ICMR '14), pp. 113–120, ACM, Glasgow, UK, April 2014. View at Publisher · View at Google Scholar · View at Scopus
  19. M. M. Kalayeh, H. Idrees, and M. Shah, “NMF-KNN: image annotation using weighted multi-view non-negative matrix factorization,” in Proceedings of the 27th IEEE Conference on Computer Vision and Pattern Recognition (CVPR '14), pp. 184–191, IEEE, Columbus, Ohio, USA, June 2014. View at Publisher · View at Google Scholar · View at Scopus
  20. W. Liu, D. Tao, J. Cheng, and Y. Tang, “Multiview Hessian discriminative sparse coding for image annotation,” Computer Vision and Image Understanding, vol. 118, pp. 50–60, 2014. View at Publisher · View at Google Scholar · View at Scopus
  21. X.-T. Yuan, X. Liu, and S. Yan, “Visual classification with multitask joint sparse representation,” IEEE Transactions on Image Processing, vol. 21, no. 10, pp. 4349–4360, 2012. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  22. J. Yang, K. Yu, Y. Gong, and T. Huang, “Linear spatial pyramid matching using sparse coding for image classification,” in Proceedings of the IEEE 22nd International Conference on Computer Vision and Pattern Recognition (CVPR '09), pp. 1794–1801, IEEE, Miami, Fla, USA, June 2009. View at Publisher · View at Google Scholar · View at Scopus
  23. S. Gao, I. W. Tsang, and L.-T. Chia, “Sparse representation with kernels,” IEEE Transactions on Image Processing, vol. 22, no. 2, pp. 423–434, 2013. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  24. H. V. Nguyen, V. M. Patel, N. M. Nasrabadi, and R. Chellappa, “Kernel dictionary learning,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '12), pp. 2021–2024, IEEE, Kyoto, Japan, March 2012. View at Publisher · View at Google Scholar · View at Scopus
  25. J. C. Bezdek and R. J. Hathaway, “Convergence of alternating optimization,” Neural, Parallel and Scientific Computations, vol. 11, no. 4, pp. 351–368, 2003. View at Google Scholar · View at MathSciNet
  26. X. Chen, W. Pan, J. T. Kwok, and J. G. Carbonell, “Accelerated gradient method for multi-task sparse learning problem,” in Proceedings of the IEEE 5th International Conference on Data Mining, pp. 746–751, Las Vegas, Nev, USA, July 2009.
  27. X.-T. Yuan and S. C. Yan, “Visual classification with multi-task joint sparse representation,” in Proceedings of the IEEE 23rd International Conference on Computer Vision and Pattern Recognition (CVPR '10), pp. 3493–3500, IEEE, San Francisco, Calif, USA, June 2010. View at Publisher · View at Google Scholar · View at Scopus
  28. P. Duygulu, K. Barnard, J. F. G. Freitas, and D. A. Forsyth, “Object recognition as machine translation: learning a lexicon for a fixed image vocabulary,” in Proceedings of the 7th European Conference on Computer Vision, pp. 97–112, Copenhagen, Denmark, May 2002.
  29. S. Zhang, J. Huang, H. Li, and D. N. Metaxas, “Automatic image annotation and retrieval using group sparsity,” IEEE Transactions on Systems, Man, and Cybernetics Part B: Cybernetics, vol. 42, no. 3, pp. 838–849, 2012. View at Publisher · View at Google Scholar · View at Scopus
  30. M. Grubinger, P. Clough, H. Müller, and T. Deselaers, “The IAPR TC-12 benchmark: a new evaluation resource for visual information systems,” in Proceedings of the International Conference on Language Resources and Evaluation, Genoa, Italy, May 2006.
  31. M. J. Huiskes and M. S. Lew, “The MIR Flickr retrieval evaluation,” in Proceedings of the ACM International Conference on Multimedia Information Retrieval, pp. 39–43, Vancouver, Canada, October 2008.
  32. J. Verbeek, M. Guillaumin, T. Mensink, and C. Schmid, “Image annotation with tagprop on the MIRFlickr set,” in Proceedings of the 11th ACM International Conference on Multimedia Information Retrieval, pp. 537–546, Philadelphia, Pa, USA, March 2010.
  33. V. Bruni and D. Vitulano, “An improvement of kernel-based object tracking based on human perception,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 44, no. 11, pp. 1474–1485, 2014. View at Publisher · View at Google Scholar · View at Scopus
  34. F. Wu, Z. Wang, Z. Zhang et al., “Weakly semi-supervised deep learning for multi-label image annotation,” IEEE Transactions on Big Data, vol. 1, no. 3, pp. 109–122, 2015. View at Publisher · View at Google Scholar