skip to main content
10.1145/2733373.2806371acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

Multi-view Semi-supervised Learning for Web Image Annotation

Published: 13 October 2015 Publication History

Abstract

With the explosive increasing of web image data, image annotation has become a critical research issue for image semantic index and search. In this work, we propose a novel model, termed as multi-view semi-supervised learning (MVSSL), for robust image annotation task. Specifically, we exploit both labeled images and unlabeled images to uncover the intrinsic data structural information. Meanwhile, to comprehensively describe an individual datum, we take advantage of the correlated and complemental information derived from multiple facets of image data (i.e., multiple views or features). We devise a robust pair-wise constraint on outcomes of different views to achieve annotation consistency. Furthermore, we integrate a robust classifier learning component via l2,1 loss, which can provide effective noise identification power during the learning process. Finally, we devise an efficient iterative algorithm to solve the optimization problem in MVSSL. We conduct extensive experiments on the NUS-WIDE dataset, and the results illustrate that our proposed approach is promising for large scale web image annotation task.

References

[1]
T. Chua, J. Tang, R. Hong, H. Li, Z. Luo, and Y. Zheng. NUS-WIDE: a real-world web image database from national university of singapore. In CIVR, pages 48:1--48:9, 2009.
[2]
J. D. R. Farquhar, D. R. Hardoon, H. Meng, J. Shawe-Taylor, and S. Szedm�k. Two view learning: Svm-2k, theory and practice. In NIPS, pages 355--362, 2005.
[3]
Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. B. Girshick, S. Guadarrama, and T. Darrell. Caffe: Convolutional architecture for fast feature embedding. In ACM Multimedia, pages 675--678, 2014.
[4]
D. G. Lowe. Object recognition from local scale-invariant features. In ICCV, pages 1150--1157, 1999.
[5]
A. W. M. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of the early years. TPAMI, 22(12):1349--1380, 2000.
[6]
Y. Yang, Y. Yang, Z. Huang, H. T. Shen, and F. Nie. Tag localization with spatial correlations and joint group sparsity. In CVPR, pages 881--888, 2011.
[7]
Y. Yang, Z. Zha, Y. Gao, X. Zhu, and T. Chua. Exploiting web images for semantic video indexing via robust sample-specific loss. TMM, 16(6):1677--1689, 2014.
[8]
D. Zhang, M. M. Islam, and G. Lu. A review on automatic image annotation techniques. Pattern Recognition, 45(1):346--362, 2012.
[9]
H. Zhang, Z. Zha, Y. Yang, S. Yan, Y. Gao, and T. Chua. Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval. In ACM Multimedia, pages 33--42, 2013.
[10]
X. Zhu. Semi-supervised learning literature survey. 2005.

Cited By

View all
  • (2018)Visual Spatial Attention Network for Relationship DetectionProceedings of the 26th ACM international conference on Multimedia10.1145/3240508.3240611(510-518)Online publication date: 15-Oct-2018
  • (2018)Hashing with Angular Reconstructive EmbeddingsIEEE Transactions on Image Processing10.1109/TIP.2017.274914727:2(545-555)Online publication date: Feb-2018
  • (2017)Robust Web Image Annotation via Exploring Multi-Facet and Structural KnowledgeIEEE Transactions on Image Processing10.1109/TIP.2017.271718526:10(4871-4884)Online publication date: Oct-2017
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '15: Proceedings of the 23rd ACM international conference on Multimedia
October 2015
1402 pages
ISBN:9781450334594
DOI:10.1145/2733373
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 October 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. image annotation
  2. multi-view learning
  3. semi-supervised learning

Qualifiers

  • Short-paper

Conference

MM '15
Sponsor:
MM '15: ACM Multimedia Conference
October 26 - 30, 2015
Brisbane, Australia

Acceptance Rates

MM '15 Paper Acceptance Rate 56 of 252 submissions, 22%;
Overall Acceptance Rate 995 of 4,171 submissions, 24%

Upcoming Conference

MM '24
The 32nd ACM International Conference on Multimedia
October 28 - November 1, 2024
Melbourne , VIC , Australia

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Visual Spatial Attention Network for Relationship DetectionProceedings of the 26th ACM international conference on Multimedia10.1145/3240508.3240611(510-518)Online publication date: 15-Oct-2018
  • (2018)Hashing with Angular Reconstructive EmbeddingsIEEE Transactions on Image Processing10.1109/TIP.2017.274914727:2(545-555)Online publication date: Feb-2018
  • (2017)Robust Web Image Annotation via Exploring Multi-Facet and Structural KnowledgeIEEE Transactions on Image Processing10.1109/TIP.2017.271718526:10(4871-4884)Online publication date: Oct-2017
  • (2016)Dropout prediction in MOOCs using behavior features and multi-view semi-supervised learning2016 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN.2016.7727598(3130-3137)Online publication date: Jul-2016

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media