short-paper

Multi-view Semi-supervised Learning for Web Image Annotation

Authors:

Fuhao Zou

MM '15: Proceedings of the 23rd ACM international conference on Multimedia

Pages 947 - 950

https://doi.org/10.1145/2733373.2806371

Published: 13 October 2015

Abstract

With the explosive growth of web image data, image annotation has become a critical research issue for semantic image indexing and search. In this work, we propose a novel model, termed multi-view semi-supervised learning (MVSSL), for robust image annotation. Specifically, we exploit both labeled and unlabeled images to uncover the intrinsic structural information of the data. Meanwhile, to describe each individual datum comprehensively, we take advantage of the correlated and complementary information derived from multiple facets of image data (i.e., multiple views or features). We devise a robust pair-wise constraint on the outcomes of different views to achieve annotation consistency. Furthermore, we integrate a robust classifier learning component via an ℓ2,1 loss, which helps identify noisy samples during the learning process. Finally, we devise an efficient iterative algorithm to solve the optimization problem in MVSSL. We conduct extensive experiments on the NUS-WIDE dataset, and the results suggest that the proposed approach is promising for large-scale web image annotation.
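The abstract does not state the MVSSL objective, so the following is only a minimal sketch of the general idea behind an ℓ2,1 loss, not the authors' formulation. It assumes the common definition in which the ℓ2,1 norm of a residual matrix is the sum of the Euclidean norms of its rows (one row per image). The function names `l21_loss`, `squared_loss`, and `view_disagreement` are hypothetical, and measuring disagreement between two views with the same row-wise norm is an assumption made for illustration.

```python
import math


def l21_loss(predictions, targets):
    """Sum over samples of the Euclidean norm of each residual row.

    Each row is one image; each column is one label score.
    The per-sample residual is NOT squared, so a single badly
    mislabeled sample contributes linearly rather than quadratically.
    """
    total = 0.0
    for pred_row, target_row in zip(predictions, targets):
        total += math.sqrt(sum((p - t) ** 2 for p, t in zip(pred_row, target_row)))
    return total


def squared_loss(predictions, targets):
    """Ordinary sum of squared residuals, shown only for comparison."""
    return sum(
        (p - t) ** 2
        for pred_row, target_row in zip(predictions, targets)
        for p, t in zip(pred_row, target_row)
    )


def view_disagreement(view_a_scores, view_b_scores):
    """Hypothetical pair-wise consistency term between two views.

    Assumption: disagreement is measured with the same row-wise norm,
    so it is zero when both views predict identical label scores.
    """
    return l21_loss(view_a_scores, view_b_scores)


# Three images, two labels; the third prediction is far off (a noisy sample).
targets = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]
noisy_predictions = [[1.0, 0.0], [0.0, 1.0], [4.0, 4.0]]

print(l21_loss(noisy_predictions, targets))      # residual row (3, 4) has norm 5
print(squared_loss(noisy_predictions, targets))  # same row contributes 9 + 16
print(view_disagreement(targets, targets))       # identical views disagree by 0
```

In this toy example the outlying sample contributes 5 to the row-wise loss but 25 to the squared loss, which illustrates why a loss of this kind is generally considered less sensitive to noisy samples.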

Index Terms

Multi-view Semi-supervised Learning for Web Image Annotation
1. Computing methodologies
  1. Machine learning
2. Information systems
  1. Information retrieval
    1. Document representation
    2. Search engine architectures and scalability
      1. Search engine indexing

Published In

MM '15: Proceedings of the 23rd ACM international conference on Multimedia

October 2015

1402 pages

ISBN:9781450334594

DOI:10.1145/2733373

General Chairs:
Xiaofang Zhou, The University of Queensland, Australia
Alan F. Smeaton, Dublin City University, Ireland
Qi Tian, The University of Texas at San Antonio, USA

Program Chairs:
Dick C.A. Bulterman, FXPAL, USA
Heng Tao Shen, The University of Queensland, Australia
Ketan Mayer-Patel, The University of North Carolina, USA
Shuicheng Yan, National University of Singapore, Singapore

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

MM '15

Sponsor:

SIGMM

MM '15: ACM Multimedia Conference

October 26 - 30, 2015

Brisbane, Australia

Acceptance Rates

MM '15 Paper Acceptance Rate 56 of 252 submissions, 22%;

Overall Acceptance Rate 995 of 4,171 submissions, 24%
