skip to main content
10.1145/2964284.2967182acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

Deep Bi-directional Cross-triplet Embedding for Cross-Domain Clothing Retrieval

Published: 01 October 2016 Publication History

Abstract

In this paper, we address two practical problems when shopping online: 1) What will I look like when wearing this clothing on the street? 2) How to find the exact same or similar clothing that other people are wearing on the street or in a movie? In this paper, we jointly solve these two problems with one bi-directional shop-to-street street-to-shop clothing retrieval framework. There are three main challenges of cross-domain clothing retrieval task. First is to learn the discrepancy (e.g., background, pose, illumination) between street domain and shop domain clothing. Second, both intra-domain and cross-domain similarity need to be considered during feature embedding. Third, there is large bias between the number of matched and non-matched street and shop pairs. To solve these challenges, in this paper, we propose a deep bi-directional cross-triplet embedding algorithm by extending the start-of-the-art triplet embedding into cross-domain retrieval scenario. Extensive experiments demonstrate the effectiveness of the proposed algorithm.

References

[1]
S. Bell and K. Bala. Learning visual similarity for product design with convolutional neural networks. ACM Transactions on Graphics, 34(4):98, 2015.
[2]
J. Bromley, J. W. Bentz, L. Bottou, I. Guyon, Y. LeCun, C. Moore, E. Sackinger, and R. Shah. Signature verification using a "siamese" time delay neural network. International Journal of Pattern Recognition and Artificial Intelligence, 7(04):669--688, 1993.
[3]
Q. Chen, J. Huang, R. Feris, L. M. Brown, J. Dong, and S. Yan. Deep domain adaptation for describing people based on fine-grained clothing attributes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 5315--5324, 2015.
[4]
W. Di, C. Wah, A. Bhardwaj, R. Piramuthu, and N. Sundaresan. Style finder: Fine-grained clothing style detection and retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pages 8--13, 2013.
[5]
J. Fu, J. Wang, Z. Li, M. Xu, and H. Lu. Efficient clothing retrieval with semantic-preserving visual phrases. In Proceedings of the Asian Conference on Computer Vision, pages 420--431. Springer, 2012.
[6]
M. Hadi Kiapour, X. Han, S. Lazebnik, A. C. Berg, and T. L. Berg. Where to buy it: Matching street clothing photos in online shops. In Proceedings of the IEEE International Conference on Computer Vision, pages 3343--3351, 2015.
[7]
R. Hadsell, S. Chopra, and Y. LeCun. Dimensionality reduction by learning an invariant mapping. In Proceedings of the IEEE computer society conference on Computer vision and pattern recognition, volume 2, pages 1735--1742. IEEE, 2006.
[8]
T. Iwata, S. Wanatabe, and H. Sawada. Fashion coordinates recommender system using photographs from fashion magazines. In International Joint Conference on Artificial Intelligence, volume 22, page 2262. Citeseer, 2011.
[9]
V. Jagadeesh, R. Piramuthu, A. Bhardwaj, W. Di, and N. Sundaresan. Large scale visual recommendations from street fashion images. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1925--1934. ACM, 2014.
[10]
Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell. Caffe: Convolutional architecture for fast feature embedding. In Proceedings of the ACM International Conference on Multimedia, pages 675--678. ACM, 2014.
[11]
S. Jiang, M. Shao, C. Jia, and Y. Fu. Consensus style centralizing auto-encoder for weak style classification. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence. AAAI, 2016.
[12]
Y. Kalantidis, L. Kennedy, and L.-J. Li. Getting the look: clothing recognition and segmentation for automatic product suggestions in everyday photos. In Proceedings of the 3rd ACM conference on International conference on multimedia retrieval, pages 105--112. ACM, 2013.
[13]
M. H. Kiapour, K. Yamaguchi, A. C. Berg, and T. L. Berg. Hipster wars: Discovering elements of fashion styles. In Proceedings of the European Conference on Computer Vision, pages 472--488. Springer, 2014.
[14]
A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems, pages 1097--1105, 2012.
[15]
S. Liu, J. Feng, Z. Song, T. Zhang, H. Lu, C. Xu, and S. Yan. Hi, magic closet, tell me what to wear! In Proceedings of the 20th ACM International Conference on Multimedia, pages 619--628. ACM, 2012.
[16]
S. Liu, X. Liang, L. Liu, X. Shen, J. Yang, C. Xu, L. Lin, X. Cao, and S. Yan. Matching-cnn meets knn: quasi-parametric human parsing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1419--1427, 2015.
[17]
S. Liu, Z. Song, G. Liu, C. Xu, H. Lu, and S. Yan. Street-to-shop: Cross-scenario clothing retrieval via parts alignment and auxiliary set. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3330--3337. IEEE, 2012.
[18]
K. Saenko, B. Kulis, M. Fritz, and T. Darrell. Adapting visual category models to new domains. In European conference on computer vision, pages 213--226. Springer, 2010.
[19]
F. Schroff, D. Kalenichenko, and J. Philbin. Facenet: A unified embedding for face recognition and clustering. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 815--823, 2015.
[20]
H. O. Song, Y. Xiang, S. Jegelka, and S. Savarese. Deep metric learning via lifted structured feature embedding. arXiv preprint arXiv:1511.06452, 2015.
[21]
A. Veit, B. Kovacs, S. Bell, J. McAuley, K. Bala, and S. Belongie. Learning visual clothing style with heterogeneous dyadic co-occurrences. In Proceedings of the IEEE International Conference on Computer Vision, pages 4642--4650, 2015.
[22]
S. Vittayakorn, K. Yamaguchi, A. C. Berg, and T. L. Berg. Runway to realway: Visual analysis of fashion. In IEEE Winter Conference on Applications of Computer Vision, pages 951--958. IEEE, 2015.
[23]
J. Wang, Y. Song, T. Leung, C. Rosenberg, J. Wang, J. Philbin, B. Chen, and Y. Wu. Learning fine-grained image similarity with deep ranking. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1386--1393, 2014.
[24]
K. Q. Weinberger and L. K. Saul. Distance metric learning for large margin nearest neighbor classification. The Journal of Machine Learning Research, 10:207--244, 2009.
[25]
K. Yamaguchi, M. Kiapour, and T. Berg. Paper doll parsing: Retrieving similar styles to parse clothing items. In Proceedings of the IEEE International Conference on Computer Vision, pages 3519--3526, 2013.
[26]
K. Yamaguchi, M. H. Kiapour, L. E. Ortiz, and T. L. Berg. Parsing clothing in fashion photographs. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3570--3577. IEEE, 2012.

Cited By

View all

Index Terms

  1. Deep Bi-directional Cross-triplet Embedding for Cross-Domain Clothing Retrieval

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '16: Proceedings of the 24th ACM international conference on Multimedia
    October 2016
    1542 pages
    ISBN:9781450336031
    DOI:10.1145/2964284
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 October 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. clothing retrieval
    2. cross domain
    3. triplet embedding

    Qualifiers

    • Short-paper

    Funding Sources

    • ONR Young Investigator Award
    • ONR award
    • MIT Lincoln Labs Grant
    • U.S. Army Research Office Young Investigator Award

    Conference

    MM '16
    Sponsor:
    MM '16: ACM Multimedia Conference
    October 15 - 19, 2016
    Amsterdam, The Netherlands

    Acceptance Rates

    MM '16 Paper Acceptance Rate 52 of 237 submissions, 22%;
    Overall Acceptance Rate 995 of 4,171 submissions, 24%

    Upcoming Conference

    MM '24
    The 32nd ACM International Conference on Multimedia
    October 28 - November 1, 2024
    Melbourne , VIC , Australia

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 16 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)BangleFIR: bridging the gap in fashion image retrieval with a novel dataset of banglesMultimedia Tools and Applications10.1007/s11042-024-19698-4Online publication date: 10-Jul-2024
    • (2023)A Survey on Fashion Image RetrievalACM Computing Surveys10.1145/363655256:6(1-25)Online publication date: 13-Dec-2023
    • (2023)Study of AI-Driven Fashion Recommender SystemsSN Computer Science10.1007/s42979-023-01932-94:5Online publication date: 5-Jul-2023
    • (2023)Ornament image retrieval using few-shot learningInternational Journal of Multimedia Information Retrieval10.1007/s13735-023-00299-012:2Online publication date: 31-Aug-2023
    • (2022)Would Your Clothes Look Good on Me? Towards Transferring Clothing Styles with Adaptive Instance NormalizationSensors10.3390/s2213500222:13(5002)Online publication date: 2-Jul-2022
    • (2022)Feature Norm-Based Deep Network for Multi-Domain Fashion Image RetrievalAATCC Journal of Research10.14504/ajr.8.S1.268:1_suppl(219-228)Online publication date: 28-Mar-2022
    • (2022)A Multi-Task Model for Multi-Attribute Fashion Recognition and RetrievalAATCC Journal of Research10.14504/ajr.8.S1.148:1_suppl(105-116)Online publication date: 28-Mar-2022
    • (2022)Effort-Aware Just-in-Time Bug Prediction for Mobile Apps Via Cross-Triplet Deep Feature EmbeddingIEEE Transactions on Reliability10.1109/TR.2021.306617071:1(204-220)Online publication date: Mar-2022
    • (2022)Deep Learning for Fashion Style GenerationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2021.305789233:9(4538-4550)Online publication date: Sep-2022
    • (2022)Deep Learning Approaches for Fashion Knowledge Extraction From Social Media: A ReviewIEEE Access10.1109/ACCESS.2021.313789310(1545-1576)Online publication date: 2022
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media