Self-Supervised Geometric Correspondence for Category-Level 6D Object Pose Estimation in the Wild

Zhang, Kaifeng; Fu, Yang; Borse, Shubhankar; Cai, Hong; Porikli, Fatih; Wang, Xiaolong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2210.07199 (cs)

[Submitted on 13 Oct 2022 (v1), last revised 3 Apr 2023 (this version, v3)]

Title:Self-Supervised Geometric Correspondence for Category-Level 6D Object Pose Estimation in the Wild

Authors:Kaifeng Zhang, Yang Fu, Shubhankar Borse, Hong Cai, Fatih Porikli, Xiaolong Wang

View PDF

Abstract:While 6D object pose estimation has wide applications across computer vision and robotics, it remains far from being solved due to the lack of annotations. The problem becomes even more challenging when moving to category-level 6D pose, which requires generalization to unseen instances. Current approaches are restricted by leveraging annotations from simulation or collected from humans. In this paper, we overcome this barrier by introducing a self-supervised learning approach trained directly on large-scale real-world object videos for category-level 6D pose estimation in the wild. Our framework reconstructs the canonical 3D shape of an object category and learns dense correspondences between input images and the canonical shape via surface embedding. For training, we propose novel geometrical cycle-consistency losses which construct cycles across 2D-3D spaces, across different instances and different time steps. The learned correspondence can be applied for 6D pose estimation and other downstream tasks such as keypoint transfer. Surprisingly, our method, without any human annotations or simulators, can achieve on-par or even better performance than previous supervised or semi-supervised methods on in-the-wild images. Our project page is: this https URL .

Comments:	Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2210.07199 [cs.CV]
	(or arXiv:2210.07199v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2210.07199

Submission history

From: Kaifeng Zhang [view email]
[v1] Thu, 13 Oct 2022 17:19:22 UTC (32,138 KB)
[v2] Sun, 12 Mar 2023 09:05:30 UTC (5,246 KB)
[v3] Mon, 3 Apr 2023 05:35:31 UTC (5,246 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Self-Supervised Geometric Correspondence for Category-Level 6D Object Pose Estimation in the Wild

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Self-Supervised Geometric Correspondence for Category-Level 6D Object Pose Estimation in the Wild

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators