FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving

Liu, Yuxuan; Xu, Zhenhua; Huang, Huaiyang; Wang, Lujia; Liu, Ming

Computer Science > Computer Vision and Pattern Recognition

arXiv:2304.10719 (cs)

[Submitted on 21 Apr 2023]

Title:FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving

Authors:Yuxuan Liu, Zhenhua Xu, Huaiyang Huang, Lujia Wang, Ming Liu

View PDF

Abstract:Predicting accurate depth with monocular images is important for low-cost robotic applications and autonomous driving. This study proposes a comprehensive self-supervised framework for accurate scale-aware depth prediction on autonomous driving scenes utilizing inter-frame poses obtained from inertial measurements. In particular, we introduce a Full-Scale depth prediction network named FSNet. FSNet contains four important improvements over existing self-supervised models: (1) a multichannel output representation for stable training of depth prediction in driving scenarios, (2) an optical-flow-based mask designed for dynamic object removal, (3) a self-distillation training strategy to augment the training process, and (4) an optimization-based post-processing algorithm in test time, fusing the results from visual odometry. With this framework, robots and vehicles with only one well-calibrated camera can collect sequences of training image frames and camera poses, and infer accurate 3D depths of the environment without extra labeling work or 3D data. Extensive experiments on the KITTI dataset, KITTI-360 dataset and the nuScenes dataset demonstrate the potential of FSNet. More visualizations are presented in \url{this https URL}

Comments:	12 pages. conditionally accepted by IEEE T-ASE
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2304.10719 [cs.CV]
	(or arXiv:2304.10719v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2304.10719

Submission history

From: Yuxuan Liu [view email]
[v1] Fri, 21 Apr 2023 03:17:04 UTC (2,936 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators