End-to-end, sequence-to-sequence probabilistic visual odometry through deep neural networks
Author(s)
Wang, Sen
Clark, Ronald
Wen, Hongkai
Trigoni, Niki
Type
Journal Article
Abstract
This paper studies visual odometry (VO) from the perspective of deep learning. After tremendous efforts in the robotics and computer vision communities over the past few decades, state-of-the-art VO algorithms have demonstrated incredible performance. However, since the VO problem is typically formulated as a pure geometric problem, one of the key features still missing from current VO systems is the capability to automatically gain knowledge and improve performance through learning. In this paper, we investigate whether deep neural networks can be effective and beneficial to the VO problem. An end-to-end, sequence-to-sequence probabilistic visual odometry (ESP-VO) framework is proposed for the monocular VO based on deep recurrent convolutional neural networks. It is trained and deployed in an end-to-end manner, that is, directly inferring poses and uncertainties from a sequence of raw images (video) without adopting any modules from the conventional VO pipeline. It can not only automatically learn effective feature representation encapsulating geometric information through convolutional neural networks, but also implicitly model sequential dynamics and relation for VO using deep recurrent neural networks. Uncertainty is also derived along with the VO estimation without introducing much extra computation. Extensive experiments on several datasets representing driving, flying and walking scenarios show competitive performance of the proposed ESP-VO to the state-of-the-art methods, demonstrating a promising potential of the deep learning technique for VO and verifying that it can be a viable complement to current VO systems.
Date Issued
2018-04-01
Date Acceptance
2016-06-18
Citation
International Journal of Robotics Research, 2018, 37 (4-5), pp.513-542
ISSN
0278-3649
Publisher
SAGE Publications
Start Page
513
End Page
542
Journal / Book Title
International Journal of Robotics Research
Volume
37
Issue
4-5
Copyright Statement
© The Author(s) 2017. Published by Sage Publications Ltd.
Identifier
http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000432134700008&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=1ba7043ffcc86c417c072aa74d649202
Subjects
Science & Technology
Technology
Robotics
Visual odometry
pose estimation
uncertainty
deep learning
recurrent convolutional neural networks
CAMERA
SLAM
REPRESENTATION
RECOGNITION
DATASET
VISION
STEREO
MEMORY
Publication Status
Published
Coverage Spatial
Univ Michigan, Ann Arbor, MI
Date Publish Online
2017-10-16