4
IRUS TotalDownloads
Altmetric
Computing CNN loss and gradients for pose estimation with Riemannian geometry
File | Description | Size | Format | |
---|---|---|---|---|
hou2018miccai.pdf | Accepted version | 679.14 kB | Adobe PDF | View/Open |
Title: | Computing CNN loss and gradients for pose estimation with Riemannian geometry |
Authors: | Hou, B Miolane, N Khanal, B Lee, M Alansary, A McDonagh, SG Hajnal, JV Rueckert, D Glocker, B Kainz, B |
Item Type: | Conference Paper |
Abstract: | Pose estimation, i.e. predicting a 3D rigid transformation with respect to a fixed co-ordinate frame in, SE(3), is an omnipresent problem in medical image analysis. Deep learning methods often parameterise poses with a representation that separates rotation and translation. As commonly available frameworks do not provide means to calculate loss on a manifold, regression is usually performed using the L2-norm independently on the rotation’s and the translation’s parameterisations. This is a metric for linear spaces that does not take into account the Lie group structure of SE(3). In this paper, we propose a general Riemannian formulation of the pose estimation problem, and train CNNs directly on SE(3) equipped with a left-invariant Riemannian metric. The loss between the ground truth and predicted pose (elements of the manifold) is calculated as the Riemannian geodesic distance, which couples together the translation and rotation components. Network weights are updated by back-propagating the gradient with respect to the predicted pose on the tangent space of the manifold SE(3). We thoroughly evaluate the effectiveness of our loss function by comparing its performance with popular and most commonly used existing methods, on tasks such as image-based localisation and intensity-based 2D/3D registration. We also show that hyper-parameters, used in our loss function to weight the contribution between rotations and translations, can be intrinsically calculated from the dataset to achieve greater performance margins. |
Issue Date: | 26-Sep-2018 |
Date of Acceptance: | 25-May-2018 |
URI: | http://hdl.handle.net/10044/1/60749 |
DOI: | 10.1007/978-3-030-00928-1_85 |
ISSN: | 0302-9743 |
Publisher: | Springer Verlag |
Start Page: | 756 |
End Page: | 764 |
Journal / Book Title: | Lecture Notes in Computer Science |
Copyright Statement: | © Springer Nature Switzerland AG 2018. The final publication is available at Springer via https://doi.org/10.1007/978-3-030-00928-1_85 |
Sponsor/Funder: | Wellcome Trust/EPSRC Wellcome Trust Engineering & Physical Science Research Council (E Nvidia Engineering & Physical Science Research Council (E Wellcome Trust |
Funder's Grant Number: | NS/A000025/1 RTJ5557761 RTJ5557761-1 Nvidia Hardware donation RTJ5557761-1 PO :RTJ5557761-1 |
Conference Name: | International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) |
Keywords: | Science & Technology Technology Computer Science, Theory & Methods Imaging Science & Photographic Technology Computer Science RECONSTRUCTION METRICS cs.CV cs.CV Artificial Intelligence & Image Processing |
Publication Status: | Published |
Start Date: | 2018-09-16 |
Finish Date: | 2018-09-20 |
Conference Place: | Granada, Spain |
Online Publication Date: | 2018-09-26 |
Appears in Collections: | Computing Faculty of Engineering |