3DPointCaps+plus : Learning 3D representations with capsule networks
File(s)s11263-022-01632-6.pdf (2.06 MB)
Published version
Author(s)
Type
Journal Article
Abstract
We present 3DPointCaps++ for learning robust, flexible and generalizable 3D object representations without requiring heavy annotation efforts or supervision. Unlike conventional 3D generative models, our algorithm aims for building a structured latent space where certain factors of shape variations, such as object parts, can be disentangled into independent sub-spaces. Our novel decoder then acts on these individual latent sub-spaces (i.e. capsules) using deconvolution operators to reconstruct 3D points in a self-supervised manner. We further introduce a cluster loss ensuring that the points reconstructed by a single capsule remain local and do not spread across the object uncontrollably. These contributions allow our network to tackle the challenging tasks of part segmentation, part interpolation/replacement as well as correspondence estimation across rigid / non-rigid shape, and across / within category. Our extensive evaluations on ShapeNet objects and human scans demonstrate that our network can learn generic representations that are robust and useful in many applications.
Date Issued
2022-07-30
Date Acceptance
2022-04-27
Citation
International Journal of Computer Vision, 2022, 130 (9), pp.2321-2336
ISSN
0920-5691
Publisher
Springer
Start Page
2321
End Page
2336
Journal / Book Title
International Journal of Computer Vision
Volume
130
Issue
9
Copyright Statement
© The Author(s) 2022. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
License URL
Identifier
https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000833499200001&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=1ba7043ffcc86c417c072aa74d649202
Subjects
3D Point clouds
3D Reconstruction
3D Shapes
Autoencoder
Capsule networks
Computer Science
Computer Science, Artificial Intelligence
Representation learning
Science & Technology
Technology
Unsupervised learning
Publication Status
Published
Date Publish Online
2022-07-30