Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • About
  • Communities & Collections
  • Advanced Search
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Engineering
  3. Computing
  4. Computing
  5. SIMstack: a generative shape and instance model for unordered object stacks
 
  • Details
SIMstack: a generative shape and instance model for unordered object stacks
File(s)
2103.16442v2.pdf (107.65 MB)
Accepted version
Author(s)
Landgraf, Zoe
Scona, Raluca
Laidlow, Tristan
James, Stephen
Leutenegger, Stefan
more
Type
Conference Paper
Abstract
By estimating 3D shape and instances from a single view, we can capture information about an environment quickly, without the need for comprehensive scanning and multi-view fusion. Solving this task for composite scenes (such as object stacks) is challenging: occluded areas are not only ambiguous in shape but also in instance segmentation; multiple decompositions could be valid. We observe that physics constrains decomposition as well as shape in occluded regions and hypothesise that a latent space learned from scenes built under physics simulation can serve as a prior to better predict shape and instances in occluded regions. To this end we propose SIMstack, a depth-conditioned Variational Auto-Encoder (VAE), trained on a dataset of objects stacked under physics simulation. We formulate instance segmentation as a centre voting task which allows for class-agnostic detection and doesn’t require setting the maximum number of objects in the scene. At test time, our model can generate 3D shape and instance segmentation from a single depth view, probabilistically sampling proposals for the occluded region from the learned latent space. Our method has practical applications in providing robots some of the ability humans have to make rapid intuitive inferences of partially observed scenes. We demonstrate an application for precise (non-disruptive) object grasping of unknown objects from a single depth view.
Date Issued
2022-02-28
Date Acceptance
2021-10-10
Citation
2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2022
URI
http://hdl.handle.net/10044/1/97140
URL
https://ieeexplore.ieee.org/document/9710412
DOI
https://www.dx.doi.org/10.1109/iccv48922.2021.01277
Publisher
IEEE
Journal / Book Title
2021 IEEE/CVF International Conference on Computer Vision (ICCV)
Copyright Statement
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Sponsor
Engineering & Physical Science Research Council (EPSRC)
Dyson Technology Limited
Dyson Technology Limited
Identifier
https://ieeexplore.ieee.org/document/9710412
Grant Number
EP/S036636/1
PO4500503359
PO 4500501004
Source
2021 IEEE/CVF International Conference on Computer Vision (ICCV)
Subjects
cs.CV
cs.CV
Publication Status
Published
Start Date
2021-10-10
Finish Date
2021-10-17
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback