Identifiable object representations under spatial ambiguities
File(s)kori25a.pdf (2.51 MB)
Published version
Author(s)
Kori, A
Toni, F
Glocker, B
Type
Conference Paper
Abstract
Modular object-centric representations are essential for human-like reasoning but are challenging to obtain under spatial ambiguities, e.g. due to occlusions and view ambiguities. However, addressing challenges presents both theoretical and practical difficulties. We introduce a novel multi-view probabilistic approach that aggregates view-specific slots to capture invariant content information while simultaneously learning disentangled global viewpoint-level information. Unlike prior single-view methods, our approach resolves spatial ambiguities, provides theoretical guarantees for identifiability, and requires no viewpoint annotations. Extensive experiments on standard benchmarks and novel complex datasets validate our method’s robustness and scalability.
Date Issued
2025-07-13
Date Acceptance
2025-07-01
Citation
Proceedings of Machine Learning Research, 2025, 267, pp.31486-31518
ISSN
2640-3498
Publisher
MLResearchPress
Start Page
31486
End Page
31518
Journal / Book Title
Proceedings of Machine Learning Research
Volume
267
Copyright Statement
© The authors and PMLR 2026. MLResearchPress
Source
International Conference on Machine Learning 2025
Publication Status
Published
Start Date
2025-07-13
Finish Date
2025-07-19
Coverage Spatial
Vancouver, Canada