Conference item
GENESIS-V2: inferring unordered object representations without iterative refinement
- Abstract:
- Advances in unsupervised learning of object-representations have culminated in the development of a broad range of methods for unsupervised object segmentation and interpretable object-centric scene generation. These methods, however, are limited to simulated and real-world datasets with limited visual complexity. Moreover, object representations are often inferred using RNNs which do not scale well to large images or iterative refinement which avoids imposing an unnatural ordering on objects in an image but requires the a priori initialisation of a fixed number of object representations. In contrast to established paradigms, this work proposes an embedding-based approach in which embeddings of pixels are clustered in a differentiable fashion using a stochastic stick-breaking process. Similar to iterative refinement, this clustering procedure also leads to randomly ordered object representations, but without the need of initialising a fixed number of clusters a priori. This is used to develop a new model, GENESIS-v2, which can infer a variable number of object representations without using RNNs or iterative refinement. We show that GENESIS-v2 performs strongly in comparison to recent baselines in terms of unsupervised image segmentation and object-centric scene generation on established synthetic datasets as well as more complex real-world datasets.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Authors
+ Engineering & Physical Sciences Research Council
More from this funder
- Funder identifier:
- https://ror.org/0439y7842
- Grant:
- EP/M019918/1
- Publisher:
- Curran Associates
- Host title:
- Advances in Neural Information Processing Systems 34 (NeurIPS 2021)
- Volume:
- 10
- Pages:
- 8085-8094
- Publication date:
- 2022-05-01
- Acceptance date:
- 2021-04-21
- Event title:
- 35th Conference on Neural Information Processing Systems (NeurIPS 2021)
- Event location:
- Virtual event
- Event website:
- https://nips.cc/Conferences/2021/
- Event start date:
- 2021-12-06
- Event end date:
- 2021-12-14
- EISSN:
-
2218-6581
- ISBN:
- 9781713845393
- Language:
-
English
- Pubs id:
-
1173572
- Local pid:
-
pubs:1173572
- Deposit date:
-
2021-04-26
Terms of use
- Copyright holder:
- Engelcke et al. and NIPS
- Copyright date:
- 2021
- Rights statement:
- Copyright © (2021) by individual authors and Neural Information Processing Systems Foundation Inc. All rights reserved.
- Notes:
- This is the accepted manuscript version of the paper. The final version is available from the Neural Information Processing Systems Foundation at: https://proceedings.neurips.cc/paper/2021/hash/43ec517d68b6edd3015b3edc9a11367b-Abstract.html
If you are the owner of this record, you can report an update to it here: Report update to this record