Conference item icon

Conference item

GENESIS-V2: inferring unordered object representations without iterative refinement

Abstract:
Advances in unsupervised learning of object-representations have culminated in the development of a broad range of methods for unsupervised object segmentation and interpretable object-centric scene generation. These methods, however, are limited to simulated and real-world datasets with limited visual complexity. Moreover, object representations are often inferred using RNNs which do not scale well to large images or iterative refinement which avoids imposing an unnatural ordering on objects in an image but requires the a priori initialisation of a fixed number of object representations. In contrast to established paradigms, this work proposes an embedding-based approach in which embeddings of pixels are clustered in a differentiable fashion using a stochastic stick-breaking process. Similar to iterative refinement, this clustering procedure also leads to randomly ordered object representations, but without the need of initialising a fixed number of clusters a priori. This is used to develop a new model, GENESIS-v2, which can infer a variable number of object representations without using RNNs or iterative refinement. We show that GENESIS-v2 performs strongly in comparison to recent baselines in terms of unsupervised image segmentation and object-centric scene generation on established synthetic datasets as well as more complex real-world datasets.
Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Oxford college:
Pembroke College
Role:
Author


More from this funder
Funder identifier:
https://ror.org/0439y7842
Grant:
EP/M019918/1


Publisher:
Curran Associates
Host title:
Advances in Neural Information Processing Systems 34 (NeurIPS 2021)
Volume:
10
Pages:
8085-8094
Publication date:
2022-05-01
Acceptance date:
2021-04-21
Event title:
35th Conference on Neural Information Processing Systems (NeurIPS 2021)
Event location:
Virtual event
Event website:
https://nips.cc/Conferences/2021/
Event start date:
2021-12-06
Event end date:
2021-12-14
EISSN:
2218-6581
ISBN:
9781713845393


Language:
English
Pubs id:
1173572
Local pid:
pubs:1173572
Deposit date:
2021-04-26

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP