Preprint icon

Preprint

Contrastive lift: 3D object instance segmentation by slow-fast contrastive fusion

Abstract:
Instance segmentation in 3D is a challenging task due to the lack of large-scale annotated datasets. In this paper, we show that this task can be addressed effectively by leveraging instead 2D pre-trained models for instance segmentation. We propose a novel approach to lift 2D segments to 3D and fuse them by means of a neural field representation, which encourages multi-view consistency across frames. The core of our approach is a slow-fast clustering objective function, which is scalable and well-suited for scenes with a large number of objects. Unlike previous approaches, our method does not require an upper bound on the number of objects or object tracking across frames. To demonstrate the scalability of the slow-fast clustering, we create a new semi-realistic dataset called the Messy Rooms dataset, which features scenes with up to 500 objects per scene. Our approach outperforms the state-of-the-art on challenging scenes from the ScanNet, Hypersim, and Replica datasets, as well as on our newly created Messy Rooms dataset, demonstrating the effectiveness and scalability of our slow-fast clustering method.
Publication status:
Published
Peer review status:
Not peer reviewed

Actions


Access Document


Preprint server copy:
10.48550/arxiv.2306.04633

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Research group:
Visual Geometry Group
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Research group:
Visual Geometry Group
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Research group:
Visual Geometry Group
Role:
Author
ORCID:
0000-0002-2478-2102
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Research group:
Visual Geometry Group
Oxford college:
Brasenose College
Role:
Author
ORCID:
0000-0002-8945-8573
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Research group:
Visual Geometry Group
Oxford college:
New College
Role:
Author
ORCID:
0000-0003-1374-2858


More from this funder
Funder identifier:
https://ror.org/0439y7842
Grant:
EP/T028572/1


Preprint server:
arXiv
Publication date:
2023-06-07
DOI:


Language:
English
Pubs id:
1510141
Local pid:
pubs:1510141
Deposit date:
2024-06-13

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP