Conference item icon

Conference item

Moving object segmentation: all you need is SAM (and flow)

Abstract:

The objective of this paper is motion segmentation – discovering and segmenting the moving objects in a video. This is a much studied area with numerous careful, and sometimes complex, approaches and training schemes including: self-supervised learning, learning from synthetic datasets, object-centric representations, amodal representations, and many more. Our interest in this paper is to determine if the Segment Anything model (SAM) can contribute to this task. We investigate two models for combining SAM with optical flow that harness the segmentation power of SAM with the ability of flow to discover and group moving objects. In the first model, we adapt SAM to take optical flow, rather than RGB, as an input. In the second, SAM takes RGB as an input, and flow is used as a segmentation prompt. These surprisingly simple methods, without any further modifications, outperform all previous approaches by a considerable margin in both single and multiobject benchmarks. We also extend these frame-level segmentations to sequence-level segmentations that maintain object identity. Again, this simple model achieves outstanding performance across multiple moving object segmentation benchmarks.

Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Publisher copy:
10.1007/978-981-96-0972-7_17

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Oxford college:
Brasenose College
Role:
Author
ORCID:
0000-0002-8945-8573


More from this funder
Funder identifier:
https://ror.org/0439y7842
Grant:
EP/T028572/1


Publisher:
Springer
Host title:
Computer Vision – ACCV 2024
Series:
Lecture Notes in Computer Science
Series number:
15481
Publication date:
2024-12-10
Acceptance date:
2024-09-20
Event title:
27th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2024)
Event location:
Marrakesh, Morocco
Event website:
https://conferences.miccai.org/2024/en/
Event start date:
2024-10-06
Event end date:
2024-10-10
DOI:
EISSN:
1611-3349
ISSN:
0302-9743
EISBN:
9789819609727
ISBN:
9789819609710


Language:
English
Keywords:
Pubs id:
2063475
Local pid:
pubs:2063475
Deposit date:
2024-11-19

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP