Conference item
Moving object segmentation: all you need is SAM (and flow)
- Abstract:
-
The objective of this paper is motion segmentation – discovering and segmenting the moving objects in a video. This is a much studied area with numerous careful, and sometimes complex, approaches and training schemes including: self-supervised learning, learning from synthetic datasets, object-centric representations, amodal representations, and many more. Our interest in this paper is to determine if the Segment Anything model (SAM) can contribute to this task. We investigate two models for combining SAM with optical flow that harness the segmentation power of SAM with the ability of flow to discover and group moving objects. In the first model, we adapt SAM to take optical flow, rather than RGB, as an input. In the second, SAM takes RGB as an input, and flow is used as a segmentation prompt. These surprisingly simple methods, without any further modifications, outperform all previous approaches by a considerable margin in both single and multiobject benchmarks. We also extend these frame-level segmentations to sequence-level segmentations that maintain object identity. Again, this simple model achieves outstanding performance across multiple moving object segmentation benchmarks.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Authors
- Funder identifier:
- https://ror.org/0439y7842
- Grant:
- EP/T028572/1
- Publisher:
- Springer
- Host title:
- Computer Vision – ACCV 2024
- Series:
- Lecture Notes in Computer Science
- Series number:
- 15481
- Publication date:
- 2024-12-10
- Acceptance date:
- 2024-09-20
- Event title:
- 27th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2024)
- Event location:
- Marrakesh, Morocco
- Event website:
- https://conferences.miccai.org/2024/en/
- Event start date:
- 2024-10-06
- Event end date:
- 2024-10-10
- DOI:
- EISSN:
-
1611-3349
- ISSN:
-
0302-9743
- EISBN:
- 9789819609727
- ISBN:
- 9789819609710
- Language:
-
English
- Keywords:
- Pubs id:
-
2063475
- Local pid:
-
pubs:2063475
- Deposit date:
-
2024-11-19
Terms of use
- Copyright holder:
- Xie et al.
- Copyright date:
- 2025
- Rights statement:
- © 2025 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
If you are the owner of this record, you can report an update to it here: Report update to this record