Conference item
CoTracker: it is better to track together
- Abstract:
- We introduce CoTracker, a transformer-based model that tracks a large number of 2D points in long video sequences. Differently from most existing approaches that track points independently, CoTracker tracks them jointly, accounting for their dependencies. We show that joint tracking significantly improves tracking accuracy and robustness, and allows CoTracker to track occluded points and points outside of the camera view. We also introduce several innovations for this class of trackers, including using token proxies that significantly improve memory efficiency and allow CoTracker to track 70k points jointly and simultaneously at inference on a single GPU. CoTracker is an online algorithm that operates causally on short windows. However, it is trained utilizing unrolled windows as a recurrent network, maintaining tracks for long periods of time even when points are occluded or leave the field of view. Quantitatively, CoTracker substantially outperforms prior trackers on standard point-tracking benchmarks. Code and model weights are available at https://co-tracker.github.io/.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 9.8MB, Terms of use)
-
- Publisher copy:
- 10.1007/978-3-031-73033-7_2
Authors
+ European Research Council
More from this funder
- Funder identifier:
- https://ror.org/0472cxd90
- Grant:
- 101001212
- Programme:
- ERC-CoG UNION
+ Engineering and Physical Sciences Research Council
More from this funder
- Funder identifier:
- https://ror.org/0439y7842
- Grant:
- EP/T028572/1
- Programme:
- VisualAI
- Publisher:
- Springer
- Host title:
- Computer Vision – ECCV 2024: 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LXII
- Pages:
- 18-35
- Series:
- Lecture Notes in Computer Science
- Series number:
- 15120
- Place of publication:
- Cham, Switzerland
- Publication date:
- 2024-10-31
- Acceptance date:
- 2024-07-01
- Event title:
- 18th European Conference on Computer Vision (ECCV 2024)
- Event location:
- Milan, Italy
- Event website:
- https://eccv.ecva.net/
- Event start date:
- 2024-09-29
- Event end date:
- 2024-10-04
- DOI:
- EISSN:
-
1611-3349
- ISSN:
-
0302-9743
- EISBN:
- 9783031730337
- ISBN:
- 9783031730320
- Language:
-
English
- Keywords:
- Pubs id:
-
2058152
- Local pid:
-
pubs:2058152
- Deposit date:
-
2024-11-29
Terms of use
- Copyright holder:
- Karaev et al.
- Copyright date:
- 2025
- Rights statement:
- © 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG.
- Notes:
- This is the accepted manuscript version of the paper. The final version is available online from Springer at https://dx.doi.org/10.1007/978-3-031-73033-7_2
If you are the owner of this record, you can report an update to it here: Report update to this record