Conference item
Open-world object counting in videos
- Abstract:
- We introduce a new task of open-world object counting in videos: given a text description, or an image example, that specifies the target object, the objective is to enumerate all the unique instances of the target objects in the video. This task is especially challenging in crowded scenes with occlusions and objects of similar appearance, where avoiding double counting and identifying reappearances is crucial. To this end, we make the following contributions: we introduce a model, COUNTVID, for this task. It leverages an imagebased counting model, and a promptable video segmentation and tracking model, to enable automated open-world object counting across video frames. To evaluate its performance, we introduce VIDEOCOUNT, a new dataset for this novel task built from the TAO and MOT20 tracking datasets, as well as from videos of penguins and metal alloy crystallization captured by x-rays. Using this dataset, we demonstrate that COUNTVID provides accurate object counts, and significantly outperforms strong baselines.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 4.3MB, Terms of use)
-
- Publisher copy:
- 10.1609/aaai.v40i4.37214
Authors
+ Engineering and Physical Sciences Research Council
More from this funder
- Funder identifier:
- https://ror.org/0439y7842
- Grant:
- EP/T028572/1
- Publisher:
- American Association for the Advancement of Science
- Host title:
- Proceedings of the 40th Annual AAAI Conference on Artificial Intelligence
- Journal:
- Proceedings of the AAAI Conference on Artificial Intelligence More from this journal
- Volume:
- 40
- Issue:
- 4
- Pages:
- 2300-2308
- Publication date:
- 2026-03-14
- Acceptance date:
- 2025-11-07
- Event title:
- 40th Annual AAAI Conference on Artificial Intelligence (AAAI 2026)
- Event location:
- Singapore
- Event website:
- https://aaai.org/conference/aaai/aaai-26/
- Event start date:
- 2026-01-20
- Event end date:
- 2026-01-27
- DOI:
- ISSN:
-
2159-5399
- ISBN-10:
- 1577359062
- ISBN-13:
- 9781577359067
- Language:
-
English
- Pubs id:
-
2357761
- Local pid:
-
pubs:2357761
- Deposit date:
-
2026-01-12
- ARK identifier:
Terms of use
- Copyright holder:
- Association for the Advancement of Artificial Intelligence
- Copyright date:
- 2026
- Rights statement:
- © 2026, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
- Notes:
- This paper was presented at the 40th Annual AAAI Conference on Artificial Intelligence (AAAI 2026), 20th-27th January 2026, Singapore. The author accepted manuscript (AAM) of this paper has been made available under the University of Oxford's Open Access Publications Policy, and a CC BY public copyright licence has been applied.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record