Conference item icon

Conference item

Open-world object counting in videos

Abstract:
We introduce a new task of open-world object counting in videos: given a text description, or an image example, that specifies the target object, the objective is to enumerate all the unique instances of the target objects in the video. This task is especially challenging in crowded scenes with occlusions and objects of similar appearance, where avoiding double counting and identifying reappearances is crucial. To this end, we make the following contributions: we introduce a model, COUNTVID, for this task. It leverages an imagebased counting model, and a promptable video segmentation and tracking model, to enable automated open-world object counting across video frames. To evaluate its performance, we introduce VIDEOCOUNT, a new dataset for this novel task built from the TAO and MOT20 tracking datasets, as well as from videos of penguins and metal alloy crystallization captured by x-rays. Using this dataset, we demonstrate that COUNTVID provides accurate object counts, and significantly outperforms strong baselines.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Publisher copy:
10.1609/aaai.v40i4.37214

Authors

More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
ORCID:
0000-0002-8945-8573


More from this funder
Funder identifier:
https://ror.org/0439y7842
Grant:
EP/T028572/1


Publisher:
American Association for the Advancement of Science
Host title:
Proceedings of the 40th Annual AAAI Conference on Artificial Intelligence
Journal:
Proceedings of the AAAI Conference on Artificial Intelligence More from this journal
Volume:
40
Issue:
4
Pages:
2300-2308
Publication date:
2026-03-14
Acceptance date:
2025-11-07
Event title:
40th Annual AAAI Conference on Artificial Intelligence (AAAI 2026)
Event location:
Singapore
Event website:
https://aaai.org/conference/aaai/aaai-26/
Event start date:
2026-01-20
Event end date:
2026-01-27
DOI:
ISSN:
2159-5399
ISBN-10:
1577359062
ISBN-13:
9781577359067


Language:
English
Pubs id:
2357761
Local pid:
pubs:2357761
Deposit date:
2026-01-12
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP