Conference item
Counting in the wild
- Abstract:
- In this paper we explore the scenario of learning to count multiple instances of objects from images that have been dot-annotated through crowdsourcing. Specifically, we work with a large and challenging image dataset of penguins in the wild, for which tens of thousands of volunteer annotators have placed dots on instances of penguins in tens of thousands of images. The dataset, introduced and released with this paper, shows such a high-degree of object occlusion and scale variation that individual object detection or simple counting-density estimation is not able to estimate the bird counts reliably. To address the challenging counting task, we augment and interleave density estimation with foreground-background segmentation and explicit local uncertainty estimation. The three tasks are solved jointly by a new deep multi-task architecture. Using this multi-task learning, we show that the spread between the annotators can provide hints about local object scale and aid the foreground-background segmentation, which can then be used to set a better target density for learning density prediction. Considerable improvements in counting accuracy over a single-task density estimation approach are observed in our experiments.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 4.9MB, Terms of use)
-
Authors
- Publisher:
- Springer, Cham
- Host title:
- ECCV 2016: Computer Vision – ECCV 2016
- Journal:
- ECCV 2016: Computer Vision – ECCV 2016 More from this journal
- Pages:
- 483-498
- Series:
- Lecture Notes in Computer Science (Volume 9911)
- Publication date:
- 2014-03-24
- Acceptance date:
- 2016-03-14
- Event location:
- Amsterdam
- Keywords:
- Pubs id:
-
pubs:656447
- UUID:
-
uuid:9bb7cbd9-42fc-416a-9cca-dd07c25f9fb1
- Local pid:
-
pubs:656447
- Deposit date:
-
2016-11-01
- ARK identifier:
Terms of use
- Copyright holder:
- Springer International Publishing AG
- Copyright date:
- 2014
- Notes:
-
This is an
accepted manuscript of a book chapter published by Springer in ECCV 2016: Computer Vision – ECCV 2016 on 2016-09-16, available online: http://dx.doi.org/10.1007/978-3-319-46478-7_30
If you are the owner of this record, you can report an update to it here: Report update to this record