Conference item
Learning to count objects in images
- Abstract:
- We propose a new supervised learning framework for visual object counting tasks, such as estimating the number of cells in a microscopic image or the number of humans in surveillance video frames. We focus on the practically-attractive case when the training images are annotated with dots (one dot per object). Our goal is to accurately estimate the count. However, we evade the hard task of learning to detect and localize individual object instances. Instead, we cast the problem as that of estimating an image density whose integral over any image region gives the count of objects within that region. Learning to infer such density can be formulated as a minimization of a regularized risk quadratic cost function. We introduce a new loss function, which is well-suited for such learning, and at the same time can be computed efficiently via a maximum subarray algorithm. The learning can then be posed as a convex quadratic program solvable with cutting-plane optimization. The proposed framework is very flexible as it can accept any domain-specific visual features. Once trained, our system provides accurate object counts and requires a very small time overhead over the feature extraction step, making it a good candidate for applications involving real-time processing or dealing with huge amount of visual data.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 501.8KB, Terms of use)
-
Authors
+ European Research Council
More from this funder
- Funder identifier:
- https://ror.org/0472cxd90
- Grant:
- 228180
- Programme:
- VisRec
- Publisher:
- Curran Associates
- Host title:
- Advances in Neural Information Processing Systems 23
- Volume:
- 1
- Pages:
- 1324 -1332
- Publication date:
- 2011-06-01
- Acceptance date:
- 2010-08-31
- Event title:
- 24th Annual Conference on Neural Information Processing Systems 2010 (NIPS 2010)
- Event location:
- Vancouver, BC, Canada
- Event website:
- https://nips.cc/Conferences/2010
- Event start date:
- 2010-12-06
- Event end date:
- 2010-12-09
- ISSN:
-
1049-5258
- ISBN:
- 9781617823800
- Language:
-
English
- Pubs id:
-
327025
- Local pid:
-
pubs:327025
- Deposit date:
-
2024-07-23
- ARK identifier:
Terms of use
- Copyright holder:
- Neural Information Processing Systems
- Copyright date:
- 2010
- Rights statement:
- © (2010) by Neural Information Processing Systems. All rights reserved.
- Notes:
- This is the accepted manuscript version of the paper. The final version is available online from Curran Associates at https://www.proceedings.com/10901.html.
If you are the owner of this record, you can report an update to it here: Report update to this record