Journal article icon

Journal article

Learning an alphabet of shape and appearance for multi-class object detection

Abstract:
We present a novel algorithmic approach to object categorization and detection that can learn category specific detectors, using Boosting, from a visual alphabet of shape and appearance. The alphabet itself is learnt incrementally during this process. The resulting representation consists of a set of category-specific descriptors—basic shape features are represented by boundary-fragments, and appearance is represented by patches—where each descriptor in combination with centroid vectors for possible object centroids (geometry) forms an alphabet entry. Our experimental results highlight several qualities of this novel representation. First, we demonstrate the power of purely shape-based representation with excellent categorization and detection results using a Boundary-Fragment-Model (BFM), and investigate the capabilities of such a model to handle changes in scale and viewpoint, as well as intra- and inter-class variability. Second, we show that incremental learning of a BFM for many categories leads to a sub-linear growth of visual alphabet entries by sharing of shape features, while this generalization over categories at the same time often improves categorization performance (over independently learning the categories). Finally, the combination of basic shape and appearance (boundary-fragments and patches) features can further improve results. Certain feature types are preferred by certain categories, and for some categories we achieve the lowest error rates that have been reported so far.
Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Publisher copy:
10.1007/s11263-008-0139-3

Authors


More by this author
Institution:
Graz University of Technology, Graz, Austria
Department:
Institute of Electrical Measurement and Measurement Signal Processing
Role:
Author
More by this author
Institution:
Graz University of Technology, Graz, Austria
Department:
Institute of Electrical Measurement and Measurement Signal Processing
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author


Publisher:
Springer US
Journal:
International Journal of Computer Vision More from this journal
Volume:
80
Issue:
1
Pages:
16-44
Publication date:
2008-01-01
Edition:
Publisher's version
DOI:
EISSN:
1573-1405
ISSN:
0920-5691


Language:
English
Keywords:
Subjects:
UUID:
uuid:5de09618-b933-444f-86f7-b48bf0cb98b1
Local pid:
ora:5837
Deposit date:
2011-10-31

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP