Conference item icon

Conference item

Multiple kernels for object detection

Abstract:

Our objective is to obtain a state-of-the art object category detector by employing a state-of-the-art image classifier to search for the object in all possible image sub-windows. We use multiple kernel learning of Varma and Ray (ICCV 2007) to learn an optimal combination of exponential χ2 kernels, each of which captures a different feature channel. Our features include the distribution of edges, dense and sparse visual words, and feature descriptors at different levels of spatial organization. Such a powerful classifier cannot be tested on all image sub-windows in a reasonable amount of time. Thus we propose a novel three-stage classifier, which combines linear, quasi-linear, and non-linear kernel SVMs. We show that increasing the non-linearity of the kernels increases their discriminative power, at the cost of an increased computational complexity. Our contributions include (i) showing that a linear classifier can be evaluated with a complexity proportional to the number of sub-windows (independent of the sub-window area and descriptor dimension); (ii) a comparison of three efficient methods of proposing candidate regions (including the jumping window classifier of Chum and Zisserman (CVPR 2007) based on proposing windows from scale invariant features); and (Hi) introducing overlap-recall curves as a mean to compare and optimize the performance of the intermediate pipeline stages. The method is evaluated on the PASCAL Visual Object Detection Challenge, and exceeds the performances of previously published methods for most of the classes.

Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Files:
Publisher copy:
10.1109/iccv.2009.5459183

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Oxford college:
New College
Role:
Author
ORCID:
0000-0003-1374-2858
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Oxford college:
Brasenose College
Role:
Author
ORCID:
0000-0002-8945-8573


More from this funder
Funder identifier:
https://ror.org/0472cxd90
Grant:
228180
More from this funder
Funder identifier:
https://ror.org/0526snb40


Publisher:
IEEE
Host title:
2009 IEEE 12th International Conference on Computer Vision
Pages:
606-613
Publication date:
2010-05-06
Event title:
12th IEEE International Conference on Computer Vision (ICCV 2009)
Event location:
Kyoto, Japan
Event website:
https://www.computer.org/csdl/proceedings/iccvw/2009/12OmNxwWorE
Event start date:
2009-09-27
Event end date:
2009-10-04
DOI:
EISSN:
2380-7504
ISSN:
1550-5499
ISBN:
9781424444205


Language:
English
Keywords:
Pubs id:
192845
Local pid:
pubs:192845
Deposit date:
2024-07-23

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP