Conference item icon

Conference item

Learning equivariant structured output SVM regressors

Abstract:
Equivariance and invariance are often desired properties of a computer vision system. However, currently available strategies generally rely on virtual sampling, leaving open the question of how many samples are necessary, on the use of invariant feature representations, which can mistakenly discard information relevant to the vision task, or on the use of latent variable models, which result in non-convex training and expensive inference at test time. We propose here a generalization of structured output SVM regressors that can incorporate equivariance and invariance into a convex training procedure, enabling the incorporation of large families of transformations, while maintaining optimality and tractability. Importantly, test time inference does not require the estimation of latent variables, resulting in highly efficient objective functions. This results in a natural formulation for treating equivariance and invariance that is easily implemented as an adaptation of off-the-shelf optimization software, obviating the need for ad hoc sampling strategies. Theoretical results relating to vicinal risk, and experiments on challenging aerial car and pedestrian detection tasks show the effectiveness of the proposed solution. © 2011 IEEE.
Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Files:
Publisher copy:
10.1109/ICCV.2011.6126339

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
ORCID:
0000-0002-8945-8573


Publisher:
IEEE
Host title:
2011 International Conference on Computer Vision
Pages:
959-966
Publication date:
2012-01-12
Event title:
International Conference on Computer Vision (ICCV 2012)
Event location:
Barcelona, Spain
Event start date:
2011-11-06
Event end date:
2011-11-13
DOI:
EISSN:
2380-7504
ISSN:
1550-5499
EISBN:
978-1-4577-1102-2
ISBN:
978-1-4577-1101-5


Language:
English
Pubs id:
314494
UUID:
uuid:885ad60b-f557-4b9d-89a4-2e1426cda4c0
Local pid:
pubs:314494
Source identifiers:
314494
Deposit date:
2012-12-19

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP