Conference item
Learning equivariant structured output SVM regressors
- Abstract:
- Equivariance and invariance are often desired properties of a computer vision system. However, currently available strategies generally rely on virtual sampling, leaving open the question of how many samples are necessary, on the use of invariant feature representations, which can mistakenly discard information relevant to the vision task, or on the use of latent variable models, which result in non-convex training and expensive inference at test time. We propose here a generalization of structured output SVM regressors that can incorporate equivariance and invariance into a convex training procedure, enabling the incorporation of large families of transformations, while maintaining optimality and tractability. Importantly, test time inference does not require the estimation of latent variables, resulting in highly efficient objective functions. This results in a natural formulation for treating equivariance and invariance that is easily implemented as an adaptation of off-the-shelf optimization software, obviating the need for ad hoc sampling strategies. Theoretical results relating to vicinal risk, and experiments on challenging aerial car and pedestrian detection tasks show the effectiveness of the proposed solution. © 2011 IEEE.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 447.8KB, Terms of use)
-
- Publisher copy:
- 10.1109/ICCV.2011.6126339
Authors
- Publisher:
- IEEE
- Host title:
- 2011 International Conference on Computer Vision
- Pages:
- 959-966
- Publication date:
- 2012-01-12
- Event title:
- International Conference on Computer Vision (ICCV 2012)
- Event location:
- Barcelona, Spain
- Event start date:
- 2011-11-06
- Event end date:
- 2011-11-13
- DOI:
- EISSN:
-
2380-7504
- ISSN:
-
1550-5499
- EISBN:
- 978-1-4577-1102-2
- ISBN:
- 978-1-4577-1101-5
- Language:
-
English
- Pubs id:
-
314494
- UUID:
-
uuid:885ad60b-f557-4b9d-89a4-2e1426cda4c0
- Local pid:
-
pubs:314494
- Source identifiers:
-
314494
- Deposit date:
-
2012-12-19
Terms of use
- Copyright holder:
- IEEE
- Copyright date:
- 2012
- Rights statement:
- © IEEE 2012
- Notes:
- This paper was presented at the International Conference on Computer Vision (ICCV 2012), 6th-13th November 2011, Barcelona, Spain. This is the accepted manuscript version of the article. The final version is available online from IEEE at https://dx.doi.org/10.1109/ICCV.2011.6126339
If you are the owner of this record, you can report an update to it here: Report update to this record