Conference item icon

Conference item

Structured output regression for detection with partial truncation

Abstract:
We develop a structured output model for object category detection that explicitly accounts for alignment, multiple aspects and partial truncation in both training and inference. The model is formulated as large margin learning with latent variables and slack rescaling, and both training and inference are computationally efficient. We make the following contributions: (i) we note that extending the Structured Output Regression formulation of Blaschko and Lampert [1] to include a bias term significantly improves performance; (ii) that alignment (to account for small rotations and anisotropic scalings) can be included as a latent variable and efficiently determined and implemented; (iii) that the latent variable extends to multiple aspects (e.g. left facing, right facing, front) with the same formulation; and (iv), most significantly for performance, that truncated and truncated instances can be included in both training and inference with an explicit truncation mask. We demonstrate the method by training and testing on the PASCAL VOC 2007 data set - training includes the truncated examples, and in testing object instances are detected at multiple scales, alignments, and with significant truncations.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Authors

More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
ORCID:
0000-0002-8945-8573


Publisher:
Curran Associates
Host title:
Advances in Neural Information Processing Systems 22
Volume:
1
Pages:
1928-1936
Publication date:
2010-04-01
Event title:
23rd Annual Conference on Neural Information Processing Systems (NeurIPS 2009)
Event location:
Vancouver, BC, Canada
Event website:
https://neurips.cc/Conferences/2009
Event start date:
2010-12-07
Event end date:
2010-12-10
ISSN:
1049-5258
ISBN:
9781615679119


Language:
English
Pubs id:
321073
UUID:
uuid:1141203e-5718-4380-b532-7791941ea15c
Local pid:
pubs:321073
Source identifiers:
321073
Deposit date:
2013-11-17
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP