Conference item icon

Conference item

Return of the devil in the details: delving deep into convolutional nets

Abstract:
The latest generation of Convolutional Neural Networks (CNN) have achieved impressive results in challenging benchmarks on image recognition and object detection, significantly raising the interest of the community in these methods. Nevertheless, it is still unclear how different CNN methods compare with each other and with previous state-of-the-art shallow representations such as the Bag-of-Visual-Words and the Improved Fisher Vector. This paper conducts a rigorous evaluation of these new techniques, exploring different deep architectures and comparing them on a common ground, identifying and disclosing important implementation details. We identify several useful properties of CNN-based representations, including the fact that the dimensionality of the CNN output layer can be reduced significantly without having an adverse effect on performance. We also identify aspects of deep and shallow methods that can be successfully shared. In particular, we show that the data augmentation techniques commonly applied to CNN-based methods can also be applied to shallow methods, and result in an analogous performance boost. Source code and models to reproduce the experiments in the paper is made publicly available.
Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Files:
Publication website:
https://bmva-archive.org.uk/bmvc/2014/papers/paper054/index.html

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Oxford college:
New College
Role:
Author
ORCID:
0000-0003-1374-2858
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Oxford college:
Brasenose College
Role:
Author
ORCID:
0000-0002-8945-8573


More from this funder
Funder identifier:
https://ror.org/0472cxd90
Grant:
228180


Publisher:
British Machine Vision Association and Society for Pattern Recognition
Host title:
Proceedings of the British Machine Vision Conference 2014
Pages:
6.1-6.12
Article number:
54
Publication date:
2014-09-05
Acceptance date:
2014-06-30
Event title:
25th British Machine Vision Conference (BMVC 2014)
Event location:
Nottingham, UK
Event website:
https://bmva-archive.org.uk/bmvc/2014/index.html
Event start date:
2014-09-01
Event end date:
2014-09-05
ISBN:
1901725529


Language:
English
Pubs id:
1119459
Local pid:
pubs:1119459
Deposit date:
2024-07-12

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP