Conference item icon

Conference item

Compact deep aggregation for set retrieval

Abstract:

The objective of this work is to learn a compact embedding of a set of descriptors that is suitable for efficient retrieval and ranking, whilst maintaining discriminability of the individual descriptors. We focus on a specific example of this general problem – that of retrieving images containing multiple faces from a large scale dataset of images. Here the set consists of the face descriptors in each image, and given a query for multiple identities, the goal is then to retrieve, in order, images which contain all the identities, all but one, etc.

To this end, we make the following contributions: first, we propose a CNN architecture – SetNet – to achieve the objective: it learns face descriptors and their aggregation over a set to produce a compact fixed length descriptor designed for set retrieval, and the score of an image is a count of the number of identities that match the query; second, we show that this compact descriptor has minimal loss of discriminability up to two faces per image, and degrades slowly after that – far exceeding a number of baselines; third, we explore the speed vs. retrieval quality trade-off for set retrieval using this compact descriptor; and, finally, we collect and annotate a large dataset of images containing various number of celebrities, which we use for evaluation and will be publicly released.

Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publisher copy:
10.1007/978-3-030-11018-5_36

Authors

More by this author
Institution:
University of Oxford
Division:
MPLS Division
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Oxford college:
Brasenose College
Role:
Author
ORCID:
0000-0002-8945-8573


Publisher:
Springer Verlag
Host title:
European Conference on Computer Vision, 2018, Munich, Germany, September 8-14, 2018, Proceedings, Part IV
Journal:
European Conference on Computer Vision, 2018 More from this journal
Volume:
11132
Pages:
413-430
Series:
Lecture Notes in Computer Science
Publication date:
2019-01-23
Acceptance date:
2018-09-09
DOI:
ISBN:
9783030110178


Pubs id:
pubs:950921
UUID:
uuid:99c6b403-94b2-47f3-b8bb-8935d1608b82
Local pid:
pubs:950921
Source identifiers:
950921
Deposit date:
2018-12-06
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP