Conference item
Learnable PINs: Cross-modal embeddings for person identity
- Abstract:
- We propose and investigate an identity-sensitive joint embedding of face and voice. Such an embedding enables cross-modal retrieval from voice to face and from face to voice. We make the following four contributions: first, we show that the embedding can be learnt from videos of talking faces, without requiring any identity labels, using a form of cross-modal self-supervision; second, we develop a curriculum learning schedule for hard negative mining targeted to this task that is essential for learning to proceed successfully; third, we demonstrate and evaluate cross-modal retrieval for identities unseen and unheard during training over a number of scenarios and establish a benchmark for this novel task; finally, we show an application of using the joint embedding for automatically retrieving and labelling characters in TV dramas.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Access Document
- Files:
- Accepted manuscript (pdf, 3.0MB)
- Publisher copy:
- 10.1007/978-3-030-01261-8_5
Authors
- Publisher:
- Springer
- Host title:
- Lecture Notes in Computer Science
- Journal:
- Lecture Notes in Computer Science
- Volume:
- 11217
- Pages:
- 73-89
- Publication date:
- 2018-10-06
- Acceptance date:
- 2018-07-03
- DOI:
- 10.1007/978-3-030-01261-8_5
- ISSN:
- 1611-3349 and 0302-9743
- ISBN:
- 9783030012601
- Pubs id:
- pubs:941298
- UUID:
- uuid:0ef631f1-97c5-4b3e-b913-6e3115cad81a
- Local pid:
- pubs:941298
- Source identifiers:
- 941298
- Deposit date:
- 2018-11-20
Terms of use
- Copyright holder:
- Springer Nature
- Copyright date:
- 2018
- Notes:
- © Springer Nature Switzerland AG 2018. This paper was presented at the European Conference on Computer Vision 2018. This is the accepted manuscript version of the article. The final version is available online from Springer at: 10.1007/978-3-030-01261-8_5