Conference item

Face, body, voice: video person-clustering with multiple modalities

Abstract:

The objective of this work is person-clustering in videos – grouping characters according to their identity. Previous methods focus on the narrower task of face-clustering, and for the most part ignore other cues such as the person’s voice, their overall appearance (hair, clothes, posture), and the editing structure of the videos. Similarly, most current datasets evaluate only the task of face-clustering, rather than person-clustering. This limits their applicability to downstream application...

Publication status:
Published
Peer review status:
Peer reviewed

Publisher copy:
10.1109/iccvw54120.2021.00357

Authors


Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Oxford college:
Brasenose College
Role:
Author
ORCID:
0000-0002-8945-8573
Publisher:
IEEE
Host title:
2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)
Pages:
3177-3187
Publication date:
2021-11-24
Acceptance date:
2021-07-22
Event title:
2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)
Event location:
Virtual Event
Event website:
https://iccv2021.thecvf.com/home
Event start date:
2021-10-11
Event end date:
2021-10-17
EISSN:
2473-9944
ISSN:
2473-9936
EISBN:
978-1-6654-0191-3
ISBN:
978-1-6654-0192-0
Language:
English
Pubs id:
1233021
Local pid:
pubs:1233021
Deposit date:
2022-01-19
