Conference item
Face, body, voice: video person-clustering with multiple modalities
- Abstract:
-
The objective of this work is person-clustering in videos – grouping characters according to their identity. Previous methods focus on the narrower task of face-clustering, and for the most part ignore other cues such as the person’s voice, their overall appearance (hair, clothes, posture), and the editing structure of the videos. Similarly, most current datasets evaluate only the task of face-clustering, rather than person-clustering. This limits their applicability to downstream application...
Expand abstract
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Authors
Bibliographic Details
- Publisher:
- IEEE Publisher's website
- Host title:
- 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)
- Pages:
- 3177-3187
- Publication date:
- 2021-11-24
- Acceptance date:
- 2021-07-22
- Event title:
- 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)
- Event location:
- Virtual Event
- Event website:
- https://iccv2021.thecvf.com/home
- Event start date:
- 2021-10-11
- Event end date:
- 2021-10-17
- DOI:
- EISSN:
-
2473-9944
- ISSN:
-
2473-9936
- EISBN:
- 978-1-6654-0191-3
- ISBN:
- 978-1-6654-0192-0
Item Description
- Language:
- English
- Keywords:
- Pubs id:
-
1233021
- Local pid:
- pubs:1233021
- Deposit date:
- 2022-01-19
Terms of use
- Copyright holder:
- IEEE
- Copyright date:
- 2021
- Rights statement:
- © 2021 IEEE
- Notes:
- This is the accepted manuscript version of the paper. The final version is available online from IEEE at https://doi.org/10.1109/ICCVW54120.2021.00357
Metrics
If you are the owner of this record, you can report an update to it here: Report update to this record