Conference item

Out of time: automated lip sync in the wild

Abstract:: The goal of this work is to determine the audio-video synchronisation between mouth motion and speech in a video.

We propose a two-stream ConvNet architecture that enables the mapping between the sound and the mouth images to be trained end-to-end from unlabelled data. The trained network is used to determine the lip-sync error in a video.
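
As an illustration only, one way such a trained network could be used to estimate lip-sync error is to compare per-frame embeddings from the two streams over a range of candidate time shifts and pick the shift with the smallest average distance. This record does not describe the procedure, so the sketch below is an assumption: the function names `mean_distance` and `estimate_offset`, the Euclidean distance, the search range, and the toy embeddings are all hypothetical and are not taken from the paper.

```python
import math

def mean_distance(video, audio, offset):
    """Mean Euclidean distance between video[t] and audio[t + offset] over valid t."""
    dists = []
    for t, v in enumerate(video):
        k = t + offset
        if 0 <= k < len(audio):
            dists.append(math.dist(v, audio[k]))
    # No overlapping frames at this offset: treat it as infinitely bad.
    return sum(dists) / len(dists) if dists else math.inf

def estimate_offset(video, audio, max_shift):
    """Return the shift in [-max_shift, max_shift] with the smallest mean distance."""
    candidates = range(-max_shift, max_shift + 1)
    return min(candidates, key=lambda s: mean_distance(video, audio, s))

# Toy embeddings (hypothetical): the audio stream is the video stream delayed by 2 steps.
video = [[float(t), float(t * t % 7)] for t in range(20)]
audio = [[0.0, 0.0], [0.0, 0.0]] + video[:-2]

offset = estimate_offset(video, audio, max_shift=5)
print(offset)  # 2
```

In this toy case the distance is exactly zero at the true shift. With real embeddings the minimum would be noisy, so averaging over many frames, as done here, is one plausible way to stabilise the estimate.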

We apply the network to two further tasks: active speaker detection and lip reading. On both tasks we set a new state-of-the-art on standard benchmark datasets.

Publication status:: Published

Peer review status:: Peer reviewed


Cite this record

APA Style

Chung, J., & Zisserman, A. (2017). Out of time: automated lip sync in the wild.

MLA Style

Chung, J, and A Zisserman. “Out of Time: Automated Lip Sync in the Wild.” 2017.

Chicago Style

Chung, J, and A Zisserman. 2017. “Out of Time: Automated Lip Sync in the Wild.”

Access Document

Files:: Chung and Zisserman, Out of time - automated lip sync in the w...

(Accepted manuscript, pdf, 3.2MB)

Publisher copy:: 10.1007/978-3-319-54427-4_19

Authors

Chung, J

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author

Zisserman, A

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author

Funder:: Engineering and Physical Sciences Research Council

Grant:: EP/M013774/1

Publisher:: Springer
Host title:: Workshop on Multi-view Lip-reading, 13th Asian Conference on Computer Vision (ACCV 2016)
Journal:: 13th Asian Conference on Computer Vision
Publication date:: 2017-03-01
Acceptance date:: 2016-05-27
Event location:: Taipei
DOI:: 10.1007/978-3-319-54427-4_19

Pubs id:: pubs:656453
UUID:: uuid:6bdd4768-6fbd-40ac-8efc-edca8a0325b3
Local pid:: pubs:656453
Source identifiers:: 656453
Deposit date:: 2016-11-01
ARK identifier:: ark:/29072/ora_6bdd47686fbd40ac8efcedca8a0325b3

Terms of use

Copyright holder:: Springer International Publishing AG
Copyright date:: 2017
Notes:: © Springer International Publishing AG 2017

Licence:: Terms and Conditions of Use for Oxford University Research Archive
