Conference item icon

Conference item

Lip reading in the wild

Abstract:

Our aim is to recognise the words being spoken by a talking face, given only the video but not the audio. Existing works in this area have focussed on trying to recognise a small number of utterances in controlled environments (e.g. digits and alphabets), partially due to the shortage of suitable datasets.


We make two novel contributions: first, we develop a pipeline for fully automated large-scale data collection from TV broadcasts. With this we have generated a dataset with ...

Expand abstract
Publication status:
Published
Peer review status:
Peer reviewed
Version:
Accepted Manuscript

Actions


Access Document


Files:
Publisher copy:
10.1007/978-3-319-54184-6_6

Authors


More by this author
Department:
Oxford, MPLS, Engineering Science
More by this author
Department:
Oxford, MPLS, Engineering Science
Publisher:
Springer, Cham Publisher's website
Volume:
10112
Pages:
87-103
Publication date:
2017
Acceptance date:
2016-05-27
DOI:
ISSN:
0302-9743
Pubs id:
pubs:656449
URN:
uri:c3238375-ec8b-4ecd-9543-8b179a6b74ba
UUID:
uuid:c3238375-ec8b-4ecd-9543-8b179a6b74ba
Local pid:
pubs:656449
ISBN:
978-3-319-54183-9

Terms of use


Metrics



If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP