Conference item icon

Conference item

High five: recognising human interactions in TV shows

Abstract:
In this paper we address the problem of recognising interactions between two people in realistic scenarios for video retrieval purposes. We develop a per-person descriptor that uses attention (head orientation) and the local spatial and temporal context in a neighbourhood of each detected person. Using head orientation mitigates camera view ambiguities, while the local context, comprised of histograms of gradients and motion, aims to capture cues such as hand and arm movement. We also employ structured learning to capture spatial relationships between interacting individuals.
We train an initial set of one-vs-the-rest linear SVM classifiers, one for each interaction, using this descriptor. Noting that people generally face each other while interacting, we learn a structured SVM that combines head orientation and the relative location of people in a frame to improve upon the initial classification obtained with our descriptor. To test the efficacy of our method, we have created a new dataset of realistic human interactions comprised of clips extracted from TV shows, which represents a very difficult challenge. Our experiments show that using structured learning improves the retrieval results compared to using the interaction classifiers independently
Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Publication website:
https://bmva-archive.org.uk/bmvc/2010/conference/paper50/index.html

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Oxford college:
Brasenose College
Role:
Author
ORCID:
0000-0002-8945-8573
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author


More from this funder
Funder identifier:
https://ror.org/0472cxd90
Grant:
228180
Programme:
VisRec
More from this funder
Funder identifier:
https://ror.org/0439y7842
Grant:
EP/D037077/1


Publisher:
British Machine Vision Association
Host title:
Proceedings of the 21st British Machine Vision Conference (BMVC 2010)
Pages:
50.1-50.11
Publication date:
2010-08-31
Event title:
21st British Machine Vision Conference (BMVC 2010)
Event location:
Aberystwyth, Wales
Event website:
https://bmva-archive.org.uk/bmvc/2010/index.html
Event start date:
2010-08-31
Event end date:
2010-09-03
ISBN:
1901725405


Language:
English
Pubs id:
327023
Local pid:
pubs:327023
Deposit date:
2024-07-23

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP