Conference item
Learning discriminative space-time actions from weakly labelled videos
- Abstract:
- Current state-of-the-art action classification methods extract feature representations from the entire video clip in which the action unfolds, however this representation may include irrelevant scene context and movements which are shared amongst multiple action classes. For example, a waving action may be performed whilst walking, however if the walking movement and scene context appear in other action classes, then they should not be included in a waving movement classifier. In this work, we propose an action classification framework in which more discriminative action subvolumes are learned in a weakly supervised setting, owing to the difficulty of manually labelling massive video datasets. The learned models are used to simultaneously classify video clips and to localise actions to a given space-time subvolume. Each subvolume is cast as a bag-offeatures (BoF) instance in a multiple-instance-learning framework, which in turn is used to learn its class membership. We demonstrate quantitatively that even with single fixedsized subvolumes, the classification performance of our proposed algorithm is superior to the state-of-the-art BoF baseline on the majority of performance measures, and shows promise for space-time action localisation on the most challenging video datasets.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 501.3KB, Terms of use)
-
- Publisher copy:
- 10.5244/c.26.123
Authors
- Publisher:
- British Machine Vision Association and Society for Pattern Recognition
- Host title:
- Electronic Proceedings of the British Machine Vision Conference 2012
- Pages:
- 123.1-123.12
- Publication date:
- 2012-09-03
- Event title:
- British Machine Vision Conference 2012 (BMVC 2012)
- Event location:
- Guildford, Surrey
- Event website:
- https://bmva-archive.org.uk/bmvc/2012/index.html
- Event start date:
- 2012-09-03
- Event end date:
- 2012-09-07
- DOI:
- EISBN:
- 1901725464
- Language:
-
English
- Pubs id:
-
971464
- Local pid:
-
pubs:971464
- Deposit date:
-
2024-05-17
- ARK identifier:
Terms of use
- Copyright holder:
- Sapienza et al.
- Copyright date:
- 2012
- Rights statement:
- © 2012. The copyright of this document resides with its authors. It may be distributed unchanged freely in print or electronic forms.
If you are the owner of this record, you can report an update to it here: Report update to this record