Journal article
Learning discriminative space–time action parts from weakly labelled videos
- Abstract:
- Current state-of-the-art action classification methods aggregate space–time features globally, from the entire video clip under consideration. However, the features extracted may in part be due to irrelevant scene context, or movements shared amongst multiple action classes. This motivates learning with local discriminative parts, which can help localise which parts of the video are significant. Exploiting spatio-temporal structure in the video should also improve results, just as deformable part models have proven highly successful in object recognition. However, whereas objects have clear boundaries which means we can easily define a ground truth for initialisation, 3D space–time actions are inherently ambiguous and expensive to annotate in large datasets. Thus, it is desirable to adapt pictorial star models to action datasets without location annotation, and to features invariant to changes in pose such as bag-of-feature and Fisher vectors, rather than low-level HoG. Thus, we propose local deformable spatial bag-of-features in which local discriminative regions are split into a fixed grid of parts that are allowed to deform in both space and time at test-time. In our experimental evaluation we demonstrate that by using local space–time action parts in a weakly supervised setting, we are able to achieve state-of-the-art classification performance, whilst being able to localise actions even in the most challenging video datasets.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 522.7KB, Terms of use)
-
- Publisher copy:
- 10.1007/s11263-013-0662-8
Authors
- Publisher:
- Springer
- Journal:
- International Journal of Computer Vision More from this journal
- Volume:
- 110
- Issue:
- 1
- Pages:
- 30-47
- Publication date:
- 2013-10-13
- Acceptance date:
- 2013-09-25
- DOI:
- EISSN:
-
1573-1405
- ISSN:
-
0920-5691
- Language:
-
English
- Keywords:
- Pubs id:
-
971456
- Local pid:
-
pubs:971456
- Deposit date:
-
2024-05-20
Terms of use
- Copyright holder:
- Springer Science Business Media New York
- Copyright date:
- 2013
- Rights statement:
- Copyright © 2013, Springer Science Business Media New York
- Notes:
- This is the accepted manuscript version of the article. The final version is available online from Springer at https://dx.doi.org/10.1007/s11263-013-0662-8
If you are the owner of this record, you can report an update to it here: Report update to this record