Conference item icon

Conference item

Recurrent human pose estimation

Abstract:
We propose a ConvNet model for predicting 2D human body poses in an image. The model regresses a heatmap representation for each body keypoint, and is able to learn and represent both the part appearances and the context of the part configuration. We make the following three contributions: (i) an architecture combining a feed forward module with a recurrent module, where the recurrent module can be run iteratively to improve the performance; (ii) the model can be trained end-to-end and from scratch, with auxiliary losses incorporated to improve performance; (iii) we investigate whether keypoint visibility can also be predicted. The model is evaluated on two benchmark datasets. The result is a simple architecture that achieves performance on par with the state of the art, but without the complexity of a graphical model stage (or layers).
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Publisher copy:
10.1109/FG.2017.64

Authors

More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Oxford college:
Brasenose College
Role:
Author


Publisher:
Institute of Electrical and Electronics Engineers
Host title:
12th IEEE Conference on Automatic Face and Gesture Recognition
Journal:
12th IEEE Conference on Automatic Face and Gesture Recognition More from this journal
Publication date:
2017-06-01
Acceptance date:
2016-10-10
Event location:
Washington DC
DOI:


Pubs id:
pubs:684068
UUID:
uuid:f28ba2c9-b4fb-41b6-9927-79f38b4ce388
Local pid:
pubs:684068
Source identifiers:
684068
Deposit date:
2017-03-06
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP