End-to-end learning of deformable mixture of parts and deep convolutional neural networks for human pose estimation
Refereed conference paper presented and published in conference proceedings

Full Text

Times Cited

Other information
AbstractRecently, Deep Convolutional Neural Networks (DC-NNs) have been applied to the task of human pose estimation, and have shown its potential of learning better feature representations and capturing contextual relationships. However, it is difficult to incorporate domain prior knowledge such as geometric relationships among body parts into DCNNs. In addition, training DCNN-based body part detectors without consideration of global body joint consistency introduces ambiguities, which increases the complexity of training. In this paper, we propose a novel end-to-end framework for human pose estimation that combines DC-NNs with the expressive deformable mixture of parts. We explicitly incorporate domain prior knowledge into the framework, which greatly regularizes the learning process and enables the flexibility of our framework for loopy models or tree-structured models. The effectiveness of jointly learning a DCNN with a deformable mixture of parts model is evaluated through intensive experiments on several widely used benchmarks. The proposed approach significantly improves the performance compared with state-of-the-art approaches, especially on benchmarks with challenging articulations.
All Author(s) ListYang W., Ouyang W., Li H., Wang X.
Name of Conference2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016
Start Date of Conference26/06/2016
End Date of Conference01/07/2016
Place of ConferenceLas Vegas
Country/Region of ConferenceUnited States of America
Volume Number2016-January
Pages3073 - 3082
LanguagesEnglish-United Kingdom

Last updated on 2020-02-09 at 01:25