• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

A Multi-Stage Convolution Machine with Scaling and Dilation for Human Pose Estimation

Vol. 13, No. 6, June 29, 2019
10.3837/tiis.2019.06.023, Download Paper (Free):

Abstract

Vision-based Human Pose Estimation has been considered as one of challenging research subjects due to problems including confounding background clutter, diversity of human appearances and illumination changes in scenes. To tackle these problems, we propose to use a new multi-stage convolution machine for estimating human pose. To provide better heatmap prediction of body joints, the proposed machine repeatedly produces multiple predictions according to stages with receptive field large enough for learning the long-range spatial relationship. And stages are composed of various modules according to their strategic purposes. Pyramid stacking module and dilation module are used to handle problem of human pose at multiple scales. Their multi-scale information from different receptive fields are fused with concatenation, which can catch more contextual information from different features. And spatial and channel information of a given input are converted to gating factors by squeezing the feature maps to a single numeric value based on its importance in order to give each of the network channels different weights. Compared with other ConvNet-based architectures, we demonstrated that our proposed architecture achieved higher accuracy on experiments using standard benchmarks of LSP and MPII pose datasets.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
Y. Nie, J. Lee, S. Yoon and D. S. Park, "A Multi-Stage Convolution Machine with Scaling and Dilation for Human Pose Estimation," KSII Transactions on Internet and Information Systems, vol. 13, no. 6, pp. 3182-3198, 2019. DOI: 10.3837/tiis.2019.06.023.

[ACM Style]
Yali Nie, Jaehwan Lee, Sook Yoon, and Dong Sun Park. 2019. A Multi-Stage Convolution Machine with Scaling and Dilation for Human Pose Estimation. KSII Transactions on Internet and Information Systems, 13, 6, (2019), 3182-3198. DOI: 10.3837/tiis.2019.06.023.