Repository logo
 
Loading...
Thumbnail Image
Publication

Human Pose Estimation by a Series of Residual Auto-Encoders

Use this identifier to reference this record.
Name:Description:Size:Format: 
13271.pdf952.55 KBAdobe PDF Download

Advisor(s)

Abstract(s)

Pose estimation is the task of predicting the pose of an object in an image or in a sequence of images. Here, we focus on articulated human pose estimation in scenes with a single person. We employ a series of residual auto-encoders to produce multiple predictions which are then combined to provide a heatmap prediction of body joints. In this network topology, features are processed across all scales which captures the various spatial relationships associated with the body. Repeated bottom-up and top-down processing with intermediate supervision for each auto-encoder network is applied. We propose some improvements to this type of regression-based networks to further increase performance, namely: (a) increase the number of parameters of the auto-encoder networks in the pipeline, (b) use stronger regularization along with heavy data augmentation, (c) use sub-pixel precision for more precise joint localization, and (d) combine all auto-encoders output heatmaps into a single prediction, which further increases body joint prediction accuracy. We demonstrate state-of-the-art results on the popular FLIC and LSP datasets.

Description

Keywords

Citation

Organizational Units

Journal Issue

Publisher

Springer International Publishing Ag

Collections

CC License

Altmetrics

Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 1
  • Captures
    • Readers: 8
see details