MoDeep: A Deep Learning Framework for Human Pose Estimation

You are here

Inside Signal Processing Newsletter Home Page

Top Reasons to Join SPS Today!

1. IEEE Signal Processing Magazine
2. Signal Processing Digital Library*
3. Inside Signal Processing Newsletter
4. SPS Resource Center
5. Career advancement & recognition
6. Discounts on conferences and publications
7. Professional networking
8. Communities for students, young professionals, and women
9. Volunteer opportunities
10. Coming soon! PDH/CEU credits
Click here to learn more.

10 years of news and resources for members of the IEEE Signal Processing Society

MoDeep: A Deep Learning Framework for Human Pose Estimation

Illustration: NYU

Illustration: NYU

Accurate identification of people’s pose in video is of great importance. Its applications include gesture-based controls such as Kinect and motion capture systems without markers. New York University researchers recently developed a deep learning architecture using both color and motion features for human pose estimation.

The deep learning framework, named MoDeep, based on a multi-resolution convolutional network, was published in a recent paper "MoDeep: A Deep Learning Framework Using Motion Features for Human Pose Estimation". The study has also proposed new motion features and created a new dataset called FLIC-motion by augmenting the Frames Labeled In Cinema (FLIC) dataset with the proposed motion features. According to the paper, MoDeep has been tested on the FLIC-motion dataset and outperforms existing state-of-the-art techniques for the task of human body pose detection in video.

For more details about MoDeep, please visit http://cs.nyu.edu/~ajain/accv2014/.

SPS on Twitter

SPS Videos


Signal Processing in Home Assistants

 


Multimedia Forensics


Careers in Signal Processing             

 


Under the Radar