MoDeep: A Deep Learning Framework for Human Pose Estimation

November 2014


Accurately identifying people's poses in video is important for applications such as gesture-based control (for example, Kinect) and markerless motion capture. Researchers at New York University recently developed a deep learning architecture that uses both color and motion features for human pose estimation. The framework, named MoDeep, is based on a multi-resolution convolutional network and is described in the recent paper "MoDeep: A Deep Learning Framework Using Motion Features for Human Pose Estimation". The study also proposes new motion features and introduces a new dataset, FLIC-motion, created by augmenting the Frames Labeled In Cinema (FLIC) dataset with those motion features. According to the paper, MoDeep was tested on FLIC-motion and outperforms existing state-of-the-art techniques for human body pose detection in video. For more details about MoDeep, visit http://cs.nyu.edu/~ajain/accv2014/.
