PANDA: Pose Aligned Networks for Deep Attribute Modeling

1 June 2014

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 1637-1644
https://doi.org/10.1109/cvpr.2014.212

Abstract

We propose a method for inferring human attributes (such as gender, hair style, clothes style, expression, action) from images of people under large variation of viewpoint, pose, appearance, articulation and occlusion. Convolutional Neural Nets (CNN) have been shown to perform very well on large scale object recognition problems. In the context of attribute classification, however, the signal is often subtle and it may cover only a small part of the image, while the image is dominated by the effects of pose and viewpoint. Discounting for pose variation would require training on very large labeled datasets which are not presently available. Part-based models, such as poselets [4] and DPM [12] have been shown to perform well for this problem but they are limited by shallow low-level features. We propose a new method which combines part-based models and deep learning by training pose-normalized CNNs. We show substantial improvement vs. state-of-the-art methods on challenging attribute classification tasks in unconstrained settings. Experiments confirm that our method outperforms both the best part-based methods on this problem and conventional CNNs trained on the full bounding box of the person.

Keywords

Other Versions

Version 2, 2013-11-21, preprints

This publication has 18 references indexed in Scilit:

Human Attribute Recognition by Rich Appearance Dictionary
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
Relative attributes
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2011
Birdlets: Subordinate categorization using volumetric primitives and pose-normalized appearance
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2011
Object Detection with Discriminatively Trained Part-Based Models
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2009
What is the best multi-stage architecture for object recognition?
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2009
Attribute and simile classifiers for face verification
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2009
Learning to detect unseen object classes by between-class attribute transfer
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2009
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2006
Gradient-based learning applied to document recognition
Proceedings of the IEEE, 1998
Backpropagation Applied to Handwritten Zip Code Recognition
Neural Computation, 1989

Cited by 352 articles