Birdlets: Subordinate categorization using volumetric primitives and pose-normalized appearance

1 November 2011

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 161-168
https://doi.org/10.1109/iccv.2011.6126238

Abstract

Subordinate-level categorization typically rests on establishing salient distinctions between part-level characteristics of objects, in contrast to basic-level categorization, where the presence or absence of parts is determinative. We develop an approach for subordinate categorization in vision, focusing on an avian domain due to the fine-grained structure of the category taxonomy for this domain. We explore a pose-normalized appearance model based on a volumetric poselet scheme. The variation in shape and appearance properties of these parts across a taxonomy provides the cues needed for subordinate categorization. Training pose detectors requires a relatively large amount of training data per category when done from scratch; using a subordinate-level approach, we exploit a pose classifier trained at the basic level, and extract part appearance and shape information to build subordinate-level models. Our model associates the underlying image pattern parameters used for detection with corresponding volumetric part location, scale and orientation parameters. These parameters implicitly define a mapping from the image pixels into a pose-normalized appearance space, which removes view and pose dependencies and facilitates fine-grained categorization from relatively few training examples.
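
The mapping from image pixels into a pose-normalized space can be illustrated with a minimal sketch. It is a deliberate simplification and not the paper's method: it assumes a part is described by a planar center, scale, and in-plane angle, whereas the abstract refers to volumetric part parameters. The function name `pose_normalize` and all of its parameters are hypothetical, chosen only for this example.

```python
import numpy as np

def pose_normalize(image, center, scale, angle, out_size=8):
    """Resample one part into a canonical out_size x out_size patch.

    Hypothetical 2D stand-in for a pose-normalized appearance mapping.
    center: (x, y) of the part in image pixels
    scale:  image pixels per canonical-patch pixel
    angle:  in-plane part orientation in radians
    """
    half = (out_size - 1) / 2.0
    # Canonical grid centered on the origin, scaled to image units.
    v, u = np.mgrid[0:out_size, 0:out_size]
    u = (u - half) * scale
    v = (v - half) * scale
    # Rotate canonical coordinates into the image frame, then translate
    # them to the detected part center.
    c, s = np.cos(angle), np.sin(angle)
    x = center[0] + c * u - s * v
    y = center[1] + s * u + c * v
    # Nearest-neighbour lookup, clamped to the image bounds.
    xi = np.clip(np.rint(x).astype(int), 0, image.shape[1] - 1)
    yi = np.clip(np.rint(y).astype(int), 0, image.shape[0] - 1)
    return image[yi, xi]

# Usage: the same part seen at two orientations lands in the same
# canonical frame once its own pose parameters are applied.
image = np.arange(400, dtype=float).reshape(20, 20)
patch = pose_normalize(image, center=(10, 10), scale=1.0, angle=0.0, out_size=5)
```

Because every part is resampled into the same canonical frame, appearance descriptors computed on such patches can be compared across images without the part's position, size, or rotation entering the comparison, which is the property the abstract credits for learning from few examples.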

Cited by 127 articles