Birdlets: Subordinate categorization using volumetric primitives and pose-normalized appearance
- 1 November 2011
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Subordinate-level categorization typically rests on establishing salient distinctions between part-level characteristics of objects, in contrast to basic-level categorization, where the presence or absence of parts is determinative. We develop an approach for subordinate categorization in vision, focusing on an avian domain due to the fine-grained structure of the category taxonomy for this domain. We explore a pose-normalized appearance model based on a volumetric poselet scheme. The variation in shape and appearance properties of these parts across a taxonomy provides the cues needed for subordinate categorization. Training pose detectors requires a relatively large amount of training data per category when done from scratch; using a subordinate-level approach, we exploit a pose classifier trained at the basic-level, and extract part appearance and shape information to build subordinate-level models. Our model associates the underlying image pattern parameters used for detection with corresponding volumetric part location, scale and orientation parameters. These parameters implicitly define a mapping from the image pixels into a pose-normalized appearance space, removing view and pose dependencies, facilitating fine-grained categorization from relatively few training examples.Keywords
This publication has 30 references indexed in Scilit:
- The Pascal Visual Object Classes (VOC) ChallengeInternational Journal of Computer Vision, 2009
- From Images to Shape Models for Object DetectionInternational Journal of Computer Vision, 2009
- A note on Platt’s probabilistic outputs for support vector machinesMachine Learning, 2007
- A Sparse Object Category Model for Efficient Learning and Exhaustive RecognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Histograms of Oriented Gradients for Human DetectionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Pictorial Structures for Object RecognitionInternational Journal of Computer Vision, 2005
- Learning from one example through shared densities on transformsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Rotating objects to recognize them: A case study on the role of viewpoint dependency in the recognition of three-dimensional objectsPsychonomic Bulletin & Review, 1995
- Representation and recognition of the spatial organization of three-dimensional shapesProceedings of the Royal Society of London. B. Biological Sciences, 1978
- Basic objects in natural categoriesCognitive Psychology, 1976