Names and faces in the news

12 November 2004

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 848-854
https://doi.org/10.1109/cvpr.2004.1315253

Abstract

We show that quite good face clustering is possible for a dataset of inaccurately and ambiguously labelled face images. Our dataset consists of 44,773 face images, obtained by applying a face finder to approximately half a million captioned news images. This dataset is more realistic than the usual face recognition datasets, because it contains faces captured "in the wild" in a variety of configurations with respect to the camera, with a variety of expressions, and under illumination of widely varying color. Each face image is associated with a set of names, automatically extracted from the associated caption. Many, but not all, of these sets contain the correct name. We cluster face images in appropriate discriminant coordinates. We use a clustering procedure to break ambiguities in labelling and to identify incorrectly labelled faces. A merging procedure then identifies variants of names that refer to the same individual. The resulting representation can be used to label faces in news images or to organize news pictures by the individuals present. An alternative view of our procedure is as a process that cleans up noisy supervised data. We demonstrate how to use entropy measures to evaluate such procedures.
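
The abstract does not say which entropy measure is used for evaluation. As an illustration only, the following minimal sketch assumes one common choice: the Shannon entropy of the true identities within each cluster, averaged over clusters and weighted by cluster size. The function names and the toy inputs are hypothetical and do not come from the paper.

```python
import math
from collections import Counter, defaultdict


def cluster_entropy(labels):
    """Shannon entropy (in bits) of the label distribution in one cluster.

    A cluster whose faces all belong to one person scores 0;
    a cluster mixing several people scores higher.
    """
    counts = Counter(labels)
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def weighted_mean_entropy(assignments, true_names):
    """Size-weighted average of per-cluster entropies.

    assignments: cluster id given to each face by the clustering procedure.
    true_names:  hand-verified identity of each face (assumed available
                 for an evaluation subset).
    """
    clusters = defaultdict(list)
    for cluster_id, name in zip(assignments, true_names):
        clusters[cluster_id].append(name)
    n = len(true_names)
    return sum(len(members) / n * cluster_entropy(members)
               for members in clusters.values())


if __name__ == "__main__":
    # Toy example: cluster 0 is pure, cluster 1 mixes two people.
    assignments = [0, 0, 1, 1]
    true_names = ["A", "A", "B", "C"]
    print(weighted_mean_entropy(assignments, true_names))
```

Under this assumed measure, lower values indicate cleaner clusters, so comparing the score before and after a cleaning procedure gives a rough indication of how much label noise was removed.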


Cited by 147 articles