Inductive Hashing on Manifolds

Abstract

Learning-based hashing methods have attracted considerable attention because they greatly increase the scale at which existing algorithms can operate. Most of these methods are designed to generate binary codes that preserve Euclidean distance in the original space. Manifold learning techniques, in contrast, are better able to model the intrinsic structure embedded in the original high-dimensional data. However, the complexity of these models, and their difficulty handling out-of-sample data, have previously rendered them unsuitable for large-scale embedding. In this work, we consider how to learn compact binary embeddings of data on their intrinsic manifolds. To address these difficulties, we describe an efficient, inductive solution to the out-of-sample problem, and a process by which non-parametric manifold learning may be used as the basis of a hashing method. Our approach thus allows the development of a range of new hashing techniques that exploit the flexibility of the wide variety of manifold learning approaches available. In particular, we show that hashing on the basis of t-SNE [29] outperforms state-of-the-art hashing methods on large-scale benchmark datasets, and is very effective for image classification with very short code lengths.
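
The abstract does not spell out the inductive step, so the following is only a minimal sketch of one way such an out-of-sample extension could look, not necessarily the formulation used in the paper. It assumes a small base set whose low-dimensional embedding has already been computed by some manifold learner; a new point is then embedded as a Gaussian-kernel-weighted average of the base embeddings and binarized by thresholding. The function names (`inductive_embed`, `to_binary_codes`), the kernel, the bandwidth `sigma`, the median threshold, and the PCA stand-in for the manifold embedding are all assumptions made for illustration.

```python
import numpy as np

def inductive_embed(X, base, base_emb, sigma=1.0):
    """Embed rows of X as a kernel-weighted average of base embeddings.

    Hypothetical helper: the Gaussian kernel and bandwidth are assumptions.
    """
    # Squared Euclidean distance from every query to every base point: (n, m)
    d2 = ((X[:, None, :] - base[None, :, :]) ** 2).sum(axis=2)
    # Shift by the row-wise minimum for numerical stability; the shift
    # cancels when the weights are normalized below.
    w = np.exp(-(d2 - d2.min(axis=1, keepdims=True)) / sigma**2)
    w /= w.sum(axis=1, keepdims=True)
    return w @ base_emb

def to_binary_codes(emb, threshold):
    """Threshold each embedding dimension to obtain one bit per dimension."""
    return (emb > threshold).astype(np.uint8)

rng = np.random.default_rng(0)
base = rng.normal(size=(50, 16))          # toy base set (e.g. cluster centres)

# Stand-in for a manifold embedding of the base set. In practice this would
# come from a manifold learner such as t-SNE; PCA via SVD is used here only
# to keep the sketch dependency-free.
centered = base - base.mean(axis=0)
_, _, vt = np.linalg.svd(centered, full_matrices=False)
base_emb = centered @ vt[:4].T            # 4 dimensions -> 4-bit codes

# Assumed thresholding rule: per-dimension median of the base embedding.
threshold = np.median(base_emb, axis=0)

queries = rng.normal(size=(5, 16))        # unseen points
codes = to_binary_codes(inductive_embed(queries, base, base_emb, sigma=4.0),
                        threshold)
print(codes)
```

Because only the small base set is ever passed through the manifold learner, new points are embedded with one distance computation per base point, which is the property that makes an approach of this kind usable at scale.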
