Mining near-duplicate graph for cluster-based reranking of web video search results

23 November 2010

journal article
Published by Association for Computing Machinery (ACM) in ACM Transactions on Information Systems

Vol. 28 (4), 1-27
https://doi.org/10.1145/1852102.1852108

Abstract

Recently, video search reranking has been an effective mechanism to improve the initial text-based ranking list by incorporating visual consistency among the result videos. While existing methods attempt to rerank all the individual result videos, they suffer from several drawbacks. In this article, we propose a new video reranking paradigm called cluster-based video reranking (CVR). The idea is to first construct a video near-duplicate graph representing the visual similarity relationship among videos, followed by identifying the near-duplicate clusters from the video near-duplicate graph, then ranking the obtained near-duplicate clusters based on cluster properties and intercluster links, and finally for each ranked cluster, a representative video is selected and returned. Compared to existing methods, the new CVR ranks clusters and exhibits several advantages, including superior reranking by utilizing more reliable cluster properties, fast reranking on a small number of clusters, diverse and representative results. Particularly, we formulate the near-duplicate cluster identification as a novel maximally cohesive subgraph mining problem. By leveraging the designed cluster scoring properties indicating the cluster's importance and quality, random walk is applied over the near-duplicate cluster graph to rank clusters. An extensive evaluation study proves the novelty and superiority of our proposals over existing methods.

Keywords

This publication has 23 references indexed in Scilit:

Understanding near-duplicate videos
Published by Association for Computing Machinery (ACM) ,2009
Bounded coordinate system indexing for real-time video clip search
ACM Transactions on Information Systems, 2009
Video search re-ranking via multi-graph propagation
Published by Association for Computing Machinery (ACM) ,2007
Video search reranking through random walk over document-level context graph
Published by Association for Computing Machinery (ACM) ,2007
Video copy detection
Published by Association for Computing Machinery (ACM) ,2007
Clustering by Passing Messages Between Data Points
Science, 2007
Video search reranking via information bottleneck principle
Published by Association for Computing Machinery (ACM) ,2006
Graph mining
ACM Computing Surveys, 2006
Pushing Tougher Constraints in Frequent Pattern Mining
Lecture Notes in Computer Science, 2005
The use of MMR, diversity-based reranking for reordering documents and producing summaries
Published by Association for Computing Machinery (ACM) ,1998

Cited by 22 articles