Efficient Computation of Diverse Query Results
- 1 April 2008
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 32 (10636382), 228-236
- https://doi.org/10.1109/icde.2008.4497431
Abstract
We study the problem of efficiently computing diverse query results in online shopping applications, where users specify queries through a form interface that allows a mix of structured and content-based selection conditions. Intuitively, the goal of diverse query answering is to return a representative set of top-k answers from all the tuples that satisfy the user selection condition. For example, if a user is searching for Honda cars and we can only display five results, we wish to return cars from five different Honda models, as opposed to returning cars from only one or two Honda models. A key contribution of this paper is to formally define the notion of diversity, and to show that existing score-based techniques commonly used in web applications are not sufficient to guarantee diversity. Another contribution of this paper is to develop novel and efficient query processing techniques that guarantee diversity. Our experimental results using Yahoo! Autos data show that our proposed techniques are scalable and efficient.
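The Honda example in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm or its formal definition of diversity; it only shows the intuition of spreading k results across distinct models instead of taking the first k matches. The function name `diverse_top_k`, the `cars` sample rows, and the round-robin strategy are all assumptions made for illustration.

```python
from collections import OrderedDict


def diverse_top_k(rows, key, k):
    """Pick up to k rows, cycling across the distinct values of `key`.

    Illustrative only: a simple round-robin over groups, not the
    query processing technique proposed in the paper.
    """
    # Group rows by the diversity attribute, preserving first-seen order.
    groups = OrderedDict()
    for row in rows:
        groups.setdefault(row[key], []).append(row)

    queues = list(groups.values())
    picked = []
    depth = 0
    # Take the depth-th row of every group before going one level deeper.
    while len(picked) < k and any(depth < len(q) for q in queues):
        for q in queues:
            if depth < len(q) and len(picked) < k:
                picked.append(q[depth])
        depth += 1
    return picked


# Hypothetical sample data: all rows satisfy the selection "make = Honda".
cars = [
    {"make": "Honda", "model": "Civic", "year": 2005},
    {"make": "Honda", "model": "Civic", "year": 2006},
    {"make": "Honda", "model": "Civic", "year": 2007},
    {"make": "Honda", "model": "Accord", "year": 2004},
    {"make": "Honda", "model": "CR-V", "year": 2006},
]

first_three = cars[:3]                       # three Civics
diverse_three = diverse_top_k(cars, "model", 3)
print([c["model"] for c in first_three])
print([c["model"] for c in diverse_three])
```

Taking the first three matching rows returns only one model, whereas the round-robin selection returns one car from each of three models, which is the behaviour the abstract describes as desirable.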
This publication has 3 references indexed in Scilit:
- Extracting redundancy-aware top-k patterns. Published by Association for Computing Machinery (ACM), 2006
- Top-k selection queries over relational databases. ACM Transactions on Database Systems, 2002
- Combining Fuzzy Information from Multiple Systems. Journal of Computer and System Sciences, 1999