Query clustering using user logs

1 January 2002

journal article
Published by Association for Computing Machinery (ACM) in ACM Transactions on Information Systems

Vol. 20 (1), 59-81
https://doi.org/10.1145/503104.503108

Abstract

Query clustering is a process used to discover frequently asked questions or most popular topics on a search engine. This process is crucial for search engines based on question-answering. Because of the short lengths of queries, approaches based on keywords are not suitable for query clustering. This paper describes a new query clustering method that makes use of user logs which allow us to identify the documents the users have selected for a query. The similarity between two queries may be deduced from the common documents the users selected for them. Our experiments show that a combination of both keywords and user logs is better than using either method alone.

Keywords

This publication has 12 references indexed in Scilit:

Agglomerative clustering of a search engine query log
Published by Association for Computing Machinery (ACM) ,2000
A question answering system supported by information extraction
Published by Association for Computational Linguistics (ACL) ,2000
Algorithms on Strings, Trees and Sequences
Published by Cambridge University Press (CUP) ,1997
Automatic feedback using past queries
Published by Association for Computing Machinery (ACM) ,1997
Query expansion using local and global document analysis
Published by Association for Computing Machinery (ACM) ,1996
Learning collection fusion strategies
Published by Association for Computing Machinery (ACM) ,1995
Introduction to WordNet: An On-line Lexical Database^*
International Journal of Lexicography, 1990
Term clustering of syntactic phrases
Published by Association for Computing Machinery (ACM) ,1989
An algorithm for suffix stripping
Program: electronic library and information systems, 1980
Bibliographic coupling between scientific papers
American Documentation, 1963

Cited by 249 articles