Is Relevance Relevant? User Relevance Ratings May Not Predict the Impact of Internet Search on Decision Outcomes

Open Access

1 July 2008

journal article
Published by Oxford University Press (OUP) in Journal of the American Medical Informatics Association

Vol. 15 (4), 542-545
https://doi.org/10.1197/jamia.m2663

Abstract

Objective: A common measure of Internet search engine effectiveness is its ability to find documents that a user perceives as ‘relevant’. This study sought to test whether user provided relevance ratings for documents retrieved by an Internet search engine correlate with the decision outcome after use of a search engine. Design: 227 university students were asked to answer four randomly assigned consumer health questions, then to conduct an Internet search on one of two randomly assigned search engines of different performance, and to again answer the question. Measurements: Participants were asked to provide a relevance score for each document retrieved as well as a pre and post search answer to each question. Results: User relevance rankings had little or no predictive power. Relevance rankings were unable to predict whether the user of a search engine could correctly answer a question after search and could not differentiate between two search engines with statistically different performance in the hands of users. Only when users had strong prior knowledge of the questions, and the decision task was of low complexity, did relevance appear to have modest predictive power. Conclusions: User provided relevance rankings taken in isolation seem to be of limited to no value when designing a search engine that will be used in a general-purpose setting. Relevance rankings may have a place in situations in which experts provide rankings, and decision tasks are of complexity commensurate with the abilities of the raters. A more natural metric of search engine performance may be a user's ability to accurately complete a task, as this removes the inherent subjectivity of relevance rankings, and provides a direct and repeatable outcome measure which directly correlates with the performance of the search technology in the hands of users.

Keywords

This publication has 9 references indexed in Scilit:

Impact of Web Searching and Social Feedback on Consumer Decision Making: A Prospective Online Experiment
Journal of Medical Internet Research, 2008
A Bayesian model that predicts the impact of Web searching on decision making
Journal of the American Society for Information Science and Technology, 2006
Architecture for Knowledge-Based and Federated Search of Online Clinical Evidence
Journal of Medical Internet Research, 2005
Agreement, the F-Measure, and Reliability in Information Retrieval
Journal of the American Medical Informatics Association, 2005
Do Online Information Retrieval Systems Help Experienced Clinicians Answer Clinical Questions?
Journal of the American Medical Informatics Association, 2005
Which clinical decisions benefit from automation? A task complexity approach
International Journal of Medical Informatics, 2003
A new approach to the concept of "relevance" in information retrieval (IR).
2001
How Well Do Physicians Use Electronic Information Retrieval Systems?
Published by American Medical Association (AMA) ,1998
Relevance: The whole history
Journal of the American Society for Information Science, 1997

Cited by 4 articles