Is Relevance Relevant? User Relevance Ratings May Not Predict the Impact of Internet Search on Decision Outcomes
Open Access
- 1 July 2008
- journal article
- Published by Oxford University Press (OUP) in Journal of the American Medical Informatics Association
- Vol. 15 (4), 542-545
- https://doi.org/10.1197/jamia.m2663
Abstract
Objective: A common measure of Internet search engine effectiveness is its ability to find documents that a user perceives as ‘relevant’. This study sought to test whether user provided relevance ratings for documents retrieved by an Internet search engine correlate with the decision outcome after use of a search engine. Design: 227 university students were asked to answer four randomly assigned consumer health questions, then to conduct an Internet search on one of two randomly assigned search engines of different performance, and to again answer the question. Measurements: Participants were asked to provide a relevance score for each document retrieved as well as a pre and post search answer to each question. Results: User relevance rankings had little or no predictive power. Relevance rankings were unable to predict whether the user of a search engine could correctly answer a question after search and could not differentiate between two search engines with statistically different performance in the hands of users. Only when users had strong prior knowledge of the questions, and the decision task was of low complexity, did relevance appear to have modest predictive power. Conclusions: User provided relevance rankings taken in isolation seem to be of limited to no value when designing a search engine that will be used in a general-purpose setting. Relevance rankings may have a place in situations in which experts provide rankings, and decision tasks are of complexity commensurate with the abilities of the raters. A more natural metric of search engine performance may be a user's ability to accurately complete a task, as this removes the inherent subjectivity of relevance rankings, and provides a direct and repeatable outcome measure which directly correlates with the performance of the search technology in the hands of users.Keywords
This publication has 9 references indexed in Scilit:
- Impact of Web Searching and Social Feedback on Consumer Decision Making: A Prospective Online ExperimentJournal of Medical Internet Research, 2008
- A Bayesian model that predicts the impact of Web searching on decision makingJournal of the American Society for Information Science and Technology, 2006
- Architecture for Knowledge-Based and Federated Search of Online Clinical EvidenceJournal of Medical Internet Research, 2005
- Agreement, the F-Measure, and Reliability in Information RetrievalJournal of the American Medical Informatics Association, 2005
- Do Online Information Retrieval Systems Help Experienced Clinicians Answer Clinical Questions?Journal of the American Medical Informatics Association, 2005
- Which clinical decisions benefit from automation? A task complexity approachInternational Journal of Medical Informatics, 2003
- A new approach to the concept of "relevance" in information retrieval (IR).2001
- How Well Do Physicians Use Electronic Information Retrieval Systems?Published by American Medical Association (AMA) ,1998
- Relevance: The whole historyJournal of the American Society for Information Science, 1997