A Study on Pubmed Search Tag Usage Pattern: Association Rule Mining of a Full-day Pubmed Query Log
Open Access
- 9 January 2013
- journal article
- Published by Springer Science and Business Media LLC in BMC Medical Informatics and Decision Making
- Vol. 13 (1), 8
- https://doi.org/10.1186/1472-6947-13-8
Abstract
The practice of evidence-based medicine requires efficient biomedical literature search such as PubMed/MEDLINE. Retrieval performance relies highly on the efficient use of search field tags. The purpose of this study was to analyze PubMed log data in order to understand the usage pattern of search tags by the end user in PubMed/MEDLINE search. A PubMed query log file was obtained from the National Library of Medicine containing anonymous user identification, timestamp, and query text. Inconsistent records were removed from the dataset and the search tags were extracted from the query texts. A total of 2,917,159 queries were selected for this study issued by a total of 613,061 users. The analysis of frequent co-occurrences and usage patterns of the search tags was conducted using an association mining algorithm. The percentage of search tag usage was low (11.38% of the total queries) and only 2.95% of queries contained two or more tags. Three out of four users used no search tag and about two-third of them issued less than four queries. Among the queries containing at least one tagged search term, the average number of search tags was almost half of the number of total search terms. Navigational search tags are more frequently used than informational search tags. While no strong association was observed between informational and navigational tags, six (out of 19) informational tags and six (out of 29) navigational tags showed strong associations in PubMed searches. The low percentage of search tag usage implies that PubMed/MEDLINE users do not utilize the features of PubMed/MEDLINE widely or they are not aware of such features or solely depend on the high recall focused query translation by the PubMed’s Automatic Term Mapping. The users need further education and interactive search application for effective use of the search tags in order to fulfill their biomedical information needs from PubMed/MEDLINE.Keywords
This publication has 47 references indexed in Scilit:
- Using PubMed in radiology: Ten useful tips for radiologistsIndian Journal of Radiology and Imaging, 2011
- Semi-automatic semantic annotation of PubMed queries: A study on quality, efficiency, satisfactionJournal of Biomedical Informatics, 2010
- Mining connections between chemicals, proteins, and diseases extracted from Medline annotationsJournal of Biomedical Informatics, 2010
- The WEKA data mining softwareACM SIGKDD Explorations Newsletter, 2009
- Improving accuracy for identifying related PubMed queries by an integrated approachJournal of Biomedical Informatics, 2009
- Identifying related journals through log analysisBioinformatics, 2009
- Understanding PubMed(R) user search behavior through log analysisDatabase: The Journal of Biological Databases and Curation, 2009
- Evaluation of query expansion using MeSH in PubMedInformation Retrieval Journal, 2008
- Analysis of queries sent to PubMed at the point of care: Observation of search behaviour in a medical teaching hospitalBMC Medical Informatics and Decision Making, 2008
- A Day in the Life of PubMed: Analysis of a Typical Day's Query LogJournal of the American Medical Informatics Association, 2007