Pinpointing needles in giant haystacks: use of text mining to reduce impractical screening workload in extremely large scoping reviews
Open Access
- 23 August 2013
- journal article
- research article
- Published by Wiley in Research Synthesis Methods
- Vol. 5 (1), 31-49
- https://doi.org/10.1002/jrsm.1093
Abstract
In scoping reviews, boundaries of relevant evidence may be initially fuzzy, with refined conceptual understanding of interventions and their proposed mechanisms of action an intended output of the scoping process rather than its starting point. Electronic searches are therefore sensitive, often retrieving very large record sets that are impractical to screen in their entirety. This paper describes methods for applying and evaluating the use of text mining (TM) technologies to reduce impractical screening workload in reviews, using examples of two extremely large‐scale scoping reviews of public health evidence (choice architecture (CA) and economic environment (EE)). Electronic searches retrieved >800,000 (CA) and >1 million (EE) records. TM technologies were used to prioritise records for manual screening. TM performance was measured prospectively. TM reduced manual screening workload by 90% (CA) and 88% (EE) compared with conventional screening (absolute reductions of ≈430 000 (CA) and ≈378 000 (EE) records). This study expands an emerging corpus of empirical evidence for the use of TM to expedite study selection in reviews. By reducing screening workload to manageable levels, TM made it possible to assemble and configure large, complex evidence bases that crossed research discipline boundaries. These methods are transferable to other scoping and systematic reviews incorporating conceptual development or explanatory dimensions. © 2013 The Authors. Research Synthesis Methods published by John Wiley & Sons, Ltd.Keywords
This publication has 20 references indexed in Scilit:
- Pinpointing needles in giant haystacks: use of text mining to reduce impractical screening workload in extremely large scoping reviewsResearch Synthesis Methods, 2013
- Changing Human Behavior to Prevent Disease: The Importance of Targeting Automatic ProcessesScience, 2012
- Toward modernizing the systematic review pipeline in genetics: efficient updating via data miningGenetics in Medicine, 2012
- Studying the potential impact of automated document classification on scheduling a systematic review updateBMC Medical Informatics and Decision Making, 2012
- The judgement process in evidence-based medicine and health technology assessmentSocial Theory & Health, 2011
- Literature searching for social science systematic reviews: consideration of a range of search techniquesHealth Information and Libraries Journal, 2010
- Semi-automated screening of biomedical citations for systematic reviewsBMC Bioinformatics, 2010
- Unpacking your literature search toolbox: on search styles and tacticsHealth Information and Libraries Journal, 2008
- Searching for StudiesPublished by Wiley ,2008
- Reducing Workload in Systematic Review Preparation Using Automated Citation ClassificationJournal of the American Medical Informatics Association, 2006