Comparing data mining methods on the VAERS database
Open Access
- 26 August 2005
- journal article
- research article
- Published by Wiley in Pharmacoepidemiology and Drug Safety
- Vol. 14 (9), 601-609
- https://doi.org/10.1002/pds.1107
Abstract
Purpose: Data mining may enhance traditional surveillance of vaccine adverse events by identifying events that are reported more commonly after administering one vaccine than other vaccines. Data mining methods find signals as the proportion of times a condition or group of conditions is reported soon after the administration of a vaccine; thus it is a relative proportion compared across vaccines, and not an absolute rate for the condition. The Vaccine Adverse Event Reporting System (VAERS) contains approximately 150 000 reports of adverse events that are possibly associated with vaccine administration.

Methods: We studied four data mining techniques: empirical Bayes geometric mean (EBGM), lower bound of the EBGM's 90% confidence interval (EB05), proportional reporting ratio (PRR), and screened PRR (SPRR). We applied these to the VAERS database and compared the agreement among methods and other performance properties, particularly focusing on the vaccine–event combinations with the highest numerical scores in the various methods.

Results: The vaccine–event combinations with the highest numerical scores varied substantially among the methods. Not all combinations representing known associations appeared in the top 100 vaccine–event pairs for all methods.

Conclusions: The four methods differ in their ranking of vaccine–COSTART pairs. A given method may be superior in certain situations but inferior in others. This paper examines the statistical relationships among the four estimators. Determining which method is best for public health will require additional analysis that focuses on the true alarm and false alarm rates using known vaccine–event associations. Evaluating the properties of these data mining methods will help determine the value of such methods in vaccine safety surveillance.

Copyright © 2005 John Wiley & Sons, Ltd.
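As a rough illustration of the "relative proportion compared across vaccines" idea, the sketch below computes a proportional reporting ratio from a 2x2 table of report counts, using the standard definition of the PRR. The function name `prr`, the cell labels `a`, `b`, `c`, `d`, and the counts are hypothetical; they are not taken from the paper or from VAERS, and the paper's exact computation may differ.

```python
def prr(a: int, b: int, c: int, d: int) -> float:
    """Proportional reporting ratio from a 2x2 table of report counts.

    a: reports of the event of interest after the vaccine of interest
    b: reports of all other events after the vaccine of interest
    c: reports of the event of interest after all other vaccines
    d: reports of all other events after all other vaccines
    """
    if a + b == 0 or c + d == 0:
        raise ValueError("each vaccine group needs at least one report")
    if c == 0:
        raise ValueError("PRR is undefined when the comparison group has no such event")
    proportion_vaccine = a / (a + b)   # share of this vaccine's reports naming the event
    proportion_others = c / (c + d)    # same share among all other vaccines
    return proportion_vaccine / proportion_others


# Hypothetical counts, chosen only to show the arithmetic.
score = prr(a=20, b=980, c=100, d=98900)
print(round(score, 2))  # 19.8
```

A value well above 1 means the event makes up a larger share of reports for this vaccine than for the others. It is not an incidence rate, because the counts are reports rather than doses administered.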