Strategies for identifying pregnancies in the automated medical records of the General Practice Research Database
- 8 January 2004
- journal article
- research article
- Published by Wiley in Pharmacoepidemiology and Drug Safety
- Vol. 13 (11), 749-759
- https://doi.org/10.1002/pds.935
Abstract
Purpose: To develop a method for identifying the beginning and ending records of pregnancies in the automated medical records of the General Practice Research Database (GPRD).
Methods: Women's records from 1991 to 1999 were searched for codes from 17 pregnancy marker and 7 pregnancy outcome categories. Using the retrieved records, all possible pregnancy marker-outcome combinations were formed per woman. For each combination, the difference in days between record event dates was calculated. Restrictions were applied to select the combination with the earliest pregnancy marker mapped to the first outcome for each pregnancy. Iterations of the algorithm identified multiple pregnancies per woman when present. The algorithm was evaluated by analyzing time between marker and outcome event dates of mapped pregnancies and by analyzing unmapped pregnancy markers and outcomes.
Results: A total of 297 082 pregnancies were identified: 80% by general practitioner (GP) visit codes as the earliest pregnancy marker and 14% by laboratory or procedure codes. Limiting pregnancies to one per woman aged 15–44 years yielded 209 266 pregnancies. Pregnancy mapping success was greater than 80%. Plotting the pregnancies by weeks from earliest pregnancy marker to outcome and by pregnancy marker category showed two peaks in the distribution: 2–3 weeks and 33 weeks.
Conclusions: Arranging codes and time into algorithms provides a useful tool for pregnancy identification in databases whose size prohibits the audit of printed records. Evaluation of our algorithm confirmed a high degree of mapping success and a sensible time distribution from pregnancy marker to outcome.
Copyright © 2004 John Wiley & Sons, Ltd.
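The mapping step described in the Methods can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the restrictions that were applied, so the function name `map_pregnancies`, the `Pregnancy` record, and the 301-day upper bound on the marker-to-outcome gap are all assumptions made for illustration. For one woman, the sketch forms every marker-outcome combination, computes the difference in days, keeps the earliest marker paired with the first outcome that follows it, then iterates on the remaining records to pick up later pregnancies.

```python
from datetime import date
from itertools import product
from typing import List, NamedTuple

# Assumed upper bound on days from marker to outcome (about 43 weeks).
# The abstract does not give the actual restriction; this is illustrative.
MAX_GAP_DAYS = 301


class Pregnancy(NamedTuple):
    marker_date: date
    outcome_date: date
    gap_days: int


def map_pregnancies(
    markers: List[date],
    outcomes: List[date],
    max_gap_days: int = MAX_GAP_DAYS,
) -> List[Pregnancy]:
    """Map one woman's pregnancy markers to outcomes (hypothetical helper)."""
    markers = sorted(markers)
    outcomes = sorted(outcomes)
    pregnancies = []
    while True:
        # All marker-outcome combinations whose gap in days is plausible.
        combos = [
            (m, o, (o - m).days)
            for m, o in product(markers, outcomes)
            if 0 <= (o - m).days <= max_gap_days
        ]
        if not combos:
            break  # whatever is left stays unmapped
        # Earliest marker first, then the first outcome that follows it.
        m, o, gap = min(combos, key=lambda c: (c[0], c[1]))
        pregnancies.append(Pregnancy(m, o, gap))
        # Discard records up to the mapped outcome and iterate again.
        markers = [d for d in markers if d > o]
        outcomes = [d for d in outcomes if d > o]
    return pregnancies


# Example: two markers precede the first outcome, one precedes the second.
example = map_pregnancies(
    markers=[date(1995, 1, 10), date(1995, 2, 1), date(1996, 6, 1)],
    outcomes=[date(1995, 9, 1), date(1997, 2, 15)],
)
```

Markers or outcomes that never fall into an accepted combination are simply left over, which corresponds to the unmapped records the authors examined during evaluation.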