Power and instrument strength requirements for Mendelian randomization studies using multiple genetic variants
Top Cited Papers
Open Access
- 15 September 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in International Journal of Epidemiology
- Vol. 40 (3), 740-752
- https://doi.org/10.1093/ije/dyq151
Abstract
Background Mendelian Randomization (MR) studies assess the causality of an exposure–disease association using genetic determinants [i.e. instrumental variables (IVs)] of the exposure. Power and IV strength requirements for MR studies using multiple genetic variants have not been explored. Methods We simulated cohort data sets consisting of a normally distributed disease trait, a normally distributed exposure, which affects this trait and a biallelic genetic variant that affects the exposure. We estimated power to detect an effect of exposure on disease for varying allele frequencies, effect sizes and samples sizes (using two-stage least squares regression on 10 000 data sets—Stage 1 is a regression of exposure on the variant. Stage 2 is a regression of disease on the fitted exposure). Similar analyses were conducted using multiple genetic variants (5, 10, 20) as independent or combined IVs. We assessed IV strength using the first-stage F statistic. Results Simulations of realistic scenarios indicate that MR studies will require large (n > 1000), often very large (n > 10 000), sample sizes. In many cases, so-called ‘weak IV’ problems arise when using multiple variants as independent IVs (even with as few as five), resulting in biased effect estimates. Combining genetic factors into fewer IVs results in modest power decreases, but alleviates weak IV problems. Ideal methods for combining genetic factors depend upon knowledge of the genetic architecture underlying the exposure. Conclusions The feasibility of well-powered, unbiased MR studies will depend upon the amount of variance in the exposure that can be explained by known genetic factors and the ‘strength’ of the IV set derived from these genetic factors.Keywords
This publication has 60 references indexed in Scilit:
- Genome-wide Association Study of Vitamin B6, Vitamin B12, Folate, and Homocysteine Blood ConcentrationsAmerican Journal of Human Genetics, 2009
- Common Variation in the β-Carotene 15,15′-Monooxygenase 1 Gene Affects Circulating Levels of Carotenoids: A Genome-wide Association StudyAmerican Journal of Human Genetics, 2009
- A Genome-wide Association Study Identifies Three Loci Associated with Mean Platelet VolumeAmerican Journal of Human Genetics, 2009
- Association of three genetic loci with uric acid concentration and risk of gout: a genome-wide association studyThe Lancet, 2008
- Population-Based Genome-wide Association Studies Reveal Six Loci Influencing Plasma Levels of Liver EnzymesAmerican Journal of Human Genetics, 2008
- Revealing the architecture of gene regulation: the promise of eQTL studiesTrends in Genetics, 2008
- Polymorphisms of the HNF1A Gene Encoding Hepatocyte Nuclear Factor-1α are Associated with C-Reactive ProteinAmerican Journal of Human Genetics, 2008
- Loci Related to Metabolic-Syndrome Pathways Including LEPR,HNF1A, IL6R, and GCKR Associate with Plasma C-Reactive Protein: The Women's Genome Health StudyAmerican Journal of Human Genetics, 2008
- ‘Mendelian randomization’ equals instrumental variable analysis with genetic instrumentsStatistics in Medicine, 2008
- Genome-wide Association Study Identifies Genes for Biomarkers of Cardiovascular Disease: Serum Urate and DyslipidemiaAmerican Journal of Human Genetics, 2008