Cascading epigenomic analysis for identifying disease genes from the regulatory landscape of GWAS variants
Open Access
- 22 November 2021
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Genetics
- Vol. 17 (11), e1009918
- https://doi.org/10.1371/journal.pgen.1009918
Abstract
The majority of genetic variants detected in genome wide association studies (GWAS) exert their effects on phenotypes through gene regulation. Motivated by this observation, we propose a multi-omic integration method that models the cascading effects of genetic variants from epigenome to transcriptome and eventually to the phenome in identifying target genes influenced by risk alleles. This cascading epigenomic analysis for GWAS, which we refer to as CEWAS, comprises two types of models: one for linking cis genetic effects to epigenomic variation and another for linking cis epigenomic variation to gene expression. Applying these models in cascade to GWAS summary statistics generates gene level statistics that reflect genetically-driven epigenomic effects. We show on sixteen brain-related GWAS that CEWAS provides higher gene detection rate than related methods, and finds disease relevant genes and gene sets that point toward less explored biological processes. CEWAS thus presents a novel means for exploring the regulatory landscape of GWAS variants in uncovering disease mechanisms. The majority of genetic variants detected in genome wide association studies (GWAS) exert their effects on phenotypes through gene regulation. Motivated by this observation, we propose a multi-omic integration method that models the cascading effects of genetic variants from epigenome to transcriptome and eventually to the phenome in identifying target genes influenced by risk alleles. This cascading epigenomic analysis for GWAS, which we refer to as CEWAS, combines the effect of genetic variants on DNA methylation as well as gene expression. We show on sixteen brain-related GWAS that CEWAS provides higher gene detection rate than related methods, and finds disease relevant genes and gene sets that point toward less explored biological processes.Funding Information
- National Institute on Aging (P30AG10161)
- National Institute on Aging (U01 AG046152)
- National Institute on Aging (U01AG61356)
- National Institute on Aging (AG057911)
- National Institute on Aging (AG061798)
- National Institute on Aging (P30AG72975)
- National Institute on Aging (R01AG15819)
This publication has 106 references indexed in Scilit:
- An integrated map of genetic variation from 1,092 human genomesNature, 2012
- Linking disease associations with regulatory information in the human genomeGenome Research, 2012
- Physiological consequences of defects in ERCC1–XPF DNA repair endonucleaseDNA Repair, 2011
- Mapping and analysis of chromatin state dynamics in nine human cell typesNature, 2011
- Oxidative stress in schizophrenia: An integrated approachNeuroscience & Biobehavioral Reviews, 2011
- The mammalian phenotype ontology: enabling robust annotation and comparative analysisWires Systems Biology and Medicine, 2009
- Potential etiologic and functional implications of genome-wide association loci for human diseases and traitsProceedings of the National Academy of Sciences of the United States of America, 2009
- The Dopamine Hypothesis of Schizophrenia: Version III--The Final Common PathwaySchizophrenia Bulletin, 2009
- How mitochondria produce reactive oxygen speciesBiochemical Journal, 2008
- Oxidative Stress in Parkinson's DiseaseAnnals of the New York Academy of Sciences, 2008