Relating CNVs to transcriptome data at fine resolution: Assessment of the effect of variant size, type, and overlap with functional regions
- 23 August 2011
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 21 (12), 2004-2013
- https://doi.org/10.1101/gr.122614.111
Abstract
Copy-number variants (CNVs) form an abundant class of genetic variation with a presumed widespread impact on individual traits. While recent advances, such as the population-scale sequencing of human genomes, facilitated the fine-scale mapping of CNVs, the phenotypic impact of most of these CNVs remains unclear. By relating copy-number genotypes to transcriptome sequencing data, we have evaluated the impact of CNVs, mapped at fine scale, on gene expression. Based on data from 129 individuals with ancestry from two populations, we identified CNVs associated with the expression of 110 genes, with 13% of the associations involving complex, multiallelic CNVs. Categorization of CNVs according to variant type, size, and gene overlap enabled us to examine the impact of different CNV classes on expression variation. While many small (<4 kb) CNVs were associated with expression variation, overall we observed an enrichment of large duplications and deletions, including large intergenic CNVs, relative to the entire set of expression-associated CNVs. Furthermore, the copy number of genes intersecting with CNVs typically correlated positively with the genes' expression, and also was more strongly correlated with expression than nearby single nucleotide polymorphisms, suggesting a frequent causal role of CNVs in expression quantitative trait loci (eQTLs). We also elucidated unexpected cases of negative correlations between copy number and expression by assessing the CNVs' effects on the structure and regulation of genes. Finally, we examined dosage compensation of transcript levels. Our results suggest that association studies can gain in resolution and power by including fine-scale CNV information, such as those obtained from population-scale sequencing.This publication has 47 references indexed in Scilit:
- Transcriptome genetics using second generation sequencing in a Caucasian populationNature, 2010
- Expression in Aneuploid Drosophila S2 CellsPLoS Biology, 2010
- Genomic Responses to Abnormal Gene Dosage: The X Chromosome Improved on a Common StrategyPLoS Biology, 2010
- Fast and SNP-tolerant detection of complex variants and splicing in short readsBioinformatics, 2010
- Origins and functional impact of copy number variation in the human genomeNature, 2009
- APOBEC3BDeletion and Risk of HIV‐1 AcquisitionThe Journal of Infectious Diseases, 2009
- Common Regulatory Variation Impacts Gene Expression in a Cell Type–Dependent MannerScience, 2009
- Duplications of noncoding elements 5′ of SOX9 are associated with brachydactyly-anonychiaNature Genetics, 2009
- Copy number variants, diseases and gene expressionHuman Molecular Genetics, 2009
- High-Resolution Mapping of Expression-QTLs Yields Insight into Human Gene RegulationPLoS Genetics, 2008