Comprehensive Functional Annotation of Seventy-One Breast Cancer Risk Loci
Open Access
- 22 May 2013
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 8 (5), e63925
- https://doi.org/10.1371/journal.pone.0063925
Abstract
Breast Cancer (BCa) genome-wide association studies revealed allelic frequency differences between cases and controls at index single nucleotide polymorphisms (SNPs). To date, 71 loci have thus been identified and replicated. More than 320,000 SNPs at these loci define BCa risk due to linkage disequilibrium (LD). We propose that BCa risk resides in a subgroup of SNPs that functionally affects breast biology. Such a shortlist will aid in framing hypotheses to prioritize a manageable number of likely disease-causing SNPs. We extracted all the SNPs, residing in 1 Mb windows around breast cancer risk index SNP from the 1000 genomes project to find correlated SNPs. We used FunciSNP, an R/Bioconductor package developed in-house, to identify potentially functional SNPs at 71 risk loci by coinciding them with chromatin biofeatures. We identified 1,005 SNPs in LD with the index SNPs (r2≥0.5) in three categories; 21 in exons of 18 genes, 76 in transcription start site (TSS) regions of 25 genes, and 921 in enhancers. Thirteen SNPs were found in more than one category. We found two correlated and predicted non-benign coding variants (rs8100241 in exon 2 and rs8108174 in exon 3) of the gene, ANKLE1. Most putative functional LD SNPs, however, were found in either epigenetically defined enhancers or in gene TSS regions. Fifty-five percent of these non-coding SNPs are likely functional, since they affect response element (RE) sequences of transcription factors. Functionality of these SNPs was assessed by expression quantitative trait loci (eQTL) analysis and allele-specific enhancer assays. Unbiased analyses of SNPs at BCa risk loci revealed new and overlooked mechanisms that may affect risk of the disease, thereby providing a valuable resource for follow-up studies.Keywords
This publication has 96 references indexed in Scilit:
- In vivo genome editing using a high-efficiency TALEN systemNature, 2012
- Comprehensive molecular portraits of human breast tumoursNature, 2012
- Extensive Promoter-Centered Chromatin Interactions Provide a Topological Basis for Transcription RegulationCell, 2012
- Differential genomic targeting of the transcription factor TAL1 in alternate haematopoietic lineagesThe EMBO Journal, 2010
- Long Noncoding RNAs with Enhancer-like Function in Human CellsCell, 2010
- Simple Combinations of Lineage-Determining Transcription Factors Prime cis-Regulatory Elements Required for Macrophage and B Cell IdentitiesMolecular Cell, 2010
- Transcriptome genetics using second generation sequencing in a Caucasian populationNature, 2010
- Understanding mechanisms underlying human gene expression variation with RNA sequencingNature, 2010
- A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancerNature Genetics, 2007
- Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genomeNature Genetics, 2007