Microsatellite Tandem Repeats Are Abundant in Human Promoters and Are Associated with Regulatory Elements
Open Access
- 6 February 2013
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 8 (2), e54710
- https://doi.org/10.1371/journal.pone.0054710
Abstract
Tandem repeats are genomic elements that are prone to changes in repeat number and are thus often polymorphic. These sequences are found at a high density at the start of human genes, in the gene’s promoter. Increasing empirical evidence suggests that length variation in these tandem repeats can affect gene regulation. One class of tandem repeats, known as microsatellites, changes rapidly in repeat number. Some of the genetic variation induced by microsatellites is known to result in phenotypic variation. Recently, our group developed a novel method for measuring the evolutionary conservation of microsatellites, and with it we discovered that human microsatellites near transcription start sites are often highly conserved. In this study, we examined the properties of microsatellites found in promoters. We found a high density of microsatellites at the start of genes. We showed that microsatellites are statistically associated with promoters using a wavelet analysis, which allowed us to test for associations on multiple scales and to control for other promoter-related elements. Because promoter microsatellites tend to be G/C-rich, we hypothesized that G/C-rich regulatory elements may drive the association between microsatellites and promoters. Our results indicate that CpG islands, G-quadruplexes (G4) and untranslated regulatory regions have highly significant associations with microsatellites, but controlling for these elements in the analysis does not remove the association between microsatellites and promoters. Given the intrinsic lability of promoter microsatellites and their overlap with predicted functional elements, these results suggest that many of them have the potential to affect human phenotypes by generating mutations in regulatory elements, which may ultimately result in disease. We discuss the potential functions of human promoter microsatellites in this context.
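To make the central object of the abstract concrete, the following minimal Python sketch locates perfect microsatellites in a DNA string and reports their G/C fraction. It is an illustration only, not the detection or wavelet method used in the study. The names `find_microsatellites` and `gc_fraction`, the 1 to 6 bp unit length, and the minimum of four tandem copies are all assumptions chosen for this example.

```python
import re

# Assumed thresholds (not taken from the paper): repeat unit of 1-6 bp,
# repeated at least 4 times in tandem with no interruptions.
# The lazy quantifier makes the regex prefer the shortest repeating unit.
_MICROSATELLITE = re.compile(r"([ACGT]{1,6}?)\1{3,}")


def find_microsatellites(seq):
    """Return (start, end, unit, copies) for each perfect tandem repeat.

    Coordinates are 0-based and half-open, as in Python slicing.
    """
    hits = []
    for match in _MICROSATELLITE.finditer(seq.upper()):
        unit = match.group(1)
        copies = (match.end() - match.start()) // len(unit)
        hits.append((match.start(), match.end(), unit, copies))
    return hits


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C (0.0 for an empty string)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)
```

Calling `find_microsatellites` on a promoter sequence and passing each reported slice to `gc_fraction` gives a rough view of how G/C-rich the repeats are. Imperfect (interrupted) repeats are deliberately ignored here to keep the sketch short.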