Determinants of CpG Islands: Expression in Early Embryo and Isochore Structure
- 15 October 2001
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 11 (11), 1854-1860
- https://doi.org/10.1101/gr.174501
Abstract
In an attempt to understand the origin of CpG islands (CGIs) in mammalian genomes, we have studied their location and structure according to the expression pattern of genes and to the G + C content of isochores in which they are embedded. We show that CGIs located over the transcription start site (named start CGIs) are very different structurally from the others (named no-start CGIs): (1) 61.6% of the no-start CGIs are due to repeated sequences (79% are due to Alus), whereas only 5.6% of the start CGIs are due to such repeats; (2) start CGIs are longer and display a higher CpG o/e ratio and G + C level than no-start CGIs. The frequency of tissue-specific genes associated to a start CGI varies according to the genomic G + C content, from 25% in G + C-poor isochores to 64% in G + C-rich isochores. Conversely, the frequency of housekeeping genes associated to a start CGI (90%) is independent of the isochore context. Interestingly, the structure of start CGIs is very similar for tissue-specific and housekeeping genes. Moreover, 93% of genes expressed in early embryo are found to exhibit a CpG island over their transcription start point. These observations are consistent with the hypothesis that the occurrence of these CGIs is the consequence of gene expression at this stage, when the methylation pattern is installed.
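The abstract compares CGIs by their CpG observed/expected (o/e) ratio and G + C level but does not define either measure. As a minimal sketch, assuming the commonly used definition in which the observed CpG count is divided by the count expected from the C and G frequencies, i.e. (number of CpG × sequence length) / (number of C × number of G), the two quantities could be computed as follows. The function names `gc_content` and `cpg_obs_exp` are hypothetical and are not taken from the article, which may have used different thresholds or windowing.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def cpg_obs_exp(seq: str) -> float:
    """CpG observed/expected ratio under the assumed definition:
    (count of CG dinucleotides * length) / (count of C * count of G).
    Returns 0.0 when the sequence contains no C or no G.
    """
    seq = seq.upper()
    n_c = seq.count("C")
    n_g = seq.count("G")
    n_cpg = seq.count("CG")  # "CG" cannot overlap itself, so count() is exact
    if n_c == 0 or n_g == 0:
        return 0.0
    return n_cpg * len(seq) / (n_c * n_g)


if __name__ == "__main__":
    example = "CGCGCGCG"  # toy sequence, not real data
    print(gc_content(example), cpg_obs_exp(example))
```

Under this definition a ratio close to 1 means CpG dinucleotides occur about as often as the base composition predicts, whereas lower values indicate CpG depletion.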