Notable clustering of transcription-factor-binding motifs in human pericentric regions and its biological significance
Open Access
- 30 July 2013
- journal article
- research article
- Published by Springer Science and Business Media LLC in Chromosome Research
- Vol. 21 (5), 461-474
- https://doi.org/10.1007/s10577-013-9371-y
Abstract
Since oligonucleotide composition in the genome sequence varies significantly among species even among those possessing the same genome G + C%, the composition has been used to distinguish a wide range of genomes and called as “genome signature”. Oligonucleotides often represent motif sequences responsible for sequence-specific protein binding (e.g., transcription-factor binding). Occurrences of such motif oligonucleotides in the genome should be biased compared to those observed in random sequences and may differ among genomes and genomic portions. Self-Organizing Map (SOM) is a powerful tool for clustering high-dimensional data such as oligonucleotide composition on one plane. We previously modified the conventional SOM for genome informatics to batch learning SOM or “BLSOM”. When we constructed BLSOMs to analyze pentanucleotide composition in 20-, 50-, and 100-kb sequences derived from the human genome, BLSOMs did not classify human sequences according to chromosome but revealed several specific zones composed primarily of sequences derived from pericentric regions. Interestingly, various transcription-factor-binding motifs were characteristically overrepresented in pericentric regions but underrepresented in most genomic sequences. When we focused on much shorter sequences (e.g., 1 kb), the clustering of transcription-factor-binding motifs was evident in pericentric, subtelomeric and sex chromosome pseudoautosomal regions. The biological significance of the clustering in these regions was discussed in connection with cell-type and -stage-dependent chromocenter formation and nuclear organization.Keywords
This publication has 23 references indexed in Scilit:
- On the Immortality of Television Sets: "Function" in the Human Genome According to the Evolution-Free Gospel of ENCODEGenome Biology and Evolution, 2013
- Genome-wide transcription factor binding: beyond direct target regulationTrends in Genetics, 2011
- SUMOylation promotes de novo targeting of HP1α to pericentric heterochromatinNature Genetics, 2011
- A Strand-Specific Burst in Transcription of Pericentric Satellites Is Required for Chromocenter Formation and Early Mouse DevelopmentDevelopmental Cell, 2010
- DNA Binding of Centromere Protein C (CENPC) Is Stabilized by Single-Stranded RNAPLoS Genetics, 2010
- Epigenetic inheritance during the cell cycleNature Reviews Molecular Cell Biology, 2009
- Centromere RNA is a key component for the assembly of nucleoproteins at the nucleolus and centromereGenome Research, 2007
- Informatics for Unveiling Hidden Genome SignaturesGenome Research, 2003
- Engineering applications of the self-organizing mapProceedings of the IEEE, 1996
- Global variation in G + C content along vertebrate genome DNA: Possible correlation with chromosome band structuresJournal of Molecular Biology, 1988