Compilation and analysis of sequences upstream from the translational start site in eukaryotic mRNAs
- 1 January 1984
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 12 (2), 857-872
- https://doi.org/10.1093/nar/12.2.857
Abstract
5'-Noncoding sequences have been tabulated for 211 messenger RNAs from higher eukaryotic cells. The 5'-proximal AUG triplet serves as the initiator codon in 95% of the mRNAs examined. The most conspicuous conserved feature is the presence of a purine (most often A) three nucleotides upstream from the AUG initiator codon; only 6 of the mRNAs in the survey have a pyrimidine in that position. There is a predominance of C in positions -1, -2, -4 and -5, just upstream from the initiator codon. The sequence CC(A/G)CCAUG(G) thus emerges as a consensus sequence for eukaryotic initiation sites. The extent to which the ribosome binding site in a given mRNA matches the -1 to -5 consensus sequence varies: more than half of the mRNAs in the tabulation have 3 or 4 nucleotides in common with the CCACC consensus, but only ten mRNAs conform perfectly.
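The comparison described in the abstract (counting how many of positions -5 to -1 agree with CCACC, and checking for a purine at -3) can be illustrated with a minimal sketch. This is not the procedure used in the paper; the names `CONSENSUS`, `first_aug` and `score_context` are hypothetical and chosen only for illustration, and the input is assumed to be a plain mRNA or cDNA string.

```python
# Hypothetical illustration; not the method used in the paper.
CONSENSUS = "CCACC"  # positions -5 to -1, as given in the abstract


def first_aug(mrna: str) -> int:
    """Return the index of the 5'-proximal AUG, or -1 if there is none."""
    return mrna.upper().replace("T", "U").find("AUG")


def score_context(mrna: str, aug_index: int) -> tuple[int, bool]:
    """Score the five nucleotides upstream of the AUG at aug_index.

    Returns (number of positions matching CCACC, purine at -3?).
    """
    seq = mrna.upper().replace("T", "U")
    if aug_index < 5 or seq[aug_index:aug_index + 3] != "AUG":
        raise ValueError("need an AUG with at least 5 upstream nucleotides")
    upstream = seq[aug_index - 5:aug_index]  # positions -5 .. -1
    matches = sum(a == b for a, b in zip(upstream, CONSENSUS))
    purine_at_minus3 = seq[aug_index - 3] in "AG"
    return matches, purine_at_minus3
```

For example, `score_context("GGCCACCAUGG", first_aug("GGCCACCAUGG"))` scores a made-up sequence that matches the consensus at all five positions; real sequences would be expected to match at only some of them, as the abstract notes.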