Proteomic signatures: Amino acid and oligopeptide compositions differentiate among phyla
- 19 December 2003
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 54 (1), 20-40
- https://doi.org/10.1002/prot.10559
Abstract
Availability of complete genome sequences allows in‐depth comparison of single‐residue and oligopeptide compositions of the corresponding proteomes. We have used principal component analysis (PCA) to study the landscape of compositional motifs across more than 70 genera from all three superkingdoms. Unexpectedly, the first two principal components clearly differentiate archaea, eubacteria, and eukaryota from each other. In particular, we contrast compositional patterns typical of the three superkingdoms and characterize differences between species and phyla, as well as among patterns shared by all compositional proteomic signatures. These species‐specific patterns may even extend to subsets of the entire proteome, such as proteins pertaining to individual yeast chromosomes. We identify factors that affect compositional signatures, such as living habitat, and detect strong eukaryotic preference for homopeptides and palindromic tripeptides. We further detect oligopeptides that are either universally over‐ or underabundant across the whole proteomic landscape, as well as oligopeptides whose over‐ or underabundance is phylum‐ or species‐specific. Finally, we report that species composition signatures preserve evolutionary memory, providing a new method to compare phylogenetic relationships among species that avoids problems of sequence alignment and ortholog detection. Proteins 2004.
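The general idea of the abstract (summarize each proteome as a composition vector, then apply PCA across organisms) can be illustrated with a minimal sketch. This is not the authors' pipeline: the organism names and sequences below are hypothetical toy data, the helper names `composition` and `pca_scores` are invented for illustration, and PCA is computed here by a plain SVD of the mean-centered matrix, which may differ from the scaling or preprocessing used in the paper.

```python
from collections import Counter

import numpy as np

# The 20 standard amino acids, in a fixed order that defines the vector layout.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(proteins):
    """Relative frequency of each standard residue over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Ignore non-standard symbols such as X, B, Z or '*'.
        counts.update(ch for ch in seq.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    return np.array([counts[a] / total for a in AMINO_ACIDS])


def pca_scores(X, n_components=2):
    """Project rows of X onto the leading principal components via SVD."""
    centered = X - X.mean(axis=0)
    _, S, Vt = np.linalg.svd(centered, full_matrices=False)
    scores = centered @ Vt[:n_components].T
    explained = S**2 / np.sum(S**2)  # fraction of variance per component
    return scores, explained[:n_components]


# Hypothetical toy "proteomes"; real input would be complete proteome files.
toy = {
    "organism_A": ["MKKLLEEAIKK", "MKEELKKIEEK"],
    "organism_B": ["MAAGGLAAPGA", "MGAALGPAAGG"],
    "organism_C": ["MSSTNQSSTNQ", "MNQSTSSNQTS"],
}

names = list(toy)
X = np.vstack([composition(toy[n]) for n in names])
scores, explained = pca_scores(X)

for name, (pc1, pc2) in zip(names, scores):
    print(f"{name}: PC1={pc1:+.3f} PC2={pc2:+.3f}")
print("explained variance fractions:", np.round(explained, 3))
```

The same layout extends to oligopeptides by counting overlapping k-residue windows instead of single residues, at the cost of a much longer vector (8,000 entries for tripeptides).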