Reconstructing history with amino acid sequences1
- 1 February 1992
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 1 (2), 191-200
- https://doi.org/10.1002/pro.5560010201
Abstract
The main goal of the protein evolutionist is the reconstruction of past events leading to the structures of contemporary proteins. The common strategy is to align amino acid sequences and make inferences about matters of common ancestry. The rate of change of amino acid sequence varies greatly from protein to protein, and this naturally affects how far back a given protein's ancestry can be traced. Happily, the rate of change of many proteins is slow enough that very ancient events can be inferred. Many mainstream metabolic enzymes, for example, are 40–50% identical in prokaryotes and eukaryotes, groups that diverged from a common ancestor more than 1.5 billion years ago. Moreover, some eukaryotic proteins like actin and tubulin change so slowly that they are seldom less than 60% identical, no matter from what source they are drawn. As it happens, prokaryotic counterparts for many eukaryotic cytoskeletal proteins are unknown. A recent exception involves the finding that a heat shock protein cognate is a relative of actin. The gene duplication that gave rise to these two proteins must have been an ancient event. The more recent invention of other proteins whose distribution is restricted to one or the other of the major kingdoms may be easier to trace. Among the factors that can confound the reconstruction of events, however, are occasional horizontal gene transfers and exon shuffling. The latter has led to a number of mosaic proteins, many of which contain various combinations of a relatively small set of modules like the epidermal growth factor domain.Keywords
Funding Information
- National Institutes of Health
This publication has 37 references indexed in Scilit:
- Crystal structure of chaperone protein PapD reveals an immunoglobulin foldNature, 1989
- Sequence of an unusually large protein implicated in regulation of myosin activity in C. elegansNature, 1989
- Origins and Evolutionary Relationships of RetrovirusesThe Quarterly Review of Biology, 1989
- A vaccine candidate from the sexual stage of human malaria that contains EGF-like domainsNature, 1988
- Intron‐dependent evolution: Preferred types of exons and intronsFEBS Letters, 1987
- Primary sequence of a dimeric bacterial haemoglobin from VitreoscillaNature, 1986
- The genealogy of some recently evolved vertebrate proteinsTrends in Biochemical Sciences, 1985
- Cassette of Eight Exons Shared by Genes for LDL Receptor and EGF PrecursorScience, 1985
- Why genes in pieces?Nature, 1978
- Construction of Phylogenetic TreesScience, 1967