Reconstructing history with amino acid sequences1

1 February 1992

journal article
research article
Published by Wiley in Protein Science

Vol. 1 (2), 191-200
https://doi.org/10.1002/pro.5560010201

Abstract

The main goal of the protein evolutionist is the reconstruction of past events leading to the structures of contemporary proteins. The common strategy is to align amino acid sequences and make inferences about matters of common ancestry. The rate of change of amino acid sequence varies greatly from protein to protein, and this naturally affects how far back a given protein's ancestry can be traced. Happily, the rate of change of many proteins is slow enough that very ancient events can be inferred. Many mainstream metabolic enzymes, for example, are 40–50% identical in prokaryotes and eukaryotes, groups that diverged from a common ancestor more than 1.5 billion years ago. Moreover, some eukaryotic proteins like actin and tubulin change so slowly that they are seldom less than 60% identical, no matter from what source they are drawn. As it happens, prokaryotic counterparts for many eukaryotic cytoskeletal proteins are unknown. A recent exception involves the finding that a heat shock protein cognate is a relative of actin. The gene duplication that gave rise to these two proteins must have been an ancient event. The more recent invention of other proteins whose distribution is restricted to one or the other of the major kingdoms may be easier to trace. Among the factors that can confound the reconstruction of events, however, are occasional horizontal gene transfers and exon shuffling. The latter has led to a number of mosaic proteins, many of which contain various combinations of a relatively small set of modules like the epidermal growth factor domain.

Keywords

Funding Information

National Institutes of Health

This publication has 37 references indexed in Scilit:

Crystal structure of chaperone protein PapD reveals an immunoglobulin fold
Nature, 1989
Sequence of an unusually large protein implicated in regulation of myosin activity in C. elegans
Nature, 1989
Origins and Evolutionary Relationships of Retroviruses
The Quarterly Review of Biology, 1989
A vaccine candidate from the sexual stage of human malaria that contains EGF-like domains
Nature, 1988
Intron‐dependent evolution: Preferred types of exons and introns
FEBS Letters, 1987
Primary sequence of a dimeric bacterial haemoglobin from Vitreoscilla
Nature, 1986
The genealogy of some recently evolved vertebrate proteins
Trends in Biochemical Sciences, 1985
Cassette of Eight Exons Shared by Genes for LDL Receptor and EGF Precursor
Science, 1985
Why genes in pieces?
Nature, 1978
Construction of Phylogenetic Trees
Science, 1967

Cited by 106 articles