Modular arrangement of proteins as inferred from analysis of homology

Open Access

1 March 1994

journal article
research article
Published by Wiley in Protein Science

Vol. 3 (3), 482-492
https://doi.org/10.1002/pro.5560030314

Abstract

The structure of many proteins consists of a combination of discrete modules that have been shuffled during evolution. Such modules can frequently be recognized from the analysis of homology. Here we present a systematic analysis of the modular organization of all sequenced proteins. To achieve this we have developed an automatic method to identify protein domains from sequence comparisons. Homologous domains can then be clustered into consistent families. The method was applied to all 21,098 nonfragment protein sequences in SWISS-PROT 21.0, which was automatically reorganized into a comprehensive protein domain database, ProDom. We have constructed multiple sequence alignments for each domain family in ProDom, from which consensus sequences were generated. These nonredundant domain consensuses are useful for fast homology searches. Domain organization in ProDom is exemplified for proteins of the phosphoenolpyruvate: sugar phosphotransferase system (PEP:PTS) and for bacterial 2-component regulators. We provide 2 examples of previously unrecognized domain arrangements discovered with the help of, ProDom.

Keywords

This publication has 42 references indexed in Scilit:

The evolution of protein domains and the organizational complexities of metazoans
Current Opinion in Genetics & Development, 1992
COMMUNICATION MODULES IN BACTERIAL SIGNALING PROTEINS
Annual Review of Genetics, 1992
One thousand families for the molecular biologist
Nature, 1992
Reconstructing history with amino acid sequences1
Protein Science, 1992
A new family of powerful multivariate statistical sequence analysis techniques
Journal of Molecular Biology, 1991
Amino acid substitution matrices from an information theoretic perspective
Journal of Molecular Biology, 1991
Basic local alignment search tool
Journal of Molecular Biology, 1990
Structure and evolution of a multidomain multiphosphoryl transfer protein: Nucleotide sequence of the fruB(HI) gene in Rhodobacter capsulatus and comparisons with homologous genes from other organisms
Journal of Molecular Biology, 1990
Efficient recognition of immunoglobulin domains from amino acid sequences using a neural network
Bioinformatics, 1990
Recognition of functional regions in primary structures using a set of property patterns
FEBS Letters, 1989

Cited by 188 articles