Superfamily active site templates
- 2 April 2004
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 55 (4), 962-976
- https://doi.org/10.1002/prot.20099
Abstract
We show that three‐dimensional signatures consisting of only a few functionally important residues can be diagnostic of membership in superfamilies of enzymes. Using the enolase superfamily as a model system, we demonstrate that such a signature, or template, can identify superfamily members in structural databases with high sensitivity and specificity. This is remarkable because superfamilies can be highly diverse, with members catalyzing many different overall reactions; the unifying principle can be a conserved partial reaction or chemical capability. Our definition of a superfamily thus hinges on the disposition of residues involved in a conserved function, rather than on fold similarity alone. A clear advantage of basing structure searches on such active site templates rather than on fold similarity is the specificity with which superfamilies with distinct functional characteristics can be identified within a large set of proteins with the same fold, such as the (β/α)8 barrels. Preliminary results are presented for an additional group of enzymes with a different fold, the haloacid dehalogenase superfamily, suggesting that this approach may be generally useful for assigning reading frames of unknown function to specific superfamilies and thereby allowing inference of some of their functional properties. Proteins 2004;9999:000–000.Keywords
This publication has 85 references indexed in Scilit:
- The evolution and structural anatomy of the small molecule metabolic pathways in Escherichia coliJournal of Molecular Biology, 2001
- Evolution of Enzymatic Activity in the Enolase Superfamily: Structure of o-Succinylbenzoate Synthase from Escherichia coli in Complex with Mg2+ and o-Succinylbenzoate,Biochemistry, 2000
- Assessing annotation transfer for genomics: quantifying the relations between protein sequence, structure and function through traditional and probabilistic scoresJournal of Molecular Biology, 2000
- The Protein Data BankNucleic Acids Research, 2000
- Recognition of spatial motifs in protein structures 1 1Edited by J. ThorntonJournal of Molecular Biology, 1999
- Method for prediction of protein function from sequence using the sequence-to-structure-to-function paradigm with application to Glutaredoxins/Thioredoxins and T 1 Ribonucleases 1 1Edited by F. CohenJournal of Molecular Biology, 1998
- Derivation of 3D coordinate templates for searching structural databases: Application to ser‐His‐Asp catalytic triads in the serine proteinases and lipasesProtein Science, 1996
- Threading a database of protein coresProteins-Structure Function and Bioinformatics, 1995
- SCOP: A structural classification of proteins database for the investigation of sequences and structuresJournal of Molecular Biology, 1995
- The role of lysine 166 in the mechanism of mandelate racemase from Pseudomonas putida: Mechanistic and crystallographic evidence for stereospecific alkylation by (R)-.alpha.-phenylglycidateBiochemistry, 1994