Predicting Enzyme Subclass by Functional Domain Composition and Pseudo Amino Acid Composition

8 April 2005

journal article
Published by American Chemical Society (ACS) in Journal of Proteome Research

Vol. 4 (3), 967-971
https://doi.org/10.1021/pr0500399

Abstract

As a continuous effort to use the sequence approach to identify enzymatic function at a deeper level, investigations are extended from the main enzyme classes (Protein Sci. 2004, 13, 2857-2863) to their subclasses. This is indispensable if we wish to understand the molecular mechanism of an enzyme at a deeper level. For each of the 6 main enzyme classes (i.e., oxidoreductase, transferase, hydrolase, lyase, isomerase, and ligase), a subclass training dataset is constructed. To reduce homologous bias, a stringent cutoff was imposed that all the entries included in the datasets have less than 40% sequence identity to each other. To catch the core feature that is intimately related to the biological function, the sample of a protein is represented by hybridizing the functional domain composition and pseudo amino acid composition. On the basis of such a hybridization representation, the FunD-PseAA predictor is established. It is demonstrated by the jackknife cross-validation tests that the overall success rate in identifying the 21 subclasses of oxidoreductases is above 86%, and the corresponding rates in identifying the subclasses of the other 5 main enzyme classes are 94-97%. The high success rates imply that the FunD-PseAA predictor may become a useful tool in bioinformatics and proteomics of the post-genomic era.

Keywords

This publication has 12 references indexed in Scilit:

Validation of qualitative models of genetic regulatory networks by model checking: analysis of the nutritional stress response in Escherichia coli
Bioinformatics, 2005
Corrigendum to “Predicting protein structural class by functional domain composition” [Biochem. Biophys. Res. Commun. 321 (2004) 1007–1009]
Biochemical and Biophysical Research Communications, 2005
Enzyme family classification by support vector machines
Proteins-Structure Function and Bioinformatics, 2004
Subcellular location prediction of apoptosis proteins
Proteins-Structure Function and Bioinformatics, 2002
Enzyme Function Less Conserved than Anticipated
Journal of Molecular Biology, 2002
Some insights into protein structural class prediction
Proteins-Structure Function and Bioinformatics, 2001
Efficient gene activation in cultured mammalian cells mediated by FLP recombinase-expressing recombinant adenovirus
Nucleic Acids Research, 2001
Prediction of protein cellular attributes using pseudo‐amino acid composition
Proteins-Structure Function and Bioinformatics, 2001
A novel approach to predicting protein structural classes in a (20–1)‐D amino acid composition space
Proteins-Structure Function and Bioinformatics, 1995
Prediction of Protein Structural Classes from Amino Acid Compositions
Published by Springer Science and Business Media LLC ,1989

Cited by 73 articles