3D‐SHOTGUN: A novel, cooperative, fold‐recognition meta‐predictor

8 April 2003

journal article
research article
Published by Wiley in Proteins-Structure Function and Bioinformatics

Vol. 51 (3), 434-441
https://doi.org/10.1002/prot.10357

Abstract

To gain a better understanding of the biological role of proteins encoded in genome sequences, knowledge of their three-dimensional (3D) structure and function is required. The computational assignment of folds is becoming an increasingly important complement to experimental structure determination. In particular, fold-recognition methods aim to predict approximate 3D models for proteins bearing no sequence similarity to any protein of known structure. However, fully automated structure-prediction methods can currently produce reliable models for only a fraction of these sequences. Using a number of semiautomated procedures, human expert predictors are often able to produce more and better predictions than automated methods. We describe a novel, fully automatic, fold-recognition meta-predictor, named 3D-SHOTGUN, which incorporates some of the strategies human predictors have successfully applied. This new method is reminiscent of the so-called cooperative algorithms of computer vision. The input to 3D-SHOTGUN consists of the top models predicted by a number of independent fold-recognition servers. The meta-predictor consists of three steps: (i) assembly of hybrid models, (ii) confidence assignment, and (iii) selection. We have applied 3D-SHOTGUN to an unbiased test set of 77 newly released protein structures sharing no sequence similarity to proteins previously released. Forty-six correct rank-1 predictions were obtained, 30 of which had scores higher than that of the first incorrect prediction, a significant improvement over the performance of all individual servers. Furthermore, the predicted hybrid models were, on average, more similar to their corresponding native structures than those produced by the individual servers. This opens the possibility of generating more accurate, full-atom homology models for proteins with no sequence similarity to proteins of known structure.
These improvements represent a step forward toward the wider applicability of fully automated structure-prediction methods at genome scales. Proteins 2003;51:434–441.
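
The evaluation criterion mentioned in the abstract (the number of correct rank-1 predictions, and how many of them score above the first incorrect prediction) can be illustrated with a minimal Python sketch. The function name, the `(score, is_correct)` data layout, and the toy scores below are assumptions made for illustration only; they are not taken from the paper, and this is not the authors' evaluation code.

```python
from typing import List, Tuple


def count_above_first_incorrect(
    predictions: List[Tuple[float, bool]]
) -> Tuple[int, int]:
    """Summarize rank-1 predictions, one per target.

    predictions: hypothetical (confidence_score, is_correct) pairs.
    Returns (total correct, correct with a score strictly higher than
    the highest-scoring incorrect prediction).
    """
    total_correct = sum(1 for _, ok in predictions if ok)
    incorrect_scores = [score for score, ok in predictions if not ok]
    if not incorrect_scores:
        # No incorrect prediction: every correct one counts.
        return total_correct, total_correct
    # The "first incorrect prediction" in a list sorted by descending
    # score is the incorrect prediction with the highest score.
    threshold = max(incorrect_scores)
    above = sum(1 for score, ok in predictions if ok and score > threshold)
    return total_correct, above


# Toy data (invented): five targets, three correct rank-1 models.
toy = [(9.1, True), (8.4, True), (7.9, False), (7.0, True), (3.2, False)]
result = count_above_first_incorrect(toy)
print(result)
```

On the toy data, three predictions are correct and two of them outscore the best incorrect prediction (7.9). Under this reading, the abstract's figures correspond to a result of (46, 30) over the 77 test targets.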
