Consistent refinement of submitted models at CASP using a knowledge‐based potential

1 June 2010

journal article
research article
Published by Wiley in Proteins-Structure Function and Bioinformatics

Vol. 78 (12), 2668-2678
https://doi.org/10.1002/prot.22781

Abstract

Protein structure refinement is an important but unsolved problem; it must be solved if we are to predict biological function that is very sensitive to structural details. Specifically, the Critical Assessment of Techniques for Protein Structure Prediction (CASP) shows that the accuracy of predictions in the comparative modeling category is often worse than that of the template on which the homology model is based. Here we describe a refinement protocol that consistently refines submitted predictions for all categories at CASP7. The protocol uses direct energy minimization of a knowledge‐based potential of mean force that is based on the interaction statistics of 167 atom types (Summa and Levitt, Proc Natl Acad Sci USA 2007; 104:3177–3182). Our protocol is thus computationally very efficient; it takes only a few minutes of CPU time to refine a typical protein model (300 residues). We observe an average structural improvement of 1% in the Global Distance Test total score (GDT_TS) for predictions that have low and medium homology to known PDB structures (GDT_TS between 50 and 80%). We also observe a marked improvement in the stereochemistry of the models. The level of improvement varies among the participants at CASP, but we see large improvements (>10% increase in GDT_TS) even for models predicted by the best-performing groups at CASP7. In addition, our protocol consistently improved the best predicted models in the refinement category at CASP7 and CASP8. These improvements in structure and stereochemistry demonstrate the usefulness of our computationally inexpensive, powerful and automatic refinement protocol. Proteins 2010.
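
The GDT_TS figures quoted in the abstract can be made concrete with a small sketch. The code below is an illustration only and is not taken from the paper: it assumes the model has already been superimposed onto the native structure and that both are given as equal-length lists of C-alpha coordinates, whereas the full GDT_TS procedure searches over many superpositions for each cutoff. The function name `gdt_ts` and the example coordinates are hypothetical.

```python
import math

# Distance cutoffs (in angstroms) conventionally used by GDT_TS.
CUTOFFS = (1.0, 2.0, 4.0, 8.0)


def gdt_ts(model_ca, native_ca):
    """Simplified GDT_TS (0-100) for two pre-superimposed C-alpha traces.

    Assumption: the superposition is fixed in advance; the real score
    optimizes the superposition separately for each cutoff.
    """
    if len(model_ca) != len(native_ca) or not native_ca:
        raise ValueError("need two non-empty traces of equal length")
    # Per-residue deviation between model and native C-alpha atoms.
    distances = [math.dist(m, n) for m, n in zip(model_ca, native_ca)]
    # Fraction of residues within each cutoff, then averaged over cutoffs.
    fractions = [
        sum(d <= cutoff for d in distances) / len(distances)
        for cutoff in CUTOFFS
    ]
    return 100.0 * sum(fractions) / len(CUTOFFS)


# Hypothetical four-residue example with deviations of 0.5, 1.5, 3.0 and 9.0 A.
native = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
model = [(0.5, 0.0, 0.0), (3.8, 1.5, 0.0), (7.6, 0.0, 3.0), (11.4, 9.0, 0.0)]
print(gdt_ts(model, native))  # 56.25
```

Under this simplified definition, a refinement step that moves one more residue to within 1 A of its native position in a 300-residue model raises the score by 100 / (4 x 300), or roughly 0.08 points, which gives a sense of scale for an average gain of 1%.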
