Contact prediction for beta and alpha-beta proteins using integer linear optimization and its impact on the first principles 3D structure prediction method ASTRO-FOLD
- 1 February 2010
- journal article
- research article
- Published by Wiley in Proteins: Structure, Function, and Bioinformatics
- Vol. 78 (8), 1825-1846
- https://doi.org/10.1002/prot.22696
Abstract
An integer linear optimization model is presented to predict residue contacts in β, α + β, and α/β proteins. The total energy of a protein is expressed as the sum of a Cα–Cα distance-dependent contact energy contribution and a hydrophobic contribution. The model selects the contacts that assign the lowest energy to the protein structure while satisfying a set of constraints that enforce physically observed topological information. A new method based on hydrophobicity is proposed to find the β‐sheet alignments, which are then used as constraints on contacts between residues of β‐sheets. The model was tested on three independent protein test sets and on CASP8 test proteins consisting of β, α + β, and α/β proteins. The average accuracy of the predictions (for contacts separated by at least six residues) was ∼61%. The average true positive and false positive distances were also calculated for each of the test sets and are 7.58 Å and 15.88 Å, respectively. Residue contact prediction can be used directly to facilitate protein tertiary structure prediction, and the proposed contact prediction model is incorporated into the first principles protein tertiary structure prediction approach, ASTRO‐FOLD. The effectiveness of the contact prediction model was further demonstrated by the improvement in the quality of the protein structure ensemble generated using the predicted residue contacts for a test set of 10 proteins. Proteins 2010.
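To illustrate the general idea of choosing contacts by minimizing a total energy under constraints, here is a minimal sketch in Python. It is not the model from the paper: the candidate pairs, their energies, and the single constraint (at most one contact per residue) are all hypothetical, and exhaustive enumeration stands in for an integer linear programming solver, which is only practical for a handful of candidates.

```python
from itertools import product

# Hypothetical toy data: candidate residue pairs (i, j), each separated by
# at least six positions, with made-up contact energies (lower is better).
energies = {
    (1, 8): -2.0,
    (1, 9): -1.5,
    (2, 8): -1.0,
    (2, 9): -2.5,
    (3, 10): 0.5,
}

# Assumed constraint for illustration only: each residue may take part
# in at most this many selected contacts.
MAX_CONTACTS_PER_RESIDUE = 1


def select_contacts(energies, max_per_residue):
    """Return (best_energy, chosen_pairs) over all feasible 0/1 assignments."""
    pairs = sorted(energies)
    # The empty selection is always feasible and has energy 0.
    best_energy, best_choice = 0.0, ()
    # Each tuple y assigns a binary decision variable to every candidate pair.
    for y in product((0, 1), repeat=len(pairs)):
        chosen = [p for p, bit in zip(pairs, y) if bit]
        # Count how many selected contacts each residue participates in.
        counts = {}
        for i, j in chosen:
            counts[i] = counts.get(i, 0) + 1
            counts[j] = counts.get(j, 0) + 1
        if any(c > max_per_residue for c in counts.values()):
            continue  # violates the per-residue constraint
        total = sum(energies[p] for p in chosen)
        if total < best_energy:
            best_energy, best_choice = total, tuple(chosen)
    return best_energy, best_choice


best_energy, best_choice = select_contacts(energies, MAX_CONTACTS_PER_RESIDUE)
print(best_energy, best_choice)
```

With these toy numbers the selection pairs residue 1 with 8 and residue 2 with 9, and leaves out the pair with positive energy. A real formulation would hand the same binary variables, linear objective, and linear constraints to an ILP solver rather than enumerating assignments.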
Funding Information
- National Science Foundation (R01 GM52032, R24 GM069736)
- U.S. Environmental Protection Agency (R 832721-010)
- U.S. Environmental Protection Agency's STAR Program (R 832721-010)