Fast and effective protein model refinement using deep graph neural networks
- 15 July 2021
- journal article
- research article
- Published by Springer Science and Business Media LLC in Nature Computational Science
- Vol. 1 (7), 462-469
- https://doi.org/10.1038/s43588-021-00098-9
Abstract
Protein model refinement is the last step applied to improve the quality of a predicted protein model. Currently, the most successful refinement methods rely on extensive conformational sampling and thus take hours or days to refine even a single protein model. Here, we propose a fast and effective model refinement method that applies graph neural networks (GNNs) to predict a refined inter-atom distance probability distribution from an initial model and then rebuilds three-dimensional models from the predicted distance distribution. Tested on the Critical Assessment of Structure Prediction refinement targets, our method has an accuracy that is comparable to those of two leading human groups (FEIG and BAKER), but runs substantially faster. Our method may refine one protein model within ~11 min on one CPU, whereas BAKER needs ~30 h on 60 CPUs and FEIG needs ~16 h on one GPU. Finally, our study shows that GNN outperforms ResNet (convolutional residual neural networks) for model refinement when very limited conformational sampling is allowed.Keywords
Funding Information
- Foundation for the National Institutes of Health (R01GM089753)
- National Science Foundation (DBI1564955)
This publication has 32 references indexed in Scilit:
- 3Drefine: an interactive web server for efficient protein structure refinementNucleic Acids Research, 2016
- Relaxation of backbone bond geometry improves protein energy landscape modelingProtein Science, 2013
- lDDT: a local superposition-free score for comparing protein structures and models using distance difference testsBioinformatics, 2013
- Physics‐based protein structure refinement through multiple molecular dynamics trajectories and structure averagingProteins, 2013
- GalaxyRefine: protein structure refinement driven by side-chain repackingNucleic Acids Research, 2013
- Improving the Physical Realism and Structural Accuracy of Protein Models by a Two-Step Atomic-Level Energy MinimizationBiophysical Journal, 2011
- A Novel Side-Chain Orientation Dependent Potential Derived from Random-Walk Reference State for Protein Fold Selection and Structure PredictionPLOS ONE, 2010
- PyRosetta: a script-based interface for implementing molecular modeling algorithms using RosettaBioinformatics, 2010
- Distance‐scaled, finite ideal‐gas reference state improves structure‐derived potentials of mean force for structure selection and stability predictionProtein Science, 2002
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresPeptide Science, 1983