Ab Initio structure prediction for Escherichia coli: towards genome-wide protein structure modeling and fold assignment
Open Access
- 30 May 2013
- journal article
- research article
- Published by Springer Science and Business Media LLC in Scientific Reports
- Vol. 3 (1), srep01895
- https://doi.org/10.1038/srep01895
Abstract
Genome-wide protein structure prediction and structure-based function annotation have been a long-term goal in molecular biology but not yet become possible due to difficulties in modeling distant-homology targets. We developed a hybrid pipeline combining ab initio folding and template-based modeling for genome-wide structure prediction applied to the Escherichia coli genome. The pipeline was tested on 43 known sequences, where QUARK-based ab initio folding simulation generated models with TM-score 17% higher than that by traditional comparative modeling methods. For 495 unknown hard sequences, 72 are predicted to have a correct fold (TM-score > 0.5) and 321 have a substantial portion of structure correctly modeled (TM-score > 0.35). 317 sequences can be reliably assigned to a SCOP fold family based on structural analogy to existing proteins in PDB. The presented results, as a case study of E. coli, represent promising progress towards genome-wide structure modeling and fold family assignment using state-of-the-art ab initio folding algorithms.This publication has 41 references indexed in Scilit:
- Toward optimal fragment generations for ab initio protein structure assemblyProteins-Structure Function and Bioinformatics, 2012
- Improving the Physical Realism and Structural Accuracy of Protein Models by a Two-Step Atomic-Level Energy MinimizationBiophysical Journal, 2011
- CASP9 assessment of free modeling target predictionsProteins-Structure Function and Bioinformatics, 2011
- I-TASSER: a unified platform for automated protein structure and function predictionNature Protocols, 2010
- How significant is a protein structure similarity with TM-score = 0.5?Bioinformatics, 2010
- Progress and challenges in protein structure predictionCurrent Opinion in Structural Biology, 2008
- Structure and Function of the Escherichia coli Protein YmgB: A Protein Critical for Biofilm Formation and Acid-resistanceJournal of Molecular Biology, 2007
- Predicting transmembrane protein topology with a hidden markov model: application to complete genomesJournal of Molecular Biology, 2001
- Comparative Protein Modelling by Satisfaction of Spatial RestraintsJournal of Molecular Biology, 1993
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresPeptide Science, 1983