Bimodal protein solubility distribution revealed by an aggregation analysis of the entire ensemble of Escherichia coli proteins
- 17 March 2009
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences of the United States of America
- Vol. 106 (11), 4201-4206
- https://doi.org/10.1073/pnas.0811922106
Abstract
Protein folding often competes with intermolecular aggregation, which in most cases irreversibly impairs protein function, as exemplified by the formation of inclusion bodies. Although it has been empirically determined that some proteins tend to aggregate, the relationship between the protein aggregation propensities and the primary sequences remains poorly understood. Here, we individually synthesized the entire ensemble of Escherichia coli proteins by using an in vitro reconstituted translation system and analyzed the aggregation propensities. Because the reconstituted translation system is chaperone-free, we could evaluate the inherent aggregation propensities of thousands of proteins in a translation-coupled manner. A histogram of the solubilities, based on data from 3,173 translated proteins, revealed a clear bimodal distribution, indicating that the aggregation propensities are not evenly distributed across a continuum. Instead, the proteins can be categorized into 2 groups, soluble and aggregation-prone proteins. The aggregation propensity is most prominently correlated with the structural classification of proteins, implying that the prediction of aggregation propensity requires structural information about the protein.