Identification of direct residue contacts in protein–protein interaction by message passing
- 6 January 2009
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences of the United States of America
- Vol. 106 (1), 67-72
- https://doi.org/10.1073/pnas.0805923106
Abstract
Understanding the molecular determinants of specificity in protein protein interaction is an outstanding challenge of postgenome biology. The availability of large protein databases generated from sequences of hundreds of bacterial genomes enables various statistical approaches to this problem. In this context covariance-based methods have been used to identify correlation between amino acid positions in interacting proteins. However, these methods have an important shortcoming, in that they cannot distinguish between directly and indirectly correlated residues. We developed a method that combines covariance analysis with global inference analysis, adopted from use in statistical physics. Applied to a set of > 2,500 representatives of the bacterial two-component signal transduction system, the combination of covariance with global inference successfully and robustly identified residue pairs that are proximal in space without resorting to ad hoc tuning parameters, both for heterointeractions between sensor kinase (SK) and response regulator (RR) proteins and for homointeractions between RR proteins. The spectacular success of this approach illustrates the effectiveness of the global inference approach in identifying direct interaction based on sequence information alone. We expect this method to be applicable soon to interaction surfaces between proteins present in only 1 copy per genome as the number of sequenced genomes continues to expand. Use of this method could significantly increase the potential targets for therapeutic intervention, shed light on the mechanism of protein-protein interaction, and establish the foundation for the accurate prediction of interacting protein partners.Keywords
This publication has 38 references indexed in Scilit:
- Co-Evolving Motions at Protein−Protein Interfaces of Two-Component Signaling Systems Identified by Covariance AnalysisBiochemistry, 2008
- Accurate prediction of protein–protein interactions from sequence alignments using a Bayesian methodMolecular Systems Biology, 2008
- Reaching for high-hanging fruit in drug discovery at protein–protein interfacesNature, 2007
- The Genomes On Line Database (GOLD) in 2007: status of genomic and metagenomic projects and their associated metadataNucleic Acids Research, 2007
- Crystal Structures of the Receiver Domain of the Response Regulator PhoP from Escherichia coli in the Absence and Presence of the Phosphoryl Analog BeryllofluorideJournal of Bacteriology, 2007
- MiST: a microbial signal transduction databaseNucleic Acids Research, 2006
- Influence of conservation on calculations of amino acid covariance in multiple sequence alignmentsProteins: Structure, Function, and Bioinformatics, 2004
- Evolutionarily conserved networks of residues mediate allosteric communication in proteinsNature Structural & Molecular Biology, 2002
- Mapping pathways of allosteric communication in GroEL by analysis of correlated mutationsProteins: Structure, Function, and Bioinformatics, 2002