Structure-based prediction of protein–protein interactions on a genome-wide scale

Open Access

30 September 2012

journal article
research article
Published by Springer Science and Business Media LLC in Nature

Vol. 490 (7421), 556-560
https://doi.org/10.1038/nature11503

Abstract

Protein–protein interactions, essential for understanding how a cell functions, are predicted using a new method that combines protein structure with other computationally and experimentally derived clues. The analysis of protein-interaction networks is essential to an understanding of the regulatory processes in a living cell. Many methods have been developed with a view to predicting protein–protein interactions (PPIs) at a genome-wide level, although the differences obtained using these approaches suggest that there are still factors unaccounted for. Barry Honig and colleagues have developed a new way of predicting PPIs that is based on the proteins' three-dimensional structures and functional data. Tests of several predictions of the new algorithm, known as PREPPI, confirm the accuracy of the results. The genome-wide identification of pairs of interacting proteins is an important step in the elucidation of cell regulatory mechanisms1,2. Much of our present knowledge derives from high-throughput techniques such as the yeast two-hybrid assay and affinity purification3, as well as from manual curation of experiments on individual systems4. A variety of computational approaches based, for example, on sequence homology, gene co-expression and phylogenetic profiles, have also been developed for the genome-wide inference of protein–protein interactions (PPIs)5,6. Yet comparative studies suggest that the development of accurate and complete repertoires of PPIs is still in its early stages7,8,9. Here we show that three-dimensional structural information can be used to predict PPIs with an accuracy and coverage that are superior to predictions based on non-structural evidence. Moreover, an algorithm, termed PrePPI, which combines structural information with other functional clues, is comparable in accuracy to high-throughput experiments, yielding over 30,000 high-confidence interactions for yeast and over 300,000 for human. Experimental tests of a number of predictions demonstrate the ability of the PrePPI algorithm to identify unexpected PPIs of considerable biological interest. The surprising effectiveness of three-dimensional structural information can be attributed to the use of homology models combined with the exploitation of both close and remote geometric relationships between proteins.

Keywords

This publication has 44 references indexed in Scilit:

Interactome Networks and Human Disease
Cell, 2011
Structural space of protein–protein interfaces is degenerate, close to complete, and highly connected
Proceedings of the National Academy of Sciences of the United States of America, 2010
A human B‐cell interactome identifies MYB and FOXM1 as master regulators of proliferation in germinal centers
Molecular Systems Biology, 2010
Nature of the protein universe
Proceedings of the National Academy of Sciences of the United States of America, 2009
An experimentally derived confidence score for binary protein-protein interactions
Nature Methods, 2008
Architectures and Functional Coverage of Protein–Protein Interfaces
Journal of Molecular Biology, 2008
Prism: Protein-Protein Interaction Prediction by Structural Matching
Published by Springer Science and Business Media LLC ,2008
Inference of Macromolecular Assemblies from Crystalline State
Journal of Molecular Biology, 2007
Large‐scale mapping of human protein–protein interactions by mass spectrometry
Molecular Systems Biology, 2007
Comparative assessment of large-scale data sets of protein–protein interactions
Nature, 2002

Cited by 668 articles