A discriminative approach for identifying domain–domain interactions from protein–protein interactions

26 October 2009

journal article
research article
Published by Wiley in Proteins-Structure Function and Bioinformatics

Vol. 78 (5), 1243-1253
https://doi.org/10.1002/prot.22643

Abstract

Protein domains are the functional and structural units of proteins, so identifying domain–domain interactions (DDIs) can provide insight into the biological functions of proteins. In this article, we propose a novel discriminative approach for predicting DDIs based on both protein–protein interactions (PPIs) and derived information on non-PPIs. Our contribution is threefold. First, we take non-PPIs into account explicitly and treat the domain combinations that can discriminate PPIs from non-PPIs as putative DDIs. Second, we formalize DDI identification as a feature selection problem that seeks a minimum set of informative features (i.e., putative DDIs) discriminating PPIs from non-PPIs; this formulation is biologically plausible and predicts DDIs in a systematic and accurate manner. Third, the proposed method considers multidomain combinations, including two-domain combinations, because multidomain cooperation may help proteins interact with each other. Numerical results on several DDI prediction benchmark data sets show that the proposed discriminative method performs comparably to other top algorithms in overall performance and outperforms other methods in precision. The PPI data sets used for DDI prediction and the prediction results can be found at http://csb.shu.edu.cn/dipd. Proteins 2010.
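
To make the feature selection framing concrete, the following is a minimal sketch and not the authors' algorithm, whose optimization procedure the abstract does not specify. It assumes a greedy set-cover heuristic as a stand-in: two-domain combinations that occur in any non-PPI are discarded as non-discriminative, and the remaining combinations are picked one at a time until every explainable PPI is covered. The function names, protein labels, and domain labels are hypothetical toy values, and multidomain combinations beyond pairs are left out for brevity.

```python
from itertools import product


def candidate_pairs(protein_a, protein_b, domains):
    """Return all two-domain combinations spanning a protein pair."""
    return {
        tuple(sorted((da, db)))
        for da, db in product(domains[protein_a], domains[protein_b])
    }


def select_putative_ddis(ppis, non_ppis, domains):
    """Greedy set cover (an assumed stand-in, not the paper's method)."""
    # A domain pair seen in a non-interacting protein pair cannot
    # discriminate PPIs from non-PPIs, so it is excluded up front.
    forbidden = set()
    for a, b in non_ppis:
        forbidden |= candidate_pairs(a, b, domains)

    # Map each admissible domain pair to the PPIs it could explain.
    coverage = {}
    for ppi in ppis:
        for feature in candidate_pairs(ppi[0], ppi[1], domains) - forbidden:
            coverage.setdefault(feature, set()).add(ppi)

    # Only PPIs with at least one admissible feature can be covered.
    uncovered = {ppi for covered in coverage.values() for ppi in covered}
    selected = []
    while uncovered:
        # Pick the feature explaining the most still-uncovered PPIs;
        # iterating in sorted order makes tie-breaking deterministic.
        best = max(sorted(coverage), key=lambda f: len(coverage[f] & uncovered))
        selected.append(best)
        uncovered -= coverage[best]
    return selected


# Hypothetical toy data: protein -> list of domain labels.
domains = {
    "P1": ["A"], "P2": ["B"], "P3": ["A", "C"],
    "P4": ["B"], "P5": ["C"], "P6": ["D"],
}
ppis = [("P1", "P2"), ("P3", "P4"), ("P3", "P6")]
non_ppis = [("P5", "P4")]  # rules out the (B, C) combination

putative = select_putative_ddis(ppis, non_ppis, domains)
print(putative)  # [('A', 'B'), ('A', 'D')]
```

In this toy case the combination (A, B) explains two PPIs at once and is chosen first, which mirrors the idea of preferring a small set of informative features. A greedy cover is only an approximation of a true minimum set.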
