PIBASE: a comprehensive database of structurally defined protein interfaces

Open Access

18 January 2005

journal article
research article
Published by Oxford University Press (OUP) in Bioinformatics

Vol. 21 (9), 1901-1907
https://doi.org/10.1093/bioinformatics/bti277

Abstract

Motivation: In recent years, the Protein Data Bank (PDB) has experienced rapid growth. To maximize the utility of the high resolution protein–protein interaction data stored in the PDB, we have developed PIBASE, a comprehensive relational database of structurally defined interfaces between pairs of protein domains. It is composed of binary interfaces extracted from structures in the PDB and the Probable Quaternary Structure server using domain assignments from the Structural Classification of Proteins and CATH fold classification systems. Results: PIBASE currently contains 158 915 interacting domain pairs between 105 061 domains from 2125 SCOP families. A diverse set of geometric, physiochemical and topologic properties are calculated for each complex, its domains, interfaces and binding sites. A subset of the interface properties are used to remove interface redundancy within PDB entries, resulting in 20 912 distinct domain–domain interfaces. The complexes are grouped into 989 topological classes based on their patterns of domain–domain contacts. The binary interfaces and their corresponding binding sites are categorized into 18 755 and 30 975 topological classes, respectively, based on the topology of secondary structure elements. The utility of the database is illustrated by outlining several current applications. Availability: The database is accessible via the world wide web at http://salilab.org/pibase Contact:sali@salilab.org Supplementary information:http://salilab.org/pibase/suppinfo.html

Keywords

This publication has 45 references indexed in Scilit:

BIND: the Biomolecular Interaction Network Database
Nucleic Acids Research, 2003
Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry
Nature, 2002
Functional organization of the yeast proteome by systematic analysis of protein complexes
Nature, 2002
A comprehensive two-hybrid analysis to explore the yeast protein interactome
Proceedings of the National Academy of Sciences of the United States of America, 2001
The Protein Data Bank
Nucleic Acids Research, 2000
CATH – a hierarchic classification of protein domain structures
Structure, 1997
Comparative Protein Modelling by Satisfaction of Spatial Restraints
Journal of Molecular Biology, 1993
A novel genetic system to detect protein–protein interactions
Nature, 1989
Surface, subunit interfaces and interior of oligomeric proteins
Journal of Molecular Biology, 1988
Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features
Peptide Science, 1983

Cited by 158 articles