The PDBbind Database: Methodologies and Updates

Abstract
We have developed the PDBbind database to provide a comprehensive collection of binding affinities for the protein−ligand complexes in the Protein Data Bank (PDB). This paper gives a full description of the latest version, i.e., version 2003, which is an update to our recently reported work. Out of 23 790 entries in the PDB release No.107 (January 2004), 5897 entries were identified as protein−ligand complexes that meet our definition. Experimentally determined binding affinities (Kd, Ki, and IC50) for 1622 of these were retrieved from the references associated with these complexes. A total of 900 complexes were selected to form a “refined set”, which is of particular value as a standard data set for docking and scoring studies. All of the final data, including binding affinity data, reference citations, and processed structural files, have been incorporated into the PDBbind database accessible on-line at http:// www.pdbbind.org/.