Protein Information Resource: a community resource for expert annotation of protein data

Open Access

1 January 2001

journal article
research article
Published by Oxford University Press (OUP) in Nucleic Acids Research

Vol. 29 (1), 29-32
https://doi.org/10.1093/nar/29.1.29

Abstract

The Protein Information Resource, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the most comprehensive and expertly annotated protein sequence database in the public domain, the PIR-International Protein Sequence Database. To provide timely and high quality annotation and promote database interoperability, the PIR-International employs rule-based and classification-driven procedures based on controlled vocabulary and standard nomenclature and includes status tags to distinguish experimentally determined from predicted protein features. The database contains about 200 000 non-redundant protein sequences, which are classified into families and superfamilies and their domains and motifs identified. Entries are extensively cross-referenced to other sequence, classification, genome, structure and activity databases. The PIR web site features search engines that use sequence similarity and database annotation to facilitate the analysis and functional identification of proteins. The PIR-International databases and search tools are accessible on the PIR web site at http://pir.georgetown.edu/ and at the MIPS web site at http://www.mips.biochem.mpg.de. The PIR-International Protein Sequence Database and other files are also available by FTP.

Keywords

This publication has 13 references indexed in Scilit:

iProClass: an integrated, comprehensive and annotated protein classification database
Nucleic Acids Research, 2001
The RESID Database of protein structure modifications and the NRL-3D Sequence-Structure Database
Nucleic Acids Research, 2001
PIR: a new resource for bioinformatics
Bioinformatics, 2000
ProClass protein family database
Nucleic Acids Research, 2000
The COG database: a tool for genome-scale analysis of protein functions and evolution
Nucleic Acids Research, 2000
The Pfam Protein Families Database
Nucleic Acids Research, 2000
PIR-ALN: a database of protein sequence alignments.
Bioinformatics, 1999
Superfamily classification in PIR-international protein sequence database
Methods in enzymology, 1996
Maximum Discrimination Hidden Markov Models of Sequence Consensus
Journal of Computational Biology, 1995
CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice
Nucleic Acids Research, 1994

Cited by 60 articles