Protein Information Resource: a community resource for expert annotation of protein data
Open Access
- 1 January 2001
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 29 (1), 29-32
- https://doi.org/10.1093/nar/29.1.29
Abstract
The Protein Information Resource, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the most comprehensive and expertly annotated protein sequence database in the public domain, the PIR-International Protein Sequence Database. To provide timely and high quality annotation and promote database interoperability, the PIR-International employs rule-based and classification-driven procedures based on controlled vocabulary and standard nomenclature and includes status tags to distinguish experimentally determined from predicted protein features. The database contains about 200 000 non-redundant protein sequences, which are classified into families and superfamilies and their domains and motifs identified. Entries are extensively cross-referenced to other sequence, classification, genome, structure and activity databases. The PIR web site features search engines that use sequence similarity and database annotation to facilitate the analysis and functional identification of proteins. The PIR-International databases and search tools are accessible on the PIR web site at http://pir.georgetown.edu/ and at the MIPS web site at http://www.mips.biochem.mpg.de. The PIR-International Protein Sequence Database and other files are also available by FTP.Keywords
This publication has 13 references indexed in Scilit:
- iProClass: an integrated, comprehensive and annotated protein classification databaseNucleic Acids Research, 2001
- The RESID Database of protein structure modifications and the NRL-3D Sequence-Structure DatabaseNucleic Acids Research, 2001
- PIR: a new resource for bioinformaticsBioinformatics, 2000
- ProClass protein family databaseNucleic Acids Research, 2000
- The COG database: a tool for genome-scale analysis of protein functions and evolutionNucleic Acids Research, 2000
- The Pfam Protein Families DatabaseNucleic Acids Research, 2000
- PIR-ALN: a database of protein sequence alignments.Bioinformatics, 1999
- Superfamily classification in PIR-international protein sequence databaseMethods in enzymology, 1996
- Maximum Discrimination Hidden Markov Models of Sequence ConsensusJournal of Computational Biology, 1995
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994