The UniProt-GO Annotation database in 2011
Open Access
- 26 November 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 40 (D1), D565-D570
- https://doi.org/10.1093/nar/gkr1048
Abstract
The GO annotation dataset provided by the UniProt Consortium (GOA: http://www.ebi.ac.uk/GOA) is a comprehensive set of evidenced-based associations between terms from the Gene Ontology resource and UniProtKB proteins. Currently supplying over 100 million annotations to 11 million proteins in more than 360 000 taxa, this resource has increased 2-fold over the last 2 years and has benefited from a wealth of checks to improve annotation correctness and consistency as well as now supplying a greater information content enabled by GO Consortium annotation format developments. Detailed, manual GO annotations obtained from the curation of peer-reviewed papers are directly contributed by all UniProt curators and supplemented with manual and electronic annotations from 36 model organism and domain-focused scientific resources. The inclusion of high-quality, automatic annotation predictions ensures the UniProt GO annotation dataset supplies functional information to a wide range of proteins, including those from poorly characterized, non-model organism species. UniProt GO annotations are freely available in a range of formats accessible by both file downloads and web-based views. In addition, the introduction of a new, normalized file format in 2010 has made for easier handling of the complete UniProt-GOA data set.This publication has 25 references indexed in Scilit:
- Reorganizing the protein space at the Universal Protein Resource (UniProt)Nucleic Acids Research, 2011
- The Gene Ontology: enhancements for 2011Nucleic Acids Research, 2011
- EcoCyc: a comprehensive database of Escherichia coli biologyNucleic Acids Research, 2010
- dictyBase update 2011: web 2.0 functionality and the initial steps towards a genome portal for the AmoebozoaNucleic Acids Research, 2010
- The Plant-Associated Microbe Gene Ontology (PAMGO) Consortium: community development of new Gene Ontology terms describing biological processes involved in microbe-host interactionsBMC Microbiology, 2009
- The GOA database in 2009--an integrated Gene Ontology Annotation resourceNucleic Acids Research, 2009
- InterPro: the integrative protein signature databaseNucleic Acids Research, 2008
- EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebratesGenome Research, 2008
- The Integr8 project--a resource for genomic and proteomic data.2005
- The International Protein Index: An integrated database for proteomics experimentsProteomics, 2004