GenBank
Top Cited Papers
Open Access
- 20 November 2015
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 44 (D1), D67-D72
- https://doi.org/10.1093/nar/gkv1276
Abstract
GenBank® (www.ncbi.nlm.nih.gov/genbank/) is a comprehensive database that contains publicly available nucleotide sequences for over 340 000 formally described species. Recent developments include a new starting page for submitters, a shift toward using accession.version identifiers rather than GI numbers, a wizard for submitting 16S rRNA sequences, and an Identical Protein Report to address growing issues of data redundancy. GenBank organizes the sequence data received from individual laboratories and large-scale sequencing projects into 18 divisions, and GenBank staff assign unique accession.version identifiers upon data receipt. Most submitters use the web-based BankIt or standalone Sequin programs. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the nuccore, nucest, and nucgss databases of the Entrez retrieval system, which integrates these records with a variety of other data including taxonomy nodes, genomes, protein structures, and biomedical journal literature in PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP.Keywords
This publication has 14 references indexed in Scilit:
- Content discovery and retrieval services at the European Nucleotide ArchiveNucleic Acids Research, 2014
- RefSeq microbial genomes database: new representation and annotation strategyNucleic Acids Research, 2013
- DDBJ progress report: a new submission system for leading to a correct annotationNucleic Acids Research, 2013
- BLAST: a more efficient report with usability improvementsNucleic Acids Research, 2013
- The International Nucleotide Sequence Database CollaborationNucleic Acids Research, 2012
- The NCBI Taxonomy databaseNucleic Acids Research, 2011
- BioProject and BioSample databases at NCBI: facilitating capture and organization of metadataNucleic Acids Research, 2011
- The sequence read archive: explosive growth of sequencing dataNucleic Acids Research, 2011
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997