DFAST: a flexible prokaryotic genome annotation pipeline for faster genome publication
Top Cited Papers
Open Access
- 2 November 2017
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 34 (6), 1037-1039
- https://doi.org/10.1093/bioinformatics/btx713
Abstract
We developed a prokaryotic genome annotation pipeline, DFAST, that also supports genome submission to public sequence databases. DFAST was originally started as an on-line annotation server, and to date, over 7000 jobs have been processed since its first launch in 2016. Here, we present a newly implemented background annotation engine for DFAST, which is also available as a standalone command-line program. The new engine can annotate a typical-sized bacterial genome within 10 min, with rich information such as pseudogenes, translation exceptions and orthologous gene assignment between given reference genomes. In addition, the modular framework of DFAST allows users to customize the annotation workflow easily and will also facilitate extensions for new functions and incorporation of new tools in the future. The software is implemented in Python 3 and runs in both Python 2.7 and 3.4—on Macintosh and Linux systems. It is freely available at https://github.com/nigyta/dfast_core/under the GPLv3 license with external binaries bundled in the software distribution. An on-line version is also available at https://dfast.nig.ac.jp/. yn@nig.ac.jp Supplementary data are available at Bioinformatics online.Keywords
Funding Information
- JSPS
This publication has 9 references indexed in Scilit:
- CDD/SPARCLE: functional classification of proteins via subfamily domain architecturesNucleic Acids Research, 2016
- DNA Data Bank of JapanNucleic Acids Research, 2016
- NCBI prokaryotic genome annotation pipelineNucleic Acids Research, 2016
- DFAST and DAGA: web-based integrated genome annotation tools and resourcesBioscience of Microbiota, Food and Health, 2016
- The International Nucleotide Sequence Database CollaborationNucleic Acids Research, 2015
- GHOSTX: An Improved Sequence Homology Search Algorithm Using a Query Suffix Array and a Database Suffix ArrayPLOS ONE, 2014
- Prokka: rapid prokaryotic genome annotationBioinformatics, 2014
- TIGRFAMs and Genome Properties in 2013Nucleic Acids Research, 2012
- Adaptive seeds tame genomic sequence comparisonGenome Research, 2011