MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph
Top Cited Papers
Open Access
- 20 January 2015
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 31 (10), 1674-1676
- https://doi.org/10.1093/bioinformatics/btv033
Abstract
Summary: MEGAHIT is a NGS de novo assembler for assembling large and complex metagenomics data in a time- and cost-efficient manner. It finished assembling a soil metagenomics dataset with 252 Gbps in 44.1 and 99.6 h on a single computing node with and without a graphics processing unit, respectively. MEGAHIT assembles the data as a whole, i.e. no pre-processing like partitioning and normalization was needed. When compared with previous methods on assembling the soil data, MEGAHIT generated a three-time larger assembly, with longer contig N50 and average contig length; furthermore, 55.8% of the reads were aligned to the assembly, giving a fourfold improvement. Availability and implementation: The source code of MEGAHIT is freely available at https://github.com/voutcn/megahit under GPLv3 license. Contact:rb@l3-bioinfo.com or twlam@cs.hku.hk Supplementary information: Supplementary data are available at Bioinformatics online.Other Versions
This publication has 9 references indexed in Scilit:
- Tackling soil diversity with the assembly of large, complex metagenomesProceedings of the National Academy of Sciences of the United States of America, 2014
- QUAST: quality assessment tool for genome assembliesBioinformatics, 2013
- SOAPdenovo2: an empirically improved memory-efficient short-read de novo assemblerGigaScience, 2012
- SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell SequencingJournal of Computational Biology, 2012
- IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depthBioinformatics, 2012
- Fast gapped-read alignment with Bowtie 2Nature Methods, 2012
- Space-Efficient and Exact de Bruijn Graph Representation Based on a Bloom FilterLecture Notes in Computer Science, 2012
- Succinct de Bruijn GraphsLecture Notes in Computer Science, 2012
- A human gut microbial gene catalogue established by metagenomic sequencingNature, 2010