Software for pre-processing Illumina next-generation sequencing short read sequences
Open Access
- 3 May 2014
- journal article
- Published by Springer Science and Business Media LLC in Source Code for Biology and Medicine
- Vol. 9 (1), 8
- https://doi.org/10.1186/1751-0473-9-8
Abstract
When compared to Sanger sequencing technology, next-generation sequencing (NGS) technologies are hindered by shorter sequence read length, higher base-call error rate, non-uniform coverage, and platform-specific sequencing artifacts. These characteristics lower the quality of their downstream analyses, e.g. de novo and reference-based assembly, by introducing sequencing artifacts and errors that may contribute to incorrect interpretation of data. Although many tools have been developed for quality control and pre-processing of NGS data, none of them provide flexible and comprehensive trimming options in conjunction with parallel processing to expedite pre-processing of large NGS datasets.Keywords
This publication has 30 references indexed in Scilit:
- Trimmomatic: a flexible trimmer for Illumina sequence dataBioinformatics, 2014
- GAGE: A critical evaluation of genome assemblies and assembly algorithmsGenome Research, 2011
- Assemblathon 1: A competitive assessment of de novo short read assembly methodsGenome Research, 2011
- Sense from sequence reads: methods for alignment and assemblyNature Methods, 2009
- Evaluation of next generation sequencing platforms for population targeted sequencing studiesGenome Biology, 2009
- Sequencing and de novo analysis of a coral larval transcriptome using 454 GSFlxBMC Genomics, 2009
- Mapping and quantifying mammalian transcriptomes by RNA-SeqNature Methods, 2008
- Velvet: Algorithms for de novo short read assembly using de Bruijn graphsGenome Research, 2008
- Genome assembly forensics: finding the elusive mis-assemblyGenome Biology, 2008
- High-Resolution Profiling of Histone Methylations in the Human GenomeCell, 2007