TagDust2: a generic method to extract reads from sequencing data
Open Access
- 28 January 2015
- journal article
- research article
- Published by Springer Science and Business Media LLC in BMC Bioinformatics
- Vol. 16 (1), 24
- https://doi.org/10.1186/s12859-015-0454-y
Abstract
Arguably the most basic step in the analysis of next generation sequencing data (NGS) involves the extraction of mappable reads from the raw reads produced by sequencing instruments. The presence of barcodes, adaptors and artifacts subject to sequencing errors makes this step non-trivial.
This publication has 18 references indexed in Scilit:
- Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics, 2014
- An integrated encyclopedia of DNA elements in the human genome. Nature, 2012
- Not All Sequence Tags Are Created Equal: Designing and Validating Sequence Identification Tags Robust to Indels. PLOS ONE, 2012
- From Sequencer to Supercomputer: An Automatic Pipeline for Managing and Processing Next Generation Sequencing Data. 2012
- Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform. Nucleic Acids Research, 2011
- Counting absolute numbers of molecules using unique molecular identifiers. Nature Methods, 2011
- Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.Journal, 2011
- TagDust—a program to eliminate artifacts from next generation sequencing data. Bioinformatics, 2009
- Identification of genetic variants using bar-coded multiplexed sequencing. Nature Methods, 2008
- Galaxy: A platform for interactive large-scale genome analysis. Genome Research, 2005