TagDust2: a generic method to extract reads from sequencing data

Open Access

28 January 2015

journal article
research article
Published by Springer Science and Business Media LLC in BMC Bioinformatics

Vol. 16 (1), 24
https://doi.org/10.1186/s12859-015-0454-y

Abstract

Arguably the most basic step in the analysis of next generation sequencing data (NGS) involves the extraction of mappable reads from the raw reads produced by sequencing instruments. The presence of barcodes, adaptors and artifacts subject to sequencing errors makes this step non-trivial.

Keywords

This publication has 18 references indexed in Scilit:

Trimmomatic: a flexible trimmer for Illumina sequence data
Bioinformatics, 2014
An integrated encyclopedia of DNA elements in the human genome
Nature, 2012
Not All Sequence Tags Are Created Equal: Designing and Validating Sequence Identification Tags Robust to Indels
PLOS ONE, 2012
From Sequencer to Supercomputer: An Automatic Pipeline for Managing and Processing Next Generation Sequencing Data
2012
Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform
Nucleic Acids Research, 2011
Counting absolute numbers of molecules using unique molecular identifiers
Nature Methods, 2011
Cutadapt removes adapter sequences from high-throughput sequencing reads
EMBnet.Journal, 2011
TagDust—a program to eliminate artifacts from next generation sequencing data
Bioinformatics, 2009
Identification of genetic variants using bar-coded multiplexed sequencing
Nature Methods, 2008
Galaxy: A platform for interactive large-scale genome analysis
Genome Research, 2005

Cited by 57 articles