ATACseqQC: a Bioconductor package for post-alignment quality assessment of ATAC-seq data
Open Access
- 1 March 2018
- journal article
- research article
- Published by Springer Science and Business Media LLC in BMC Genomics
- Vol. 19 (1), 169
- https://doi.org/10.1186/s12864-018-4559-3
Abstract
Background: ATAC-seq (Assays for Transposase-Accessible Chromatin using sequencing) is a recently developed technique for genome-wide analysis of chromatin accessibility. Compared to earlier methods for assaying chromatin accessibility, ATAC-seq is faster and easier to perform, does not require cross-linking, has higher signal to noise ratio, and can be performed on small cell numbers. However, to ensure a successful ATAC-seq experiment, step-by-step quality assurance processes, including both wet lab quality control and in silico quality assessment, are essential. While several tools have been developed or adopted for assessing read quality, identifying nucleosome occupancy and accessible regions from ATAC-seq data, none of the tools provide a comprehensive set of functionalities for preprocessing and quality assessment of aligned ATAC-seq datasets. Results: We have developed a Bioconductor package, ATACseqQC, for easily generating various diagnostic plots to help researchers quickly assess the quality of their ATAC-seq data. In addition, this package contains functions to preprocess aligned ATAC-seq data for subsequent peak calling. Here we demonstrate the utilities of our package using 25 publicly available ATAC-seq datasets from four studies. We also provide guidelines on what the diagnostic plots should look like for an ideal ATAC-seq dataset. Conclusions: This software package has been used successfully for preprocessing and assessing several in-house and public ATAC-seq datasets. Diagnostic plots generated by this package will facilitate the quality assessment of ATAC-seq data, and help researchers to evaluate their own ATAC-seq experiments as well as select high-quality ATAC-seq datasets from public repositories such as GEO to avoid generating hypotheses or drawing conclusions from low-quality ATAC-seq experiments.Keywords
This publication has 37 references indexed in Scilit:
- SRAdb: query and use public next-generation sequencing data from within RBMC Bioinformatics, 2013
- ChIP-seq guidelines and practices of the ENCODE and modENCODE consortiaGenome Research, 2012
- Fast gapped-read alignment with Bowtie 2Nature Methods, 2012
- Nucleosome distribution and linker DNA: connecting nuclear function to dynamic chromatin structureThis paper is one of a selection of papers published in a Special Issue entitled 31st Annual International Asilomar Chromatin and Chromosomes Conference, and has undergone the Journal’s usual peer review process.Biochemistry and Cell Biology, 2011
- Integrative genomics viewerNature Biotechnology, 2011
- ChIPpeakAnno: a Bioconductor package to annotate ChIP-seq and ChIP-chip dataBMC Bioinformatics, 2010
- DNase-seq: A High-Resolution Technique for Mapping Active Gene Regulatory Elements across the Genome from Mammalian CellsCold Spring Harbor Protocols, 2010
- The Sequence Alignment/Map format and SAMtoolsBioinformatics, 2009
- Fast and accurate short read alignment with Burrows–Wheeler transformBioinformatics, 2009
- Dynamic Regulation of Nucleosome Positioning in the Human GenomeCell, 2008