Predicting the molecular complexity of sequencing libraries

Open Access

24 February 2013

journal article
research article
Published by Springer Science and Business Media LLC in Nature Methods

Vol. 10 (4), 325-327
https://doi.org/10.1038/nmeth.2375

Abstract

A statistical method and software yields accurate predictions of sequencing library complexity on the basis of initial shallow sequencing surveys, allowing robust estimates of how deep to sequence for adequate coverage. Predicting the molecular complexity of a genomic sequencing library is a critical but difficult problem in modern sequencing applications. Methods to determine how deeply to sequence to achieve complete coverage or to predict the benefits of additional sequencing are lacking. We introduce an empirical Bayesian method to accurately characterize the molecular complexity of a DNA sample for almost any sequencing application on the basis of limited preliminary sequencing.

Keywords

This publication has 14 references indexed in Scilit:

Systematic evaluation of factors influencing ChIP-seq fidelity
Nature Methods, 2012
Counting absolute numbers of molecules using unique molecular identifiers
Nature Methods, 2011
The DNA-Binding Protein CTCF Limits Proximal Vκ Recombination and Restricts κ Enhancer Interactions to the Immunoglobulin κ Light Chain Locus
Immunity, 2011
Sperm Methylation Profiles Reveal Features of Epigenetic Inheritance and Evolution in Primates
Cell, 2011
Hotspots of aberrant epigenomic reprogramming in human induced pluripotent stem cells
Nature, 2011
Estimating the number of classes
The Annals of Statistics, 2007
The Classical Moment Problem as a Self-Adjoint Finite Difference Operator
Advances in Mathematics, 1998
Genomic mapping by fingerprinting random clones: A mathematical analysis
Genomics, 1988
THE NUMBER OF NEW SPECIES, AND THE INCREASE IN POPULATION COVERAGE, WHEN A SAMPLE IS INCREASED
Biometrika, 1956
The Relation Between the Number of Species and the Number of Individuals in a Random Sample of an Animal Population
Journal of Animal Ecology, 1943

Cited by 307 articles