Predicting the molecular complexity of sequencing libraries
Open Access
- 24 February 2013
- journal article
- research article
- Published by Springer Science and Business Media LLC in Nature Methods
- Vol. 10 (4), 325-327
- https://doi.org/10.1038/nmeth.2375
Abstract
A statistical method and software yields accurate predictions of sequencing library complexity on the basis of initial shallow sequencing surveys, allowing robust estimates of how deep to sequence for adequate coverage. Predicting the molecular complexity of a genomic sequencing library is a critical but difficult problem in modern sequencing applications. Methods to determine how deeply to sequence to achieve complete coverage or to predict the benefits of additional sequencing are lacking. We introduce an empirical Bayesian method to accurately characterize the molecular complexity of a DNA sample for almost any sequencing application on the basis of limited preliminary sequencing.Keywords
This publication has 14 references indexed in Scilit:
- Systematic evaluation of factors influencing ChIP-seq fidelityNature Methods, 2012
- Counting absolute numbers of molecules using unique molecular identifiersNature Methods, 2011
- The DNA-Binding Protein CTCF Limits Proximal Vκ Recombination and Restricts κ Enhancer Interactions to the Immunoglobulin κ Light Chain LocusImmunity, 2011
- Sperm Methylation Profiles Reveal Features of Epigenetic Inheritance and Evolution in PrimatesCell, 2011
- Hotspots of aberrant epigenomic reprogramming in human induced pluripotent stem cellsNature, 2011
- Estimating the number of classesThe Annals of Statistics, 2007
- The Classical Moment Problem as a Self-Adjoint Finite Difference OperatorAdvances in Mathematics, 1998
- Genomic mapping by fingerprinting random clones: A mathematical analysisGenomics, 1988
- THE NUMBER OF NEW SPECIES, AND THE INCREASE IN POPULATION COVERAGE, WHEN A SAMPLE IS INCREASEDBiometrika, 1956
- The Relation Between the Number of Species and the Number of Individuals in a Random Sample of an Animal PopulationJournal of Animal Ecology, 1943