High-accuracy long-read amplicon sequences using unique molecular identifiers with Nanopore or PacBio sequencing
Top Cited Papers
- 1 February 2021
- journal article
- research article
- Published by Springer Science and Business Media LLC in Nature Methods
- Vol. 18 (2), 165-+
- https://doi.org/10.1038/s41592-020-01041-y
Abstract
High-throughput amplicon sequencing of large genomic regions remains challenging for short-read technologies. Here, we report a high-throughput amplicon sequencing approach combining unique molecular identifiers (UMIs) with Oxford Nanopore Technologies (ONT) or Pacific Biosciences circular consensus sequencing, yielding high-accuracy single-molecule consensus sequences of large genomic regions. We applied our approach to sequence ribosomal RNA operon amplicons (similar to 4,500 bp) and genomic sequences (>10,000 bp) of reference microbial communities in which we observed a chimera rate <0.02%. To reach a mean UMI consensus error rate <0.01%, a UMI read coverage of 15x (ONT R10.3), 25x (ONT R9.4.1) and 3x (Pacific Biosciences circular consensus sequencing) is needed, which provides a mean error rate of 0.0042%, 0.0041% and 0.0007%, respectively.Funding Information
- Villum Fonden (15510)
- Poul Due Jensen Foundation / Grundfos foundation: Grant reference “Microflora Danica”.
- Genome British Columbia (SIP011)
- Natural Sciences and Engineering Research Council of Canada
This publication has 61 references indexed in Scilit:
- Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studiesNucleic Acids Research, 2012
- DECIPHER, a Search-Based Approach to Chimera Identification for 16S rRNA SequencesApplied and Environmental Microbiology, 2012
- An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaeaThe ISME Journal, 2011
- A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing dataBioinformatics, 2011
- Counting individual DNA molecules by the stochastic attachment of diverse labelsProceedings of the National Academy of Sciences of the United States of America, 2011
- Search and clustering orders of magnitude faster than BLASTBioinformatics, 2010
- Global patterns of 16S rRNA diversity at a depth of millions of sequences per sampleProceedings of the National Academy of Sciences of the United States of America, 2010
- Parallel, tag-directed assembly of locally derived short sequence readsNature Methods, 2010
- Subclonal phylogenetic structures in cancer revealed by ultra-deep sequencingProceedings of the National Academy of Sciences of the United States of America, 2008
- RNAmmer: consistent and rapid annotation of ribosomal RNA genesNucleic Acids Research, 2007