High-accuracy long-read amplicon sequences using unique molecular identifiers with Nanopore or PacBio sequencing

1 February 2021

journal article
research article
Published by Springer Science and Business Media LLC in Nature Methods

Vol. 18 (2), 165-+
https://doi.org/10.1038/s41592-020-01041-y

Abstract

High-throughput amplicon sequencing of large genomic regions remains challenging for short-read technologies. Here, we report a high-throughput amplicon sequencing approach combining unique molecular identifiers (UMIs) with Oxford Nanopore Technologies (ONT) or Pacific Biosciences circular consensus sequencing, yielding high-accuracy single-molecule consensus sequences of large genomic regions. We applied our approach to sequence ribosomal RNA operon amplicons (~4,500 bp) and genomic sequences (>10,000 bp) of reference microbial communities in which we observed a chimera rate <0.02%. To reach a mean UMI consensus error rate <0.01%, a UMI read coverage of 15x (ONT R10.3), 25x (ONT R9.4.1) and 3x (Pacific Biosciences circular consensus sequencing) is needed, which provides a mean error rate of 0.0042%, 0.0041% and 0.0007%, respectively.
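
To put the error rates quoted in the abstract into more familiar units, the following minimal sketch converts them to Phred-scaled quality values and to the expected number of errors in one rRNA operon amplicon. The coverages and error rates are the ones stated above; the amplicon length of 4,500 bp is the approximate figure from the abstract, and the function names and the assumption that errors are spread uniformly along the sequence are illustrative choices, not part of the published method.

```python
import math

# Figures quoted in the abstract:
# platform -> (UMI read coverage, mean consensus error rate in percent)
REPORTED = {
    "ONT R10.3": (15, 0.0042),
    "ONT R9.4.1": (25, 0.0041),
    "PacBio CCS": (3, 0.0007),
}

# Approximate rRNA operon amplicon length from the abstract (assumed exact here).
AMPLICON_LENGTH_BP = 4500


def phred(error_percent):
    """Phred-scaled quality, Q = -10 * log10(p), for an error rate in percent."""
    return -10 * math.log10(error_percent / 100)


def expected_errors(error_percent, length_bp=AMPLICON_LENGTH_BP):
    """Expected errors per sequence, assuming a uniform per-base error rate."""
    return error_percent / 100 * length_bp


def summarize():
    """Return coverage, Phred quality and expected errors for each platform."""
    return {
        platform: {
            "coverage": coverage,
            "phred": phred(err),
            "expected_errors": expected_errors(err),
        }
        for platform, (coverage, err) in REPORTED.items()
    }


if __name__ == "__main__":
    for platform, row in summarize().items():
        print(
            f"{platform}: {row['coverage']}x coverage, "
            f"Q{row['phred']:.1f}, "
            f"{row['expected_errors']:.3f} expected errors per amplicon"
        )
```

Under these assumptions, the 0.01% target corresponds to Q40, and all three reported rates fall well below one expected error per 4,500 bp amplicon.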

Funding Information

Villum Fonden (15510)
Poul Due Jensen Foundation / Grundfos Foundation (grant reference "Microflora Danica")
Genome British Columbia (SIP011)
Natural Sciences and Engineering Research Council of Canada

Cited by 204 articles