Evaluation of NGS-based approaches for SARS-CoV-2 whole genome characterisation
Open Access
- 1 July 2020
- journal article
- research article
- Published by Oxford University Press (OUP) in Virus Evolution
- Vol. 6 (2), veaa075
- https://doi.org/10.1093/ve/veaa075
Abstract
Since the beginning of the COVID-19 outbreak, SARS-CoV-2 whole-genome sequencing (WGS) has been performed at an unprecedented rate worldwide using very diverse next-generation sequencing (NGS) methods. Herein, we compare the performance of four NGS-based approaches for SARS-CoV-2 WGS. Twenty-four clinical respiratory samples with a wide range of Ct values (from 10.7 to 33.9) were sequenced with four methods. Three used Illumina sequencing: an in-house metagenomic NGS (mNGS) protocol and two newly commercialised kits, namely a hybridisation capture method developed by Illumina (DNA Prep with Enrichment kit and Respiratory Virus Oligo Panel, RVOP) and an amplicon sequencing method developed by Paragon Genomics (CleanPlex SARS-CoV-2 kit). We also evaluated the widely used amplicon sequencing protocol developed by the ARTIC Network, combined with Oxford Nanopore Technologies (ONT) sequencing. All four methods yielded near-complete genomes (>99 per cent) for samples with high viral loads (n = 8), with mNGS and RVOP producing the most complete genomes. For mid viral loads (Ct 20–25), amplicon-based enrichment methods led to genome coverage >99 per cent for all samples, while 1/8 samples sequenced with RVOP and 2/8 samples sequenced with mNGS had a genome coverage below 99 per cent. For low viral loads (Ct ≥25), amplicon-based enrichment methods were the most sensitive techniques. All methods were highly concordant in terms of identity of the complete consensus sequences. Just one mismatch in three samples was observed with CleanPlex vs the other methods, due to the dedicated bioinformatics pipeline setting a high threshold for calling SNPs relative to the reference sequence. Importantly, all methods correctly identified a newly observed 34-nt deletion in ORF6, although this required specific bioinformatic validation for RVOP.
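To illustrate how a stricter SNP-calling threshold can lead to a consensus mismatch between pipelines, the following is a minimal Python sketch. The function name `call_consensus_base`, the `min_alt_fraction` parameter and the example thresholds (0.5 and 0.9) are hypothetical and chosen only for illustration; the abstract does not state the thresholds used by any of the pipelines.

```python
from collections import Counter


def call_consensus_base(ref_base, read_bases, min_alt_fraction):
    """Return the consensus base at a single genome position.

    A non-reference base is called only when it is the most frequent
    base and its fraction among the reads reaches min_alt_fraction
    (a hypothetical parameter, not taken from the study).
    """
    if not read_bases:
        return "N"  # no reads: the position cannot be called
    counts = Counter(read_bases)
    base, count = counts.most_common(1)[0]
    if base != ref_base and count / len(read_bases) >= min_alt_fraction:
        return base
    return ref_base


# Same reads, two thresholds: 8 of 10 reads support T, reference is C.
reads = list("TTTTTTTTCC")
lenient = call_consensus_base("C", reads, min_alt_fraction=0.5)
strict = call_consensus_base("C", reads, min_alt_fraction=0.9)
```

With these assumed values, the lenient setting calls the variant while the strict one keeps the reference base, which is the kind of discrepancy a high calling threshold could produce.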
Finally, as a major warning for targeted techniques, a loss of coverage in any given region of the genome should alert to a potential rearrangement or a SNP in primer-annealing or probe-hybridizing regions and would require further validation using unbiased metagenomic sequencing.
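As a rough illustration of the two coverage checks discussed in the abstract (the fraction of the genome covered, and localised coverage loss that may warrant follow-up), here is a minimal Python sketch operating on a list of per-base read depths. The function names and the `min_depth=10` cut-off are assumptions for illustration only; the abstract does not specify the depth threshold used to define covered positions.

```python
def genome_coverage(depths, min_depth=10):
    """Fraction of positions with read depth >= min_depth.

    min_depth=10 is an assumed cut-off, not a value from the study.
    """
    if not depths:
        return 0.0
    covered = sum(1 for d in depths if d >= min_depth)
    return covered / len(depths)


def low_coverage_regions(depths, min_depth=10):
    """Return 1-based inclusive (start, end) runs with depth < min_depth."""
    regions = []
    start = None
    for pos, d in enumerate(depths, start=1):
        if d < min_depth:
            if start is None:
                start = pos  # a low-coverage run begins here
        elif start is not None:
            regions.append((start, pos - 1))  # run ended at previous position
            start = None
    if start is not None:
        regions.append((start, len(depths)))  # run extends to the last position
    return regions


# Toy depth profile: 10 positions with a 3-base drop-out.
toy_depths = [50] * 5 + [0] * 3 + [40] * 2
toy_fraction = genome_coverage(toy_depths)
toy_gaps = low_coverage_regions(toy_depths)
```

In practice the depth list would come from a per-base depth report for the SARS-CoV-2 reference; any reported gap in a targeted method would then be a candidate for checking against primer or probe positions.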
Funding Information
- National Research Agency (ANR-20-COVI-0064)