Detection of prevalent SARS-CoV-2 variant lineages in wastewater and clinical sequences from cities in Québec, Canada
Preprint
- 1 February 2022
- research article
- Published by Cold Spring Harbor Laboratory
Abstract
Wastewater-based epidemiology has emerged as a promising tool to monitor pathogens in a population, particularly when clinical diagnostic capacities become overwhelmed. During the ongoing COVID-19 pandemic caused by Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2), several jurisdictions have tracked viral concentrations in wastewater to inform public health authorities. While some studies have also sequenced SARS-CoV-2 genomes from wastewater, there have been relatively few direct comparisons between viral genetic diversity in wastewater and matched clinical samples from the same region and time period. Here we report sequencing and inference of SARS-CoV-2 mutations and variant lineages (including variants of concern) in 936 wastewater samples and thousands of matched clinical sequences collected between March 2020 and July 2021 in the cities of Montreal, Quebec City, and Laval, representing almost half the population of the Canadian province of Quebec. We benchmarked our sequencing and variant-calling methods on known viral genome sequences to establish thresholds for inferring variants in wastewater with confidence. We found that variant frequency estimates in wastewater and clinical samples are correlated over time in each city, with similar dates of first detection. Across all variant lineages, wastewater detection is more concordant with targeted outbreak sequencing than with semi-random clinical swab sampling. Most variants were first observed in clinical and outbreak data due to higher sequencing rate. However, wastewater sequencing is highly efficient, detecting more variants for a given sampling effort. This shows the potential for wastewater sequencing to provide useful public health data, especially at places or times when sufficient clinical sampling is infrequent or infeasible.