Detection of prevalent SARS-CoV-2 variant lineages in wastewater and clinical sequences from cities in Québec, Canada

Abstract
Wastewater-based epidemiology has emerged as a promising tool to monitor pathogens in a population, particularly when clinical diagnostic capacities become overwhelmed. During the ongoing COVID-19 pandemic caused by Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2), several jurisdictions have tracked viral concentrations in wastewater to inform public health authorities. While some studies have also sequenced SARS-CoV-2 genomes from wastewater, there have been relatively few direct comparisons between viral genetic diversity in wastewater and matched clinical samples from the same region and time period. Here we report sequencing and inference of SARS-CoV-2 mutations and variant lineages (including variants of concern) in 936 wastewater samples and thousands of matched clinical sequences collected between March 2020 and July 2021 in the cities of Montreal, Quebec City, and Laval, representing almost half the population of the Canadian province of Quebec. We benchmarked our sequencing and variant-calling methods on known viral genome sequences to establish thresholds for inferring variants in wastewater with confidence. We found that variant frequency estimates in wastewater and clinical samples are correlated over time in each city, with similar dates of first detection. Across all variant lineages, wastewater detection is more concordant with targeted outbreak sequencing than with semi-random clinical swab sampling. Most variants were first observed in clinical and outbreak data due to higher sequencing rate. However, wastewater sequencing is highly efficient, detecting more variants for a given sampling effort. This shows the potential for wastewater sequencing to provide useful public health data, especially at places or times when sufficient clinical sampling is infrequent or infeasible.