Sixteen novel lineages of SARS-CoV-2 in South Africa
Top Cited Papers
- 2 February 2021
- journal article
- research article
- Published by Springer Science and Business Media LLC in Nature Medicine
- Vol. 27 (3), 440-446
- https://doi.org/10.1038/s41591-021-01255-3
Abstract
The first severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection in South Africa was identified on 5 March 2020, and by 26 March the country was in full lockdown (Oxford stringency index of 90)1. Despite the early response, by November 2020, over 785,000 people in South Africa were infected, which accounted for approximately 50% of all known African infections2. In this study, we analyzed 1,365 near whole genomes and report the identification of 16 new lineages of SARS-CoV-2 isolated between 6 March and 26 August 2020. Most of these lineages have unique mutations that have not been identified elsewhere. We also show that three lineages (B.1.1.54, B.1.1.56 and C.1) spread widely in South Africa during the first wave, comprising ~42% of all infections in the country at the time. The newly identified C lineage of SARS-CoV-2, C.1, which has 16 nucleotide mutations as compared with the original Wuhan sequence, including one amino acid change on the spike protein, D614G (ref. 3), was the most geographically widespread lineage in South Africa by the end of August 2020. An early South African-specific lineage, B.1.106, which was identified in April 2020 (ref. 4), became extinct after nosocomial outbreaks were controlled in KwaZulu-Natal Province. Our findings show that genomic surveillance can be implemented on a large scale in Africa to identify new lineages and inform measures to control the spread of SARS-CoV-2. Such genomic surveillance presented in this study has been shown to be crucial in the identification of the 501Y.V2 variant in South Africa in December 2020 (ref. 5).Keywords
This publication has 38 references indexed in Scilit:
- SERAPHIM: studying environmental rasters and phylogenetically informed movementsBioinformatics, 2016
- Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen)Virus Evolution, 2016
- IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood PhylogeniesMolecular Biology and Evolution, 2014
- A New Framework and Software to Estimate Time-Varying Reproduction Numbers During EpidemicsAmerican Journal of Epidemiology, 2013
- MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and UsabilityMolecular Biology and Evolution, 2013
- Phylogeography Takes a Relaxed Random Walk in Continuous Space and TimeMolecular Biology and Evolution, 2010
- Reconstructing influenza incidence by deconvolution of daily mortality time seriesProceedings of the National Academy of Sciences of the United States of America, 2009
- Bayesian analysis of elapsed times in continuous‐time Markov chainsThe Canadian Journal of Statistics / La Revue Canadienne de Statistique, 2008
- How generation intervals shape the relationship between growth rates and reproductive numbersProceedings. Biological sciences, 2006
- Sampling theory for neutral alleles in a varying environmentPhilosophical Transactions B, 1994