A SARS-CoV-2 vaccine candidate would likely match all currently circulating variants
Open Access
- 22 September 2020
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences of the United States of America
- Vol. 117 (38), 23652-23662
- https://doi.org/10.1073/pnas.2008281117
Abstract
The magnitude of the COVID-19 pandemic underscores the urgency for a safe and effective vaccine. Many vaccine candidates focus on the Spike protein, as it is targeted by neutralizing antibodies and plays a key role in viral entry. Here we investigate the diversity seen in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequences and compare it to the sequence on which most vaccine candidates are based. Using 18,514 sequences, we perform phylogenetic, population genetics, and structural bioinformatics analyses. We find limited diversity across SARS-CoV-2 genomes: Only 11 sites show polymorphisms in >5% of sequences; yet two mutations, including the D614G mutation in Spike, have already become consensus. Because SARS-CoV-2 is being transmitted more rapidly than it evolves, the viral population is becoming more homogeneous, with a median of seven nucleotide substitutions between genomes. There is evidence of purifying selection but little evidence of diversifying selection, with substitution rates comparable across structural versus nonstructural genes. Finally, the Wuhan-Hu-1 reference sequence for the Spike protein, which is the basis for different vaccine candidates, matches optimized vaccine inserts, being identical to an ancestral sequence and one mutation away from the consensus. While the rapid spread of the D614G mutation warrants further study, our results indicate that drift and bottleneck events can explain the minimal diversity found among SARS-CoV-2 sequences. These findings suggest that a single vaccine candidate should be efficacious against currently circulating lineages.Keywords
This publication has 71 references indexed in Scilit:
- MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and UsabilityMolecular Biology and Evolution, 2013
- Detecting Individual Sites Subject to Episodic Diversifying SelectionPLoS Genetics, 2012
- A Random Effects Branch-Site Model for Detecting Episodic Diversifying SelectionMolecular Biology and Evolution, 2011
- and D do not replace FSTMolecular Ecology, 2011
- FastTree 2 – Approximately Maximum-Likelihood Trees for Large AlignmentsPLOS ONE, 2010
- The Population Genetics of dN/dSPLoS Genetics, 2008
- Rates of evolutionary change in viruses: patterns and determinantsNature Reviews Genetics, 2008
- Not So Different After All: A Comparison of Methods for Detecting Amino Acid Sites Under SelectionMolecular Biology and Evolution, 2005
- Diversity Considerations in HIV-1 Vaccine SelectionScience, 2002
- Analysis of Gene Diversity in Subdivided PopulationsProceedings of the National Academy of Sciences of the United States of America, 1973