Variant Analysis and Strategic Clustering to Sub-Lineage of Double Mutant Strain B.1.617 of SARS-CoV-2
Open Access
- 20 April 2022
- Vol. 2 (5), 513-531
- https://doi.org/10.3390/covid2050038
Abstract
SARS-CoV-2 is an RNA coronavirus responsible for Acute Respiratory Syndrome (COVID-19). In January 2021, the re-occurrence of COVID-19 infection was at its peak, considered the second wave of epidemics. In the initial stage, it was considered a double mutant strain due to two significant mutations observed in their Spike protein (E484Q and L452R). Although it was first detected in India later on, it was spread to several countries worldwide, causing high fatality due to this strain. In the present study, we investigated the spreading of B.1.617 strain worldwide through 822 genome sequences submitted in GISAID on 21 April 2021. All genome sequences were analyzed for variations in genome sequences based on their effects due to changes in nucleotides. At Allele frequency 0.05, there were a total of 47 variations in ORF1ab, 22 in Spike protein gene, 6 variations in N gene, 5 in ORF8 and M gene, four mutations in Orf7a, and one nucleotide substitution observed for ORF3a, ORF6 and ORF7b gene. The clustering for similar mutations mentioned B.1.617 sub-lineages. The outcome of this study established relative occurrence and spread worldwide. The study’s finding represented that “double mutant” strain is not only spread through traveling but it is also observed to evolve naturally with different mutations observed in B.1.617 lineage. The information extracted from the study helps to understand viral evolution and genome variations of B.1.617 lineage. The results support the need of separating B.1.617 into sub-lineages.This publication has 51 references indexed in Scilit:
- GISAID: Global initiative on sharing all influenza data – from vision to realityEurosurveillance, 2017
- Mechanisms of viral mutationCellular and Molecular Life Sciences, 2016
- Accessory proteins of SARS-CoV and other coronavirusesAntiviral Research, 2014
- LoFreq: a sequence-quality aware, ultra-sensitive variant caller for uncovering cell-population heterogeneity from high-throughput sequencing datasetsNucleic Acids Research, 2012
- A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEffFly, 2012
- Using Drosophila melanogaster as a Model for Genotoxic Chemical Mutational Studies with a New Program, SnpSiftFrontiers in Genetics, 2012
- Fast, scalable generation of high‐quality protein multiple sequence alignments using Clustal OmegaMolecular Systems Biology, 2011
- Dindel: Accurate indel calls from short-read dataGenome Research, 2010
- Ultrafast and memory-efficient alignment of short DNA sequences to the human genomeGenome Biology, 2009
- Structure and Intracellular Targeting of the SARS-Coronavirus Orf7a Accessory ProteinStructure, 2005