Genome-Wide Analysis of Codon Usage Patterns of SARS-CoV-2 Virus Reveals Global Heterogeneity of COVID-19
Open Access
- 18 June 2021
- journal article
- research article
- Published by MDPI AG in Biomolecules
- Vol. 11 (6), 912
- https://doi.org/10.3390/biom11060912
Abstract
The ongoing outbreak of coronavirus disease COVID-19 is significantly implicated by global heterogeneity in the genome organization of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The causative agents of global heterogeneity in the whole genome of SARS-CoV-2 are not well characterized due to the lack of comparative study of a large enough sample size from around the globe to reduce the standard deviation to the acceptable margin of error. To better understand the SARS-CoV-2 genome architecture, we have performed a comprehensive analysis of codon usage bias of sixty (60) strains to get a snapshot of its global heterogeneity. Our study shows a relatively low codon usage bias in the SARS-CoV-2 viral genome globally, with nearly all the over-preferred codons’ A.U. ended. We concluded that the SARS-CoV-2 genome is primarily shaped by mutation pressure; however, marginal selection pressure cannot be overlooked. Within the A/U rich virus genomes of SARS-CoV-2, the standard deviation in G.C. (42.91% ± 5.84%) and the GC3 value (30.14% ± 6.93%) points towards global heterogeneity of the virus. Several SARS-CoV-2 viral strains were originated from different viral lineages at the exact geographic location also supports this fact. Taking all together, these findings suggest that the general root ancestry of the global genomes are different with different genome’s level adaptation to host. This research may provide new insights into the codon patterns, host adaptation, and global heterogeneity of SARS-CoV-2.This publication has 56 references indexed in Scilit:
- DAMBE5: A Comprehensive Software Package for Data Analysis in Molecular Biology and EvolutionMolecular Biology and Evolution, 2013
- SSE: a nucleotide and amino acid sequence analysis platformBMC Research Notes, 2012
- Artemis: an integrated platform for visualization and analysis of high-throughput sequence-based experimental dataBioinformatics, 2011
- Codon usage bias and the evolution of influenza A viruses. Codon Usage Biases of Influenza VirusBMC Evolutionary Biology, 2010
- Forces that influence the evolution of codon biasPhilosophical Transactions B, 2010
- Analysis of synonymous codon usage in SARS Coronavirus and other viruses in the NidoviralesVirus Research, 2004
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- An evolutionary perspective on synonymous codon usage in unicellular organismsJournal of Molecular Evolution, 1986
- Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genesJournal of Molecular Biology, 1981