SCUMBLE: a method for systematic and accurate detection of codon usage bias by maximum likelihood estimation
Open Access
- 21 May 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (11), 3819-3827
- https://doi.org/10.1093/nar/gkn288
Abstract
The genetic code is degenerate—most amino acids can be encoded by from two to as many as six different codons. The synonymous codons are not used with equal frequency: not only are some codons favored over others, but also their usage can vary significantly from species to species and between different genes in the same organism. Known causes of codon bias include differences in mutation rates as well as selection pressure related to the expression level of a gene, but the standard analysis methods can account for only a fraction of the observed codon usage variation. We here introduce an explicit model of codon usage bias, inspired by statistical physics. Combining this model with a maximum likelihood approach, we are able to clearly identify different sources of bias in various genomes. We have applied the algorithm to Saccharomyces cerevisiae as well as 325 prokaryote genomes, and in most cases our model explains essentially all observed variance.This publication has 39 references indexed in Scilit:
- Single-cell proteomic analysis of S. cerevisiae reveals the architecture of biological noiseNature, 2006
- Codon Usage Domains over Bacterial ChromosomesPLoS Computational Biology, 2006
- A problem in multivariate analysis of codon usage data and a possible solutionFEBS Letters, 2005
- Intragenic Spatial Patterns of Codon Usage Bias in Prokaryotic and Eukaryotic GenomesGenetics, 2004
- Online synonymous codon usage analyses with the ade4 and seqinR packagesBioinformatics, 2004
- The ‘effective number of codons’ used in a geneGene, 1990
- An evolutionary perspective on synonymous codon usage in unicellular organismsJournal of Molecular Evolution, 1986
- Correlation between the abundance of yeast transfer RNAs and the occurrence of the respective codons in protein genes: Differences in synonymous codon choice patterns of yeast and Escherichia coli with reference to the abundance of isoaccepting transfer RNAsJournal of Molecular Biology, 1982
- Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes: A proposal for a synonymous codon choice that is optimal for the E. coli translational systemJournal of Molecular Biology, 1981
- Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genesJournal of Molecular Biology, 1981