INDELible: A Flexible Simulator of Biological Sequence Evolution
Top Cited Papers
Open Access
- 7 May 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 26 (8), 1879-1888
- https://doi.org/10.1093/molbev/msp098
Abstract
Many methods exist for reconstructing phylogenies from molecular sequence data, but few phylogenies are known and can be used to check their efficacy. Simulation remains the most important approach to testing the accuracy and robustness of phylogenetic inference methods. However, current simulation programs are limited, especially concerning realistic models for simulating insertions and deletions. We implement a portable and flexible application, named INDELible, for generating nucleotide, amino acid and codon sequence data by simulating insertions and deletions (indels) as well as substitutions. Indels are simulated under several models of indel-length distribution. The program implements a rich repertoire of substitution models, including the general unrestricted model and nonstationary nonhomogeneous models of nucleotide substitution, mixture, and partition models that account for heterogeneity among sites, and codon models that allow the nonsynonymous/synonymous substitution rate ratio to vary among sites and branches. With its many unique features, INDELible should be useful for evaluating the performance of many inference methods, including those for multiple sequence alignment, phylogenetic tree inference, and ancestral sequence, or genome reconstruction.Keywords
This publication has 75 references indexed in Scilit:
- Problems and Solutions for Estimating Indel Rates and Length DistributionsMolecular Biology and Evolution, 2008
- An Improved General Amino Acid Replacement MatrixMolecular Biology and Evolution, 2008
- Tools for simulating evolution of aligned genomic regions with integrated parameter estimationGenome Biology, 2008
- An initial map of insertion and deletion (INDEL) variation in the human genomeGenome Research, 2006
- Initial sequencing and comparative analysis of the mouse genomeNature, 2002
- rtREV: An Amino Acid Substitution Matrix for Inference of Retrovirus and Reverse Transcriptase PhylogenyJournal of Molecular Evolution, 2002
- The rapid generation of mutation data matrices from protein sequencesBioinformatics, 1992
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981
- A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequencesJournal of Molecular Evolution, 1980
- Exact stochastic simulation of coupled chemical reactionsThe Journal of Physical Chemistry, 1977