Dynamic Programming Alignment Accuracy

1 January 1998

journal article
Published by Mary Ann Liebert Inc in Journal of Computational Biology

Vol. 5 (3), 493-504
https://doi.org/10.1089/cmb.1998.5.493

Abstract

Algorithms for generating alignments of biological sequences have inherent statistical limitations when it comes to the accuracy of the alignments they produce. Using simulations, we measure the accuracy of the standard global dynamic programming method and show that it can be reasonably well modelled by an "edge wander" approximation to the distribution of the optimal scoring path around the correct path in the vicinity of a gap. We also give a table from which accuracy values can be predicted for commonly used scoring schemes and sequence divergences (the PAM and BLOSUM series). Finally we describe how to calculate the expected accuracy of a given alignment, and show how this can be used to construct an optimal accuracy alignment algorithm which generates significantly more accurate alignments than standard dynamic programming methods in simulated experiments.

Keywords

This publication has 10 references indexed in Scilit:

Significant Improvement in Accuracy of Multiple Protein Sequence Alignments by Iterative Refinement as Assessed by Reference to Structural Alignments
Journal of Molecular Biology, 1996
Similarity Detection and Localization
Physical Review Letters, 1996
Quantifying the local reliability of a sequence alignment
Protein Engineering, Design and Selection, 1996
A reliable sequence alignment method based on probabilities of residue correspondences
Protein Engineering, Design and Selection, 1995
Sequence alignment and penalty choice
Journal of Molecular Biology, 1994
Inching toward reality: An improved likelihood model of sequence evolution
Journal of Molecular Evolution, 1992
Basic local alignment search tool
Journal of Molecular Biology, 1990
Improved tools for biological sequence comparison.
Proceedings of the National Academy of Sciences, 1988
Identification of common molecular subsequences
Journal of Molecular Biology, 1981
A general method applicable to the search for similarities in the amino acid sequence of two proteins
Journal of Molecular Biology, 1970

Cited by 89 articles