MUSTANG: A multiple structural alignment algorithm
- 14 August 2006
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 64 (3), 559-574
- https://doi.org/10.1002/prot.20921
Abstract
Multiple structural alignment is a fundamental problem in structural genomics. In this article, we define a reliable and robust algorithm, MUSTANG (MUltiple STructural AligNment AlGorithm), for the alignment of multiple protein structures. Given a set of protein structures, the program constructs a multiple alignment using the spatial information of the C-alpha atoms in the set. Broadly based on the progressive pairwise heuristic, this algorithm gains accuracy through novel and effective refinement phases. MUSTANG reports the multiple sequence alignment and the corresponding superposition of structures. Alignments generated by MUSTANG are compared with several hand-curated alignments in the literature as well as with the benchmark alignments of 1033 alignment families from the HOMSTRAD database. The performance of MUSTANG was compared with DALI at a pairwise level, and with other multiple structural alignment tools such as POSA, CE-MC, MALECON, and MultiProt. MUSTANG performs comparably to popular pairwise and multiple structural alignment tools for closely related proteins, and performs more reliably than other multiple structural alignment methods on hard data sets containing distantly related proteins or proteins that show conformational. changes. (c) 2006 Wiley-Liss, Inc.This publication has 62 references indexed in Scilit:
- Structural divergence and distant relationships in proteins: evolution of the globinsCurrent Opinion in Structural Biology, 2005
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004
- Computational Complexity of Multiple Sequence Alignment with SP-ScoreJournal of Computational Biology, 2001
- T-coffee: a novel method for fast and accurate multiple sequence alignmentJournal of Molecular Biology, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- A rapid method of protein structure alignmentJournal of Theoretical Biology, 1990
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Definition of general topological equivalence in protein structures: A procedure involving comparison of properties and relationships through simulated annealing and dynamic programmingJournal of Molecular Biology, 1990
- How different amino acid sequences determine similar protein structures: The structure and evolutionary dynamics of the globinsJournal of Molecular Biology, 1980