Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
Open Access
- 1 September 1997
- journal article
- review article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 25 (17), 3389-3402
- https://doi.org/10.1093/nar/25.17.3389
Abstract
The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. For protein comparisons, a variety of definitional, algorithmic and statistical refinements described here permits the execution time of the BLAST programs to be decreased substantially while enhancing their sensitivity to weak similarities. A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original. In addition, a method is introduced for automatically combining statistically significant alignments produced by BLAST into a position-specific score matrix, and searching the database using this matrix. The resulting Position-Specific Iterated BLAST (PSIBLAST) program runs at approximately the same speed per iteration as gapped BLAST, but in many cases is much more sensitive to weak but biologically relevant sequence similarities. PSI-BLAST is used to uncover several new and interesting members of the BRCT superfamily.This publication has 67 references indexed in Scilit:
- Embedding strategies for effective use of information from multiple sequence alignmentsProtein Science, 1997
- Identification of a RING protein that can interact in vivo with the BRCA1 gene productNature Genetics, 1996
- Sequence Analysis of the Genome of the Unicellular Cyanobacterium Synechocystis sp. Strain PCC6803. II. Sequence Determination of the Entire Genome and Assignment of Potential Protein-coding RegionsDNA Research, 1996
- Maximum Discrimination Hidden Markov Models of Sequence ConsensusJournal of Computational Biology, 1995
- Position-based sequence weightsJournal of Molecular Biology, 1994
- Volume changes in protein evolutionJournal of Molecular Biology, 1994
- Amino acid substitution matrices from an information theoretic perspectiveJournal of Molecular Biology, 1991
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Systematic method for the detection of potential λ Cro-like DNA-binding regions in proteinsJournal of Molecular Biology, 1987
- Selection of DNA binding sites by regulatory proteins: Statistical-mechanical theory and application to operators and promotersJournal of Molecular Biology, 1987