Rapid and Sensitive Protein Similarity Searches

22 March 1985

journal article
research article
Published by American Association for the Advancement of Science (AAAS) in Science

Vol. 227 (4693), 1435-1441
https://doi.org/10.1126/science.2983426

Abstract

An algorithm was developed which facilitates the search for similarities between newly determined amino acid sequences and sequences already available in databases. Because of the algorithm's efficiency on many microcomputers, sensitive protein database searches may now become a routine procedure for molecular biologists. The method efficiently identifies regions of similar sequence and then scores the aligned identical and differing residues in those regions by means of an amino acid replaceability matrix. This matrix increases sensitivity by giving high scores to those amino acid replacements which occur frequently in evolution. The algorithm has been implemented in a computer program designed to search protein databases very rapidly. For example, comparison of a 200-amino-acid sequence to the 500,000 residues in the National Biomedical Research Foundation library would take less than 2 minutes on a minicomputer, and less than 10 minutes on a microcomputer (IBM PC).
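
The two stages named in the abstract (first locate a region of similar sequence cheaply, then score the aligned residues in that region with a replaceability matrix) can be illustrated with a minimal Python sketch. It is not the authors' program. It assumes a k-tuple lookup table and a diagonal hit count for the first stage, and it substitutes a toy scoring scheme (the hypothetical `GROUPS` and `toy_score`) for a real replaceability matrix. All function names and the `ktup` parameter name are chosen here for illustration.

```python
from collections import defaultdict

# Toy replaceability groups: an assumption for illustration only,
# not the matrix used in the paper.
GROUPS = ["ILVM", "FWY", "KRH", "DE", "NQ", "ST", "AG"]


def toy_score(a, b):
    """Identity scores +2, a same-group replacement +1, anything else -1."""
    if a == b:
        return 2
    if any(a in g and b in g for g in GROUPS):
        return 1
    return -1


def best_diagonal(query, target, ktup=2):
    """Return the diagonal (target offset minus query offset) that
    collects the most shared k-tuples, or None if there are no hits."""
    # Index every k-tuple of the query by its start position.
    index = defaultdict(list)
    for i in range(len(query) - ktup + 1):
        index[query[i:i + ktup]].append(i)
    # Scan the target once; each shared k-tuple votes for one diagonal.
    hits = defaultdict(int)
    for j in range(len(target) - ktup + 1):
        for i in index.get(target[j:j + ktup], ()):
            hits[j - i] += 1
    if not hits:
        return None
    return max(hits, key=hits.get)


def rescore_diagonal(query, target, diag):
    """Best-scoring ungapped segment along one diagonal
    (a maximum-subarray scan over per-residue scores)."""
    best = run = 0
    for i in range(len(query)):
        j = i + diag
        if 0 <= j < len(target):
            run = max(0, run + toy_score(query[i], target[j]))
            best = max(best, run)
    return best


def similarity(query, target, ktup=2):
    """Stage 1: pick the most promising diagonal. Stage 2: rescore it."""
    diag = best_diagonal(query, target, ktup)
    return 0 if diag is None else rescore_diagonal(query, target, diag)
```

In this sketch the first stage touches each residue of the library sequence only once, which is where the speed comes from. The second stage is what lets conservative replacements, such as V for I under the toy groups, add to the score instead of counting as mismatches.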

Cited by 3286 articles