Gene functional similarity search tool (GFSST)
Open Access
- 14 March 2006
- journal article
- research article
- Published by Springer Science and Business Media LLC in BMC Bioinformatics
- Vol. 7 (1), 135
- https://doi.org/10.1186/1471-2105-7-135
Abstract
Background: With the completion of the genome sequences of human, mouse, and other species and the advent of high-throughput functional genomic research technologies such as biomicroarray chips, more and more genes and their products have been discovered and their functions have begun to be understood. Increasing amounts of data about genes, gene products, and their functions have been stored in databases. To facilitate selection of candidate genes for gene-disease research, genetic association studies, biomarker and drug target selection, and animal models of human diseases, it is essential to have search engines that can retrieve genes by their functions from proteome databases. In recent years, the development of the Gene Ontology (GO) has established structured, controlled vocabularies describing gene functions, which makes it possible to develop novel tools to search for genes by functional similarity.
Results: By using a statistical model to measure the functional similarity of genes based on the Gene Ontology directed acyclic graph, we developed a novel Gene Functional Similarity Search Tool (GFSST) to identify genes with related functions from annotated proteome databases. This search engine lets users design their search targets by gene functions.
Conclusion: An implementation of GFSST that works on UniProt (Universal Protein Resource) for the human and mouse proteomes is available at the GFSST Web Server. GFSST provides functions not only for similar gene retrieval but also for gene search by one or more GO terms. This represents a powerful new approach for selecting similar genes and gene products from proteome databases according to their functions.
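The abstract does not spell out the statistical model GFSST uses, so the following is only a rough sketch of the general idea of ranking genes by functional similarity over an ontology DAG. It assumes a Resnik-style measure (information content of the most informative common ancestor), which is one common choice and not necessarily the one used by GFSST. The ontology terms, gene names, annotations, and function names are all hypothetical toy data made up for illustration; they are not real GO identifiers or UniProt entries.

```python
import math

# Hypothetical toy ontology (NOT real GO terms): child term -> set of parent terms.
PARENTS = {
    "root": set(),
    "binding": {"root"},
    "catalysis": {"root"},
    "dna_binding": {"binding"},
    "protein_binding": {"binding"},
    "kinase": {"catalysis"},
    "dna_repair_enzyme": {"dna_binding", "catalysis"},
}

# Hypothetical annotations: gene -> directly annotated terms.
ANNOTATIONS = {
    "geneA": {"dna_binding"},
    "geneB": {"dna_repair_enzyme"},
    "geneC": {"kinase"},
    "geneD": {"protein_binding"},
}


def ancestors(term):
    """Return the term together with all of its ancestors in the DAG."""
    seen = {term}
    stack = [term]
    while stack:
        for parent in PARENTS[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def closure(gene):
    """All terms a gene is annotated to, directly or through an ancestor."""
    covered = set()
    for term in ANNOTATIONS[gene]:
        covered |= ancestors(term)
    return covered


def information_content():
    """IC(t) = -log p(t), with p(t) the fraction of genes annotated to t or a descendant."""
    counts = {}
    for gene in ANNOTATIONS:
        for term in closure(gene):
            counts[term] = counts.get(term, 0) + 1
    total = len(ANNOTATIONS)
    return {term: -math.log(count / total) for term, count in counts.items()}


def gene_similarity(g1, g2, ic):
    """Assumed Resnik-style score: best IC over the common ancestors of any term pair."""
    return max(
        ic[t]
        for a in ANNOTATIONS[g1]
        for b in ANNOTATIONS[g2]
        for t in ancestors(a) & ancestors(b)
    )


def search_similar(query_gene, ic):
    """Rank all other genes by decreasing similarity to the query gene."""
    scores = [(g, gene_similarity(query_gene, g, ic)) for g in ANNOTATIONS if g != query_gene]
    return sorted(scores, key=lambda pair: (-pair[1], pair[0]))


def search_by_terms(terms):
    """Genes annotated (directly or via descendants) to every term in the query."""
    return sorted(g for g in ANNOTATIONS if set(terms) <= closure(g))


if __name__ == "__main__":
    ic = information_content()
    print(search_similar("geneA", ic))
    print(search_by_terms({"binding", "catalysis"}))
```

In this toy setup, `search_similar` corresponds to retrieving genes similar to a query gene, and `search_by_terms` corresponds to searching by one or more ontology terms. A real implementation would load the GO graph and annotations from curated sources rather than hard-coded dictionaries.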