Evaluation of GO-based functional similarity measures using S. cerevisiae protein interaction and expression profile data
Open Access
- 6 November 2008
- journal article
- Published by Springer Science and Business Media LLC in BMC Bioinformatics
- Vol. 9 (1), 472
- https://doi.org/10.1186/1471-2105-9-472
Abstract
Researchers interested in analysing the expression patterns of functionally related genes usually hope to improve the accuracy of their results beyond the boundaries of currently available experimental data. Gene ontology (GO) data provides a novel way to measure the functional relationship between gene products. Many approaches have been reported for calculating the similarities between two GO terms, known as semantic similarities. However, biologists are more interested in the relationship between gene products than in the scores linking the GO terms. To highlight the relationships among genes, recent studies have focused on functional similarities. In this study, we evaluated five functional similarity methods using both protein-protein interaction (PPI) and expression data of S. cerevisiae. The receiver operating characteristics (ROC) and correlation coefficient analysis of these methods showed that the maximum method outperformed the other methods. Statistical comparison of multiple- and single-term annotated proteins in biological process ontology indicated that genes with multiple GO terms may be more reliable for separating true positives from noise. This study demonstrated the reliability of current approaches that elevate the similarity of GO terms to the similarity of proteins. Suggestions for further improvements in functional similarity analysis are also provided.Keywords
This publication has 29 references indexed in Scilit:
- Uncovering signal transduction networks from high-throughput data by integer linear programmingNucleic Acids Research, 2008
- Using support vector machine combined with auto covariance to predict protein–protein interactions from protein sequencesNucleic Acids Research, 2008
- FunSimMat: a comprehensive functional similarity databaseNucleic Acids Research, 2007
- Information theory applied to the sparse gene ontology annotation network to predict novel gene functionBioinformatics, 2007
- Co-clustering and visualization of gene expression data and gene ontology terms for Saccharomyces cerevisiae using self-organizing mapsJournal of Biomedical Informatics, 2007
- Correlation between Gene Expression and GO Semantic SimilarityIEEE/ACM Transactions on Computational Biology and Bioinformatics, 2005
- Probabilistic model of the human protein-protein interaction networkNature Biotechnology, 2005
- A graph-theoretic modeling on GO space for biological interpretation of gene clustersBioinformatics, 2004
- Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotationBioinformatics, 2003
- The Transcriptional Program of Sporulation in Budding YeastScience, 1998