Exploiting disjointness axioms to improve semantic similarity measures
Open Access
- 3 September 2013
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 29 (21), 2781-2787
- https://doi.org/10.1093/bioinformatics/btt491
Abstract
Motivation: Representing domain knowledge in biology has traditionally been accomplished by creating simple hierarchies of classes with textual annotations. Recently, expressive ontology languages, such as Web Ontology Language, have become more widely adopted, supporting axioms that express logical relationships other than class–subclass, e.g. disjointness. This is improving the coverage and validity of the knowledge contained in biological ontologies. However, current semantic tools still need to adapt to this more expressive information. In this article, we propose a method to integrate disjointness axioms, which are being incorporated in real-world ontologies, such as the Gene Ontology and the chemical entities of biological interest ontology, into semantic similarity, the measure that estimates the closeness in meaning between classes.
Results: We present a modification of the measure of shared information content, which extends the base measure to allow the incorporation of disjointness information. To evaluate our approach, we applied it to several randomly selected datasets extracted from the chemical entities of biological interest ontology. In 93.8% of these datasets, our measure performed better than the base measure of shared information content. This supports the idea that semantic similarity is more accurate if it extends beyond the hierarchy of classes of the ontology.
Contact: joao.ferreira@lasige.di.fc.ul.pt
Supplementary information: Supplementary data are available at Bioinformatics online.
This publication has 14 references indexed in Scilit:
- Enhancement of Chemical Entity Identification in Text Using Semantic Similarity Validation. PLOS ONE, 2013
- The ChEBI reference database and ontology for biologically relevant chemistry: enhancements for 2013. Nucleic Acids Research, 2012
- Structure-based classification and ontology in chemistry. Journal of Cheminformatics, 2012
- Disjunctive shared information between ontology concepts: application to Gene Ontology. Journal of Biomedical Semantics, 2011
- Semantic Similarity for Automatic Classification of Chemical Compounds. PLoS Computational Biology, 2010
- Clinical Diagnostics in Human Genetics with Semantic Similarity Searches in Ontologies. American Journal of Human Genetics, 2009
- Metrics for GO based protein semantic similarity: a systematic evaluation. BMC Bioinformatics, 2008
- ChEBI: a database and ontology for chemical entities of biological interest. Nucleic Acids Research, 2007
- A novel view on information content of concepts in a large ontology and a view on the structure and the quality of the ontology. International Journal of Medical Informatics, 2005
- On the Properties of Bit String-Based Measures of Chemical Similarity. Journal of Chemical Information and Computer Sciences, 1998