Computing Semantic Similarities Based on Machine-Readable Dictionaries

Abstract

The measurement of semantic similarity is a foundation work in semantic computing. In this paper the authors study the similarity measure between two words. Different from previous works, this paper suggests a novel method that relies on machine-readable dictionaries for measuring similarities. Machine-readable dictionaries are more widely available than other kinds of lexical resources. If two words have similar definitions, they are semantically similar. A definition is represented by a definition vector. Each dimension represents a word in the dictionary. The score of each dimension in the vector is calculated by a variation of tf*idf. Evaluations show that this method achieves competitive results in both Chinese and English.

Keywords

This publication has 13 references indexed in Scilit:

A Web Search Engine-Based Approach to Measure Semantic Similarity between Words
IEEE Transactions on Knowledge and Data Engineering, 2010
Chinese Question Classification Using Combination Approach
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2007
Chinese segmentation and new word detection using conditional random fields
Published by Association for Computational Linguistics (ACL) ,2004
An approach for measuring semantic similarity between words using multiple information sources
IEEE Transactions on Knowledge and Data Engineering, 2003
MindNet
Published by Association for Computational Linguistics (ACL) ,1998
WordNet
Communications of the ACM, 1995
Combining corpus and machine-readable dictionary data for building bilingual lexicons
Machine Translation, 1995
The acquisition of lexical knowledge from combined machine-readable dictionary sources
Published by Association for Computational Linguistics (ACL) ,1992
Contextual correlates of semantic similarity
Language and Cognitive Processes, 1991
Automatic sense disambiguation using machine readable dictionaries
Published by Association for Computing Machinery (ACM) ,1986

Cited by 6 articles