Article ID Journal Published Year Pages File Type
384110 Expert Systems with Applications 2012 11 Pages PDF
Abstract

Estimation of the semantic likeness between words is of great importance in many applications dealing with textual data such as natural language processing, knowledge acquisition and information retrieval. Semantic similarity measures exploit knowledge sources as the base to perform the estimations. In recent years, ontologies have grown in interest thanks to global initiatives such as the Semantic Web, offering an structured knowledge representation. Thanks to the possibilities that ontologies enable regarding semantic interpretation of terms many ontology-based similarity measures have been developed. According to the principle in which those measures base the similarity assessment and the way in which ontologies are exploited or complemented with other sources several families of measures can be identified. In this paper, we survey and classify most of the ontology-based approaches developed in order to evaluate their advantages and limitations and compare their expected performance both from theoretical and practical points of view. We also present a new ontology-based measure relying on the exploitation of taxonomical features. The evaluation and comparison of our approach’s results against those reported by related works under a common framework suggest that our measure provides a high accuracy without some of the limitations observed in other works.

► An up-to-date survey of ontology-based semantic similarity measures is presented. ► A feature-based measure is proposed based on the exploitation of taxonomic knowledge. ► It retains a low computational complexity without depending on corpora or ad-hoc parameters. ► Our measure achieves a high accuracy when evaluated with commonly used benchmarks.

Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , , ,