Article code: 514965; Journal code: 866926; Year of publication: 2015; English article: 20 pages, PDF
English title of the ISI article
Feature-based approaches to semantic similarity assessment of concepts using Wikipedia
Title (translated from the Persian)
Feature-based methods for assessing the semantic similarity of concepts using Wikipedia
Keywords
Concept similarity; Semantic similarity; Semantic relatedness; Feature-based measures; Wikipedia
Related topics
Engineering and basic sciences; Computer engineering; Computer science applications
Abstract


• A formal representation of Wikipedia concepts is presented.
• A framework for feature-based similarity is proposed.
• Several novel feature-based semantic similarity measures are presented.
• Results show that several of the proposed methods correlate well with human judgments.

Assessing the semantic similarity between concepts is an important task in many language-related applications. Several past approaches assess similarity by evaluating the knowledge modeled in one or more ontologies. However, existing measures have limitations: they rely on predefined ontologies and are suited only to non-dynamic domains. Wikipedia provides a very large, domain-independent encyclopedic repository and semantic network for computing the semantic similarity of concepts, with broader coverage than typical ontologies. In this paper, we propose several novel feature-based similarity assessment methods that depend entirely on Wikipedia and avoid most of the limitations described above. To implement feature-based similarity assessment using Wikipedia, we first present a formal representation of Wikipedia concepts. We then give a framework for feature-based similarity built on this formal representation. Lastly, we investigate several feature-based semantic similarity measures that result from instantiating the framework. The evaluation, based on several widely used benchmarks and a benchmark we developed ourselves, supports these intuitions with respect to human judgments. Overall, several of the proposed methods correlate well with human judgments and are effective ways of determining the similarity between Wikipedia concepts.
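
The abstract does not state the concrete formulas, so the following is only a minimal sketch of what a feature-based similarity measure can look like. It uses Tversky's ratio model, a classic feature-based measure swapped in here for illustration; it is not confirmed to be one of the paper's instantiations. The function name `tversky_similarity`, the weights `alpha` and `beta`, and the category-like feature sets are all hypothetical.

```python
from typing import AbstractSet


def tversky_similarity(
    a: AbstractSet[str],
    b: AbstractSet[str],
    alpha: float = 0.5,
    beta: float = 0.5,
) -> float:
    """Tversky ratio model over two feature sets.

    Assumption: each concept is described by a set of features
    (for example, the categories of its Wikipedia article).
    """
    common = len(a & b)   # features shared by both concepts
    only_a = len(a - b)   # features specific to the first concept
    only_b = len(b - a)   # features specific to the second concept
    denominator = common + alpha * only_a + beta * only_b
    if denominator == 0:
        # Both feature sets are empty: no evidence of similarity.
        return 0.0
    return common / denominator


# Hypothetical feature sets, made up for illustration only.
car = {"Vehicles", "Wheeled vehicles", "Transport"}
bicycle = {"Vehicles", "Wheeled vehicles", "Cycling"}

score = tversky_similarity(car, bicycle)
print(f"{score:.3f}")
```

With `alpha = beta = 0.5` the measure reduces to the Dice coefficient, and with `alpha = beta = 1` to the Jaccard index, so the two weights control how strongly unshared features penalize the score.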

Publisher
Database: Elsevier - ScienceDirect
Journal: Information Processing & Management - Volume 51, Issue 3, May 2015, Pages 215–234