Article code: 4944239
Journal code: 1437982
Publication year: 2017
English article: 36 pages, PDF
Full-text version: free download
English title of the ISI article
Controversy detection in Wikipedia using semantic dissimilarity
Persian translation of the title
تشخیص اختلاف در ویکی پدیا با استفاده از اختلاف معنایی
Keywords
Wikipedia, controversy, semantic dissimilarity, sentence similarity, natural language processing, edit similarity
Related topics
Engineering and Basic Sciences > Computer Engineering > Artificial Intelligence
English abstract
The advent of search engines and wikis has made access to information easy and almost free. Wikipedia is the efficacious outcome of an enormous collaboration, and its peer review-like methods of creation, maintenance, and evolution of contents ensure high quality and reliability. However, the “anyone-can-edit” policy of Wikipedia has created many problems such as trolling, vandalism, controversies, and doubts about the content and reliability of the information provided due to non-expert involvement. People have tried to identify and rank controversies in Wikipedia articles through various techniques that use quantitative data, ignoring the semantic significance of conflicts among authors. In this paper, we have addressed the problem of identifying controversy using natural language processing techniques for the first time. The proposed method spots the impact on existing meanings of the text due to new editing processes along with their relationship to the topic of the article. The experimental results for precision (0.901), recall (0.901), accuracy (0.908), and F-measure (0.901) demonstrate the effectiveness of the proposed method. The technique is deemed useful for automatic identification of conflicts newly introduced into existing article contents, and could prove helpful in making decisions for inclusion or exclusion of controversies under the same topic.
Publisher
Database: Elsevier - ScienceDirect
Journal: Information Sciences - Volumes 418–419, December 2017, Pages 581-600