کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
552530 1451085 2014 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
External validity of sentiment mining reports: Can current methods identify demographic biases, event biases, and manipulation of reviews?
ترجمه فارسی عنوان
رویه خارجی گزارش های استخراج احساسات: آیا می توان روش های فعلی تعصب های جمعیت شناسی، تعصبات رویدادی و دستکاری بررسی ها را تشخیص داد؟
کلمات کلیدی
استخراج معادن، نظر معادن، اعتبار خارجی، تعصب جمعیت شناسی، تعصب رویداد، دستکاری بازبینی محصول، اعتبارسنجی گزاره طرح
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر سیستم های اطلاعاتی
چکیده انگلیسی


• Sentiment mining reports are useless when their external validity cannot be assessed.
• Demographics, events, manipulation are main threats of sentiment mining external validity.
• This article gives meta-requirements and meta-designs of an external validity identifier.
• Automatic demographic, event and manipulation detection in sentiment reports is feasible.
• Sentiment mining services need to be complimented by external validity reports.

Many publications in sentiment mining provide new techniques for improved accuracy in extracting features and corresponding sentiments in texts. For the external validity of these sentiment reports, i.e., the applicability of the results to target audiences, it is important to well analyze data of the context of user-generated content and their sample of authors. The literature lacks an analysis of external validity of sentiment mining reports and the sentiment mining field lacks an operationalization of external validity dimensions toward practically useful techniques. From a kernel theory, we identify multiple threats to sentiment mining external validity and study three of them empirically 1) a mismatch in demographics of the reviewers sample, 2) bias due to reviewers' incidental experiences, and 3) manipulation of reviews. The value of external validity threat identifying techniques is next examined in cases from Goodread.com. We conclude that demographic biases can be well detected by current techniques, although we have doubts regarding stylometric techniques for this purpose. We demonstrate the usefulness of event and manipulation bias detection techniques in our cases, but this result needs further replications in more complex and more competitive contexts. Finally, for increasing the decisional usefulness of sentiment mining reports, they should be accompanied by external validity reports and software and service providers in this field should incorporate these in their offerings.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Decision Support Systems - Volume 59, March 2014, Pages 262–273
نویسندگان
, ,