کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
383630 660828 2014 21 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Dynamic estimation of worker reliability in crowdsourcing for regression tasks: Making it work
ترجمه فارسی عنوان
برآورد دینامیک قابلیت اطمینان کارکنان در جمع بندی برای وظایف رگرسیون: ایجاد آن کار
کلمات کلیدی
نظارت بر یادگیری ماشین، برون سپاری، قابلیت اطمینان کارکنان، راهزنان چند مسلح، شناخت احساسی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی


• We propose DER3, a novel approach to estimate worker reliability in crowdsourcing dynamically.
• Constant rater availability is not required.
• A large number of tasks with correct answers known in advance is not required.
• DER3 is specifically suited for regression tasks, but can also be used in different settings.
• DER3 leads to the results of high accuracy and low costs in different application domains.

One of the biggest challenges in crowdsourcing is detecting noisy and incompetent workers. A possible way of handling this problem is to dynamically estimate the reliability of workers as they do work and accept only those workers who are deemed to be reliable to date. Although many approaches to dynamic estimation of rater reliability exist, they are often only appropriate for very specific categories of tasks, for example, only for binary classification. They also can make unrealistic assumptions such as requiring access to a large number of gold standard answers or relying on the constant availability of any rater. In this paper, we propose a novel approach to the dynamic estimation of rater reliability in regression (DER3) using multi-armed bandits. This approach is specifically suited for real-life crowdsourcing scenarios, where the task at hand is labelling or rating corpora to be used in supervised machine learning, and the annotations are continuous ratings, although it can be easily generalised to multi-class or binary classification tasks. We demonstrate that DER3 provides high-accuracy results and at the same time keeps the cost of the rating process low. Although our main motivating example is the recognition of emotion in speech, our approach shows similar results in other application areas.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 41, Issue 14, 15 October 2014, Pages 6190–6210
نویسندگان
, , ,