دانلود رایگان مقاله: روش پاداش های مثبت برای مشکل راهزنی چند مسلح

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
6863650	1439517	2018	12 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Possibilistic reward methods for the multi-armed bandit problem

ترجمه فارسی عنوان

روش پاداش های مثبت برای مشکل راهزنی چند مسلح

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

مشکل چند باند مسلح، پاداش مثبت، مطالعه عددی،

Multi-armed bandit problem - مشکل چند گانه مسلحانه Numerical study - مطالعه عددی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

روش پاداش های مثبت برای مشکل راهزنی چند مسلح

چکیده انگلیسی

In this paper, we propose a set of allocation strategies to deal with the multi-armed bandit problem, the possibilistic reward (PR) methods. First, we use possibilistic reward distributions to model the uncertainty about the expected rewards from the arm, derived from a set of infinite confidence intervals nested around the expected value. Depending on the inequality used to compute the confidence intervals, there are three possible PR methods with different features. Next, we use a pignistic probability transformation to convert these possibilistic functions into probability distributions following the insufficient reason principle. Finally, Thompson sampling techniques are used to identify the arm with the higher expected reward and play that arm. A numerical study analyses the performance of the proposed methods with respect to other policies in the literature. Two PR methods perform well in all representative scenarios under consideration, and are the best allocation strategies if truncated poisson or exponential distributions in [0,10] are considered for the arms.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 310, 8 October 2018, Pages 201-212

نویسندگان

Miguel MartÃn, Antonio Jiménez-MartÃn, Alfonso Mateos,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : روش پاداش های مثبت برای مشکل راهزنی چند مسلح

دسترسی سریع

ارتباط

English Website