Robust Normal Two-Armed Bandit, One Arm Known, and Parallel Data Processing

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
713019	892161	2013	6 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

موضوعات مرتبط

مهندسی و علوم پایه سایر رشته های مهندسی مکانیک محاسباتی

پیش نمایش صفحه اول مقاله

Robust Normal Two-Armed Bandit, One Arm Known, and Parallel Data Processing

چکیده انگلیسی

We consider the two-armed bandit problem in the following robust (minimax) setting. Distributions of rewards corresponding to the first arm have known finite mathematical expectation. Distributions of rewards corresponding to the second arm are normal ones with unknown mathematical expectation and unit variance. According to the main theorem of the theory of games minimax strategy and minimax risk are searched as Bayes ones corresponding to the worst prior distribution. In considered case, the worst prior distribution is concentrated in two points. This allows one to use numerical optimization. Results are applied to systems with parallel data processing including distributions other than normal.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: IFAC Proceedings Volumes - Volume 46, Issue 11, 2013, Pages 263-268

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Robust Normal Two-Armed Bandit, One Arm Known, and Parallel Data Processing

دسترسی سریع

ارتباط

English Website