کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
713019 | 892161 | 2013 | 6 صفحه PDF | دانلود رایگان |
عنوان انگلیسی مقاله ISI
Robust Normal Two-Armed Bandit, One Arm Known, and Parallel Data Processing
دانلود مقاله + سفارش ترجمه
دانلود مقاله ISI انگلیسی
رایگان برای ایرانیان
موضوعات مرتبط
مهندسی و علوم پایه
سایر رشته های مهندسی
مکانیک محاسباتی
پیش نمایش صفحه اول مقاله

چکیده انگلیسی
We consider the two-armed bandit problem in the following robust (minimax) setting. Distributions of rewards corresponding to the first arm have known finite mathematical expectation. Distributions of rewards corresponding to the second arm are normal ones with unknown mathematical expectation and unit variance. According to the main theorem of the theory of games minimax strategy and minimax risk are searched as Bayes ones corresponding to the worst prior distribution. In considered case, the worst prior distribution is concentrated in two points. This allows one to use numerical optimization. Results are applied to systems with parallel data processing including distributions other than normal.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: IFAC Proceedings Volumes - Volume 46, Issue 11, 2013, Pages 263-268
Journal: IFAC Proceedings Volumes - Volume 46, Issue 11, 2013, Pages 263-268