کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
9505968 1340365 2005 36 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Arbitrary side observations in bandit problems
موضوعات مرتبط
مهندسی و علوم پایه ریاضیات ریاضیات کاربردی
پیش نمایش صفحه اول مقاله
Arbitrary side observations in bandit problems
چکیده انگلیسی
A bandit problem with side observations is an extension of the traditional two-armed bandit problem, in which the decision maker has access to side information before deciding which arm to pull. In this paper, essential properties of the side observations that allow achievability results with respect to optimal regret are extracted and formalized. The sufficient conditions for good side information obtained here admit various types of random processes as special cases, including i.i.d. sequences, Markov chains, deterministic periodic sequences, etc. A simple necessary condition for optimal regret is given, providing further insight into the nature of bandit problems with side observations. A game-theoretic approach simplifies the analysis and justifies the viewpoint that the side observation serves as an index specifying different sub-bandit machines.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Advances in Applied Mathematics - Volume 34, Issue 4, May 2005, Pages 903-938
نویسندگان
, , ,