کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
720765 892300 2007 6 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
MINIMAX R-STAGE STRATEGY FOR THE MULTI-ARMED BANDIT PROBLEM
موضوعات مرتبط
مهندسی و علوم پایه سایر رشته های مهندسی مکانیک محاسباتی
پیش نمایش صفحه اول مقاله
MINIMAX R-STAGE STRATEGY FOR THE MULTI-ARMED BANDIT PROBLEM
چکیده انگلیسی

The r-stage multi-armed bandit problem is considered in minimax setting on the finite sufficiently large time interval T. A sequential control procedure with a priori specified magnitudes of learning stages and thresholds is offered. The value of the minimax risk close to Tα with α = 2r–1/(2r – 1) is obtained. The applications to information transmission and medical treatments are discussed. Considered approach is especially valuable for systems with parallel processing in which the number of stages r mainly influences the total duration of the process.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: IFAC Proceedings Volumes - Volume 40, Issue 13, 2007, Pages 380–385
نویسندگان
, ,