A policy gradient method for semi-Markov decision processes with application to call admission control

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
483225	1446201	2007	11 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Semi-Markov decision process - روند تصمیم گیری نیمه مارکوف Policy gradient - شیب خط مشی Stochastic processes - فرایندهای تصادفی Call admission control - کنترل پذیرش تماس بگیرید

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)

پیش نمایش صفحه اول مقاله

A policy gradient method for semi-Markov decision processes with application to call admission control

چکیده انگلیسی

Solving a semi-Markov decision process (SMDP) using value or policy iteration requires precise knowledge of the probabilistic model and suffers from the curse of dimensionality. To overcome these limitations, we present a reinforcement learning approach where one optimizes the SMDP performance criterion with respect to a family of parameterised policies. We propose an online algorithm that simultaneously estimates the gradient of the performance criterion and optimises it using stochastic approximation. We apply our algorithm to call admission control. Our proposed policy gradient SMDP algorithm and its application to admission control is novel.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: European Journal of Operational Research - Volume 178, Issue 3, 1 May 2007, Pages 808–818

نویسندگان

Sumeetpal S. Singh, Vladislav B. Tadić, Arnaud Doucet,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

A policy gradient method for semi-Markov decision processes with application to call admission control

دسترسی سریع

ارتباط

English Website