MILP based value backups in partially observed Markov decision processes (POMDPs) with very large or continuous action and observation spaces

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
6595838	458550	2013	13 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Mathematical programming - برنامه ریزی ریاضی Dynamic programming - برنامه‌ریزی پویا یا برنامه‌ نویسی پویا Network reliability - قابلیت اطمینان شبکه Partial observation - مشاهده جزئی Markov decision processes - پروسه تصمیم گیری مارکوف

موضوعات مرتبط

مهندسی و علوم پایه مهندسی شیمی مهندسی شیمی (عمومی)

پیش نمایش صفحه اول مقاله

MILP based value backups in partially observed Markov decision processes (POMDPs) with very large or continuous action and observation spaces

چکیده انگلیسی

Partially observed Markov decision processes (POMDPs) serve as powerful tools to model stochastic systems with partial state information. Since the exact solution methods for POMDPs are limited to problems with very small sizes of state, action and observation spaces, approximate point-based solution methods like PERSEUS have gained popularity. In this work, a mixed integer linear program (MILP) is developed for calculation of exact value updates (in PERSEUS and similar algorithms), when the POMDP has very large or continuous action space. Since the solution time of the MILP is very sensitive to the size of the observation space, the concept of post-decision belief space is introduced to generate a more efficient and flexible model. An example involving a flow network is presented to illustrate the concepts and compare the results with those of the existing techniques.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computers & Chemical Engineering - Volume 56, 13 September 2013, Pages 101-113

نویسندگان

Rakshita Agrawal, Matthew J. Realff, Jay H. Lee,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

MILP based value backups in partially observed Markov decision processes (POMDPs) with very large or continuous action and observation spaces

دسترسی سریع

ارتباط

English Website