کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
479800 1446020 2014 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Robust decomposable Markov decision processes motivated by allocating school budgets
ترجمه فارسی عنوان
تصمیم گیری مستقیم تجزیه آمیز مارکوف از طریق تخصیص بودجه مدرسه ایجاد شده است
کلمات کلیدی
فرآیندهای مارکوف، کنترل پویا برنامه نویسی بهینه، بودجه مدرسه
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
چکیده انگلیسی


• We introduce and formalize the concept of robust, decomposable MDPs.
• We propose a tractable method to compute good policies in such MDPs.
• Funding policies from these MDPs outperform strategies based on a real-world policy.

Motivated by an application to school funding, we introduce the notion of a robust decomposable Markov decision process (MDP). A robust decomposable MDP model applies to situations where several MDPs, with the transition probabilities in each only known through an uncertainty set, are coupled together by joint resource constraints. Robust decomposable MDPs are different than both decomposable MDPs, and robust MDPs and cannot be solved by a direct application of the solution methods from either of those areas. In fact, to the best of our knowledge, there is no known method to tractably compute optimal policies in robust, decomposable MDPs. We show how to tractably compute good policies for this model, and apply the derived method to a stylized school funding example.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: European Journal of Operational Research - Volume 239, Issue 1, 16 November 2014, Pages 199–213
نویسندگان
, , ,