دانلود رایگان مقاله: برنامه ریزی پویا تقریبی برای مشکلات کنترل خطی تصادفی در فضاهای حالت فشرده

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
479605	1446014	2015	14 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Approximate dynamic programming for stochastic linear control problems on compact state spaces

ترجمه فارسی عنوان

برنامه ریزی پویا تقریبی برای مشکلات کنترل خطی تصادفی در فضاهای حالت فشرده

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

برنامه نویسی دینامیک، فرآیندهای مارکوف، فهرست، منابع دوگانه، تامین منابع چندگانه

Inventory - موجودی کالا Dynamic programming - برنامه‌ریزی پویا یا برنامه‌ نویسی پویا Multiple sourcing - تامین منابع چندگانه Markov processes - فرآیندهای مارکوف Dual sourcing - منابع دوگانه

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)

پیش نمایش مقاله

برنامه ریزی پویا تقریبی برای مشکلات کنترل خطی تصادفی در فضاهای حالت فشرده

چکیده انگلیسی

• Proof of existence of optimal policies for stochastic linear control problems.
• Proof of existence of convex bounded solutions to Average Cost Optimality Equation.
• New ADP algorithm providing good policies and lower bounds to the optimal costs.
• Tests of algorithm on multiple sourcing where optimality gap can be bounded by 5%.
• On all tested instances algorithm performs better than state-of-the-art heuristic.

This paper addresses Markov Decision Processes over compact state and action spaces. We investigate the special case of linear dynamics and piecewise-linear and convex immediate costs for the average cost criterion. This model is very general and covers many interesting examples, for instance in inventory management. Due to the curse of dimensionality, the problem is intractable and optimal policies usually cannot be computed, not even for instances of moderate size.We show the existence of optimal policies and of convex and bounded relative value functions that solve the average cost optimality equation under reasonable and easy-to-check assumptions. Based on these insights, we propose an approximate relative value iteration algorithm based on piecewise-linear convex relative value function approximations. Besides computing good policies, the algorithm also provides lower bounds to the optimal average cost, which allow us to bound the optimality gap of any given policy for a given instance.The algorithm is applied to the well-studied Multiple Sourcing Problem as known from inventory management. Multiple sourcing is known to be a hard problem and usually tackled by parametric heuristics. We analyze several MSP instances with two and more suppliers and compare our results to state-of-the-art heuristics. For the considered scenarios, our policies are always at least as good as the best known heuristic, and strictly better in most cases. Moreover, by using the computed lower bounds we show for all instances that the optimality gap has never exceeded 5%, and that it has been much smaller for most of them.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: European Journal of Operational Research - Volume 241, Issue 1, 16 February 2015, Pages 85–98

نویسندگان

Stefan Woerner, Marco Laumanns, Rico Zenklusen, Apostolos Fertis,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : برنامه ریزی پویا تقریبی برای مشکلات کنترل خطی تصادفی در فضاهای حالت فشرده

دسترسی سریع

ارتباط

English Website