کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6892735 699056 2017 14 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Least squares approximate policy iteration for learning bid prices in choice-based revenue management
ترجمه فارسی عنوان
کمترین مربعات تقریبی تکرار سیاست برای یادگیری قیمت پیشنهادات در مدیریت درآمد مبتنی بر انتخاب
کلمات کلیدی
مدیریت درآمد، کنترل ظرفیت، برنامه ریزی پویا تقریبی تقریبا تکرار سیاست،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
چکیده انگلیسی
In this paper, we develop a least squares approximate policy iteration (API) approach which belongs to the second class. Thereby, we suggest value function approximations that are linear in the parameters, and we estimate the parameters via linear least squares regression. Exploiting both exact and heuristic knowledge from the value function, we enforce structural constraints on the parameters to facilitate learning a good policy. We perform an extensive simulation study to investigate the performance of our approach. The results show that it is able to obtain competitive revenues compared to and often outperforms state-of-the-art capacity control methods in reasonable computational time. Depending on the scarcity of capacity and the point in time, revenue improvements of around 1% or more can be observed. Furthermore, the proposed approach contributes to simulation-based ADP, bringing forth research on numerically estimating piecewise linear value function approximations and their application in revenue management environments.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computers & Operations Research - Volume 77, January 2017, Pages 240-253
نویسندگان
,