کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6935745 1449655 2018 19 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A reinforcement learning framework for the adaptive routing problem in stochastic time-dependent network
ترجمه فارسی عنوان
یک چارچوب یادگیری تقویت کننده برای مشکل مسیریابی سازگار در شبکه وابسته به زمان اتفاقی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی
Most previous work in addressing the adaptive routing problem in stochastic and time-dependent (STD) network has been focusing on developing parametric models to reflect the network dynamics and designing efficient algorithms to solve these models. However, strong assumptions need to be made in the models and some algorithms also suffer from the curse of dimensionality. In this paper, we examine the application of Reinforcement Learning as a non-parametric model-free method to solve the problem. Both the online Q learning method for discrete state space and the offline fitted Q iteration algorithm for continuous state space are discussed. With a small case study on a mid-sized network, we demonstrate the significant advantages of using Reinforcement Learning to solve for the optimal routing policy over traditional stochastic dynamic programming method. And the fitted Q iteration algorithm combined with tree-based function approximation is shown to outperform other methods especially during peak demand periods.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Transportation Research Part C: Emerging Technologies - Volume 93, August 2018, Pages 179-197
نویسندگان
, ,