کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1115889 1488448 2014 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Solving Network Design Problem with Dynamic Network Loading Profiles Using Modified Reinforcement Learning Method
موضوعات مرتبط
علوم انسانی و اجتماعی علوم انسانی و هنر هنر و علوم انسانی (عمومی)
پیش نمایش صفحه اول مقاله
Solving Network Design Problem with Dynamic Network Loading Profiles Using Modified Reinforcement Learning Method
چکیده انگلیسی

This study aims to solve dynamic user Equilibrium Network Design Problem (ENDP) with dynamic network loading profiles using modified Reinforcement Learning (RL) approach. The bi-level programming technique is used to solve the problem. At the lower level of the problem, the dynamic User Equilibrium (UE) link flows are obtained by simulation based Dynamic Traffic Assignment (DTA) model with DynusT and signal timings are obtained at the upper level by modified RL method. The system Performance Index (PI) is defined as the sum of a weighted linear combination of delay and number of stops per unit time for all traffic streams, which is evaluated by the traffic model of TRANSYT-7F. Q-learning, a model-free approach, is one of the RL methods. The modified RL method is actually based on Q-learning. By integrating the modified RL method, traffic assignment and traffic control, the modified REinforcement Learning TRANSYT-7F DynusT (RELTRAD) model is proposed to solve the dynamic ENDP. The objective function of the proposed RELTRAD is total network PI. The model is tested on the medium sized Allsop and Charlesworth's network. Two scenarios, related to various dynamic network loading profiles, are proposed for numerical application. Encouraging results are obtained. Results showed that the RELTRAD model effectively optimizes the signal timings and values of the network PI. The RELTRAD model improves to the network PI from the initial value to the final value as 65% and 67% for loading profile 1 and 2, respectively.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia - Social and Behavioral Sciences - Volume 111, 5 February 2014, Pages 38-47