Ajou University repository

Navigating the non-compliance effects on system optimal route guidance using reinforcement learningoa mark
  • Yun, Hyunsoo ;
  • Kim, Eui jin ;
  • Ham, Seung Woo ;
  • Kim, Dong Kyu
Citations

SCOPUS

1

Citation Export

Publication Year
2024-08-01
Publisher
Elsevier Ltd
Citation
Transportation Research Part C: Emerging Technologies, Vol.165
Keyword
Autonomous vehiclesDynamic traffic assignmentMulti-agent reinforcement learningReinforcement learningSystem optimal assignment
Mesh Keyword
Autonomous VehiclesDynamic trafficDynamic traffic assignmentMulti-agent reinforcement learningOptimal assignmentOptimal routesReinforcement learningsSystem optimal assignmentSystem-optimalTraffic assignment
All Science Classification Codes (ASJC)
Civil and Structural EngineeringAutomotive EngineeringTransportationManagement Science and Operations Research
Abstract
We consider a scenario where the transportation management center (TMC) guides future autonomous vehicles (AVs) toward optimal routes, aiming to bring the network in line with the system optimal (SO) principle. However, achieving this requires a joint decision-making process, while users may be non-compliant with the TMC's route guidance for personal gain. This paper models a future transportation network with a microscopic simulation, to introduce a novel concept of mixed equilibrium. In this framework, AVs follow the TMC's SO route guidance, while users can dynamically choose to either comply or manually override this autonomy based on their own judgment. We initially model a fully compliant scenario, where the centralized Q-network, analogous to a TMC, is trained using reinforcement learning (RL) to minimize total system travel time (TSTT), providing optimal routes to users. Subsequently, we extend the problem setting to a multi-agent reinforcement learning (MARL) scenario, where users can comply or deviate from the TMC's guidance based on their own decision-making. Through neural fictitious self-play (NFSP), we employ a modulating hyperparameter to investigate the impact of varying degrees of non-compliance on the overall system. Results indicate that our RL approach holds significant potential for addressing the dynamic system optimal assignment problem. Remarkably, the TMC's route guidance retains the essence of SO while integrating some level of non-compliance. However, we also demonstrate that dominant user-centric decision-making may lead to system inefficiencies while creating disparities among users. Our framework serves as an innovative tool in an AV-dominant future, offering a realistic perspective on network performance that aids in formulating effective traffic management strategies.
ISSN
0968-090X
Language
eng
URI
https://dspace.ajou.ac.kr/dev/handle/2018.oak/34303
DOI
https://doi.org/10.1016/j.trc.2024.104721
Fulltext

Type
Article
Funding
This research was supported by Korea Ministry of Land, Infrastructure and Transport (MOLIT) as [Innovative Talent Education Program for Smart City], and in part by the National Research Foundation of Korea (NRF) grant funded by the Korea Government ( MSIT ) (No. 2022R1A2C2012835 ). This work was also supported by Korea Institute of Police Technology (KIPoT) grant funded by the Korea Government ( KNPA ) (No. 092021C28S02000 ).
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Kim, Eui-Jin Image
Kim, Eui-Jin김의진
Department of Transportation System Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.