We consider a scenario in which the transportation management center (TMC) guides future autonomous vehicles (AVs) toward optimal routes, aiming to steer the network toward the system optimal (SO) principle. Achieving this, however, requires joint decision-making, and users may defect from the TMC's route guidance for personal gain. This paper models a future transportation network in a microscopic simulation and introduces a novel concept of mixed equilibrium: AVs follow the TMC's SO route guidance, while users can dynamically choose either to comply or to manually override this autonomy based on their own judgment. We first model a fully compliant scenario, in which a centralized Q-network, analogous to a TMC, is trained with reinforcement learning (RL) to minimize total system travel time (TSTT) and provide optimal routes to users. We then extend the problem to a multi-agent reinforcement learning (MARL) setting, in which users may comply with or deviate from the TMC's guidance based on their own decision-making. Through neural fictitious self-play (NFSP), we employ a modulating hyperparameter to investigate how varying degrees of non-compliance affect the overall system. Results indicate that our RL approach holds significant potential for the dynamic system optimal assignment problem. Remarkably, the TMC's route guidance preserves the essence of SO even when some level of non-compliance is present. However, we also show that dominant user-centric decision-making can cause system inefficiencies and create disparities among users. Our framework serves as an innovative tool for an AV-dominant future, offering a realistic perspective on network performance that aids in formulating effective traffic management strategies.
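To make the compliance mechanism concrete, the following is a minimal sketch of how an NFSP-style anticipatory parameter can modulate a user agent's choice between complying with the TMC's SO route and playing a self-interested best response. The names here (NFSPUser, eta, choose_route) are our own illustrative assumptions, not the paper's exact implementation.

```python
import random

import numpy as np


class NFSPUser:
    """Illustrative NFSP-style user agent (hypothetical names).

    With probability eta (the anticipatory parameter), the agent plays a
    best response from its own Q-value estimates, i.e., it overrides the
    TMC's guidance; otherwise it complies with the TMC's SO route.
    """

    def __init__(self, eta: float):
        self.eta = eta  # probability of self-interested (non-compliant) play

    def choose_route(self, tmc_route: int, q_values: np.ndarray) -> int:
        if random.random() < self.eta:
            # Non-compliance: pick the route with the highest
            # user-estimated return.
            return int(np.argmax(q_values))
        # Compliance: follow the TMC's system-optimal route guidance.
        return tmc_route


# Usage: eta = 0 recovers the fully compliant scenario; eta -> 1 yields
# dominant user-centric decision-making.
user = NFSPUser(eta=0.3)
route = user.choose_route(tmc_route=2, q_values=np.array([1.2, 0.7, 0.9]))
```

In this reading, the modulating hyperparameter sweeps between the fully compliant centralized regime and the fully user-centric regime, which is what lets the experiments probe intermediate degrees of non-compliance.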
This research was supported in part by the Korea Ministry of Land, Infrastructure and Transport (MOLIT) under the Innovative Talent Education Program for Smart City, and in part by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2022R1A2C2012835). This work was also supported by the Korea Institute of Police Technology (KIPoT) grant funded by the Korea government (KNPA) (No. 092021C28S02000).