Quantum Multi-Agent Reinforcement Learning for Autonomous Mobility Cooperation

Park, Soohyun; Kim, Jae Pyoung; Park, Chanyoung; Jung, Soyi; Kim, Joongheon

DC Field	Value	Language
dc.contributor.author	Park, Soohyun	-
dc.contributor.author	Kim, Jae Pyoung	-
dc.contributor.author	Park, Chanyoung	-
dc.contributor.author	Jung, Soyi	-
dc.contributor.author	Kim, Joongheon	-
dc.date.issued	2024-06-01	-
dc.identifier.uri	https://dspace.ajou.ac.kr/dev/handle/2018.oak/33635	-
dc.description.abstract	For Industry 4.0 Revolution, cooperative autonomous mobility systems are widely used based on multi-agent reinforcement learning (MARL). However, the MARL-based algorithms suffer from huge parameter utilization and convergence difficulties with many agents. To tackle these problems, a quantum MARL (QMARL) algorithm based on the concept of actor-critic network is proposed, which is beneficial in terms of scalability, to deal with the limitations in the noisy intermediate-scale quantum (NISQ) era. Additionally, our QMARL is also beneficial in terms of efficient parameter utilization and fast convergence due to quantum supremacy. Note that the reward in our QMARL is defined as task precision over computation time in multiple agents, thus, multi-agent cooperation can be realized. For further improvement, an additional technique for scalability is proposed, which is called projection value measure (PVM). Based on PVM, our proposed QMARL can achieve the highest reward by reducing the action dimension into a logarithmic-scale. Finally, we can conclude that our proposed QMARL with PVM outperforms the other algorithms in terms of efficient parameter utilization, fast convergence, and scalability.	-
dc.description.sponsorship	This research was funded by the National Research Foundation of Korea (2022R1A2C2004869).	-
dc.language.iso	eng	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.subject.mesh	Autonomous mobilities	-
dc.subject.mesh	Convergence	-
dc.subject.mesh	Convergence difficulty	-
dc.subject.mesh	Fast convergence	-
dc.subject.mesh	Machine learning algorithms	-
dc.subject.mesh	Mobility systems	-
dc.subject.mesh	Multi-agent reinforcement learning	-
dc.subject.mesh	Projection value	-
dc.subject.mesh	Quantum Computing	-
dc.subject.mesh	Reinforcement learnings	-
dc.title	Quantum Multi-Agent Reinforcement Learning for Autonomous Mobility Cooperation	-
dc.type	Article	-
dc.citation.endPage	112	-
dc.citation.startPage	106	-
dc.citation.title	IEEE Communications Magazine	-
dc.citation.volume	62	-
dc.identifier.bibliographicCitation	IEEE Communications Magazine, Vol.62, pp.106-112	-
dc.identifier.doi	10.1109/mcom.020.2300199	-
dc.identifier.scopusid	2-s2.0-85169695433	-
dc.identifier.url	https://ieeexplore.ieee.org/servlet/opac?punumber=35	-
dc.description.isoa	true	-
dc.subject.subarea	Computer Science Applications	-
dc.subject.subarea	Computer Networks and Communications	-
dc.subject.subarea	Electrical and Electronic Engineering	-

Show simple item record

qrcode

트윗하기

Related Researcher

Jung, Soyi정소이: Department of Electrical and Computer Engineering

File Download

There are no files associated with this item.

Related Researcher

Total Views & Downloads

File Download