Citation Export
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Park, Soohyun | - |
dc.contributor.author | Kim, Jae Pyoung | - |
dc.contributor.author | Park, Chanyoung | - |
dc.contributor.author | Jung, Soyi | - |
dc.contributor.author | Kim, Joongheon | - |
dc.date.issued | 2024-06-01 | - |
dc.identifier.uri | https://dspace.ajou.ac.kr/dev/handle/2018.oak/33635 | - |
dc.description.abstract | For Industry 4.0 Revolution, cooperative autonomous mobility systems are widely used based on multi-agent reinforcement learning (MARL). However, the MARL-based algorithms suffer from huge parameter utilization and convergence difficulties with many agents. To tackle these problems, a quantum MARL (QMARL) algorithm based on the concept of actor-critic network is proposed, which is beneficial in terms of scalability, to deal with the limitations in the noisy intermediate-scale quantum (NISQ) era. Additionally, our QMARL is also beneficial in terms of efficient parameter utilization and fast convergence due to quantum supremacy. Note that the reward in our QMARL is defined as task precision over computation time in multiple agents, thus, multi-agent cooperation can be realized. For further improvement, an additional technique for scalability is proposed, which is called projection value measure (PVM). Based on PVM, our proposed QMARL can achieve the highest reward by reducing the action dimension into a logarithmic-scale. Finally, we can conclude that our proposed QMARL with PVM outperforms the other algorithms in terms of efficient parameter utilization, fast convergence, and scalability. | - |
dc.description.sponsorship | This research was funded by the National Research Foundation of Korea (2022R1A2C2004869). | - |
dc.language.iso | eng | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.subject.mesh | Autonomous mobilities | - |
dc.subject.mesh | Convergence | - |
dc.subject.mesh | Convergence difficulty | - |
dc.subject.mesh | Fast convergence | - |
dc.subject.mesh | Machine learning algorithms | - |
dc.subject.mesh | Mobility systems | - |
dc.subject.mesh | Multi-agent reinforcement learning | - |
dc.subject.mesh | Projection value | - |
dc.subject.mesh | Quantum Computing | - |
dc.subject.mesh | Reinforcement learnings | - |
dc.title | Quantum Multi-Agent Reinforcement Learning for Autonomous Mobility Cooperation | - |
dc.type | Article | - |
dc.citation.endPage | 112 | - |
dc.citation.startPage | 106 | - |
dc.citation.title | IEEE Communications Magazine | - |
dc.citation.volume | 62 | - |
dc.identifier.bibliographicCitation | IEEE Communications Magazine, Vol.62, pp.106-112 | - |
dc.identifier.doi | 10.1109/mcom.020.2300199 | - |
dc.identifier.scopusid | 2-s2.0-85169695433 | - |
dc.identifier.url | https://ieeexplore.ieee.org/servlet/opac?punumber=35 | - |
dc.description.isoa | true | - |
dc.subject.subarea | Computer Science Applications | - |
dc.subject.subarea | Computer Networks and Communications | - |
dc.subject.subarea | Electrical and Electronic Engineering | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.