This article proposes a novel algorithm, named quantum multiagent actor-critic networks (QMACN) for autonomously constructing a robust mobile access system employing multiple unmanned aerial vehicles (UAVs). In the context of facilitating collaboration among multiple UAVs, the application of multiagent reinforcement learning (MARL) techniques is regarded as a promising approach. These methods enable UAVs to learn collectively, optimizing their actions within a shared environment, ultimately leading to more efficient cooperative behavior. Furthermore, the principles of quantum computing (QC) are employed in our study to enhance the training process and inference capabilities of the UAVs involved. By leveraging the unique computational advantages of QC, our approach aims to boost the overall effectiveness of the UAV system. However, employing a QC introduces scalability challenges due to the near intermediate-scale quantum (NISQ) limitation associated with qubit usage. The proposed algorithm addresses this issue by implementing a quantum centralized critic, effectively mitigating the constraints imposed by NISQ limitations. Additionally, the advantages of the QMACN with performance improvements in terms of training speed and wireless service quality are verified via various data-intensive evaluations. Furthermore, this article validates that a noise injection scheme can be used for handling environmental uncertainties in order to realize robust mobile access.
This work was supported by the JSPS/NRF/NSFC A3 Foresight Program. This article was presented in part at IEEE International Conference on Distributed Computing Systems (ICDCS), Bologna, Italy, July 2022 [DOI: 10.1109/ICDCS54860.2022.00151].