High altitude platform station (HAPS) and low Earth orbit (LEO) satellites that provide services to the ground in the line of sight (LoS) environment are gaining attention as next-generation communication networks. Additionally, integrating HAPS and satellite networks is being researched to provide high-quality communication services to terrestrial environments. However, a critical issue in integrated networks is the increased interference, which leads to degraded signal quality for users and a higher outage probability when the same frequency is used. In this paper, we propose a method to reduce outage probability for users being served by the HAPS by adjusting the transmission power of the satellite using deep reinforcement learning (DRL). This paper uses a multi-agent deep Q-network (MADQN) and an attention-based A-MADQN, which combines MADQN with an attention mechanism. By employing reinforcement learning, the optimal transmission power can be determined. The simulations illustrate confirmed a reduction in user outage probability compared to the conventional satellite transmission power for each cell.
This work was partly supported by the National Research Foundation of Korea(NRF) grant funded by the Korea government(MSIT) (No. RS-2024-00359330, Design of Low Earth Orbit Satellite Communication System) and Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government (MSIT) (No. RS-2022-II220704, Development of 3D-NET Core Technology for High-Mobility Vehicular Service).