In this study, we consider wireless covert communication within unmanned aerial vehicle (UAV) environments. Here, the UAV functions as a covert transmitter, sending data to predetermined ground receivers while avoiding detection by ground-based detectors. We aim to maximize the UAVs' through-put and the detector's minimum detection error probability by optimizing the UAV's transmission power and positioning through Q-learning. We utilize reinforcement learning to de-termine UAVs' optimal transmission power and location in complex environments, ensuring effective problem-solving even in challenging scenarios.
This research was supported in part by Korea Institute of Science and Technology Information (No. (KISTI)K25L4M1C3, Construction of Information security scheme for supercomputing environment based on AI, '25) and in part by Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government(MSIT) (No.RS-2024-00396992 Development of Cube Satellites Based on Core Technologies in Low Earth Orbit Satellite Communications)