To extend the network's life cycle in wireless sensor networks, clustering plays an important role in balancing energy consumption. In this paper, we propose a novel clustering method based on reinforcement learning that integrates cluster head selection and cluster formation as one step. It considers both energy efficiency and inter cluster interference in the model-free design, thus achieving longer network lifetime and higher quality of packet transmission. To the best of our knowledge, our work is the first paper that integrates cluster head selection and cluster formation using reinforcement learning. Our extensive simulation results show that the proposed method improves the network lifetime by 65% and 29% compared with Low Energy Adaptive Clustering Hierarchy (LEACH) and Greedy Energy Efficient Clustering Scheme (GEECS), respectively, while the data transmission success rate is also increased by 42% and 31%, respectively.
VII. ACKNOWLEDGEMENT This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2023R1A2C1003783) and the BK21 FOUR program of the National Research Foundation of Korea funded by the Ministry of Education (NRF5199991014091).