In this work, we present a deep reinforcement learning-based approach as a baseline system for autonomous propofol infusion control. Specifically, design an environment for simulating the possible conditions of a target patient based on input demographic data and design our reinforcement learning model-based system so that it effectively makes predictions on the proper level of propofol infusion to maintain stable anesthesia even under dynamic conditions that can affect the decision-making process, such as the manual control of remifentanil by anesthesiologists and the varying patient conditions under anesthesia. Through an extensive set of evaluations using patient data from 3000 subjects, we show that the proposed method results in stabilization in the anesthesia state, by managing the bispectral index (BIS) and effect-site concentration for a patient showing varying conditions.
This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. NRF-2020R1C1C1014905 ), the Ministry of Science and ICT\u2019s ITRC Program supervised by IITP ( IITP-2021-2020-0-01461 ), and the National Research Foundation of Korea ( 2022R1A2C2004869 ).