Ajou University repository

Dynamic Reinforcement Learning for Optimal Go AI Training: Adaptive Adjustment and Optimization

DC Field | Value | Language
dc.contributor.author | Zhang, Chunjiong | -
dc.contributor.author | Shan, Gaoyang | -
dc.contributor.author | Lim, Junghyun | -
dc.contributor.author | Roh, Byeong Hee | -
dc.date.issued | 2024-01-01 | -
dc.identifier.uri | https://dspace.ajou.ac.kr/dev/handle/2018.oak/34577 | -
dc.description.abstract | Go is a popular strategy game today, but due to its large search space and task complexity, ensuring stable AI implementation is challenging. Specifically, Go AI training requires setting a fixed optimal learning rate and schedule, which demands significant TPU and GPU resources. To facilitate Go-AI learning, this research explores adaptive adjustment and optimization techniques for dynamic reinforcement learning neural networks. First, we introduce a dynamic batch size technique that adjusts data volume at each training phase and incorporates dynamic network structure search, considering the number of network layers and residual blocks. Second, we propose a dynamic network topology that automatically modifies the learning rate based on the training batch size at various training phases. Our approach outperforms the baseline in terms of stability and model convergence speed. In 100 games, the Go-AI model achieved a 100% victory rate below the 7th rank and a 98% win rate at the 9th rank and higher. | -
dc.description.sponsorship | This work was supported partially by the Brain Korea 21 (BK21) FOUR program of the National Research Foundation of Korea funded by the Ministry of Education (NRF5199991514504). (Corresponding author: Byeong-hee Roh, E-mail: bhroh@ajou.ac.kr) C. Zhang, J. Lim and B. Roh are with the Department of AI Convergence Network, Ajou University, Suwon, 16499, Korea. (E-mail: {cjz, wjdguszoqt, bhroh}@ajou.ac.kr) G. Shan is with the Department of Software and Computer Engineering, Ajou University, Suwon, 16499, Korea. (E-mail: shanyang166@ajou.ac.kr) Manuscript received xxx; revised xxx. | -
dc.language.iso | eng | -
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | -
dc.subject.mesh | Adaptive | -
dc.subject.mesh | Adaptive adjustment | -
dc.subject.mesh | Adaptive optimization | -
dc.subject.mesh | Batch sizes | -
dc.subject.mesh | Dynamic reinforcements | -
dc.subject.mesh | Go AI | -
dc.subject.mesh | Learning rates | -
dc.subject.mesh | Neural-networks | -
dc.subject.mesh | Training phasis | -
dc.subject.mesh | Weight | -
dc.title | Dynamic Reinforcement Learning for Optimal Go AI Training: Adaptive Adjustment and Optimization | -
dc.type | Article | -
dc.citation.title | IEEE Transactions on Consumer Electronics | -
dc.identifier.bibliographicCitation | IEEE Transactions on Consumer Electronics | -
dc.identifier.doi | 10.1109/tce.2024.3487141 | -
dc.identifier.scopusid | 2-s2.0-85208278249 | -
dc.identifier.url | https://ieeexplore.ieee.org/servlet/opac?punumber=30 | -
dc.subject.keyword | adaptive | -
dc.subject.keyword | Go AI | -
dc.subject.keyword | learning rate | -
dc.subject.keyword | neural networks | -
dc.subject.keyword | weights | -
dc.description.isoa | false | -
dc.subject.subarea | Media Technology | -
dc.subject.subarea | Electrical and Electronic Engineering | -

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

