Ajou University repository

Hyperparameter Optimization for Deep Reinforcement Learning Using the Harmony Search Algorithm (하모니 서치 알고리즘을 이용한 심층 강화학습 하이퍼파라미터 최적화)
  • 안상현 ;
  • 김동욱 ;
  • 이관우 ;
  • 이기한 ;
  • 박상철
Publication Year
2023-06
Journal
한국CDE학회 논문집
Publisher
한국CDE학회
Citation
한국CDE학회 논문집, Vol.28 No.2, pp.97-106
Keyword
Hyperparameter tuning; Optimization; Reinforcement learning
Abstract
This study demonstrates that the Harmony Search Algorithm (HSA) is effective for hyperparameter optimization in Deep Reinforcement Learning (DeepRL) when the environment has a well-designed reward function. To address the reproducibility issue in DeepRL, the algorithm was modified in two ways: the best parameters of each generation are adopted regardless of the harmony memory consideration rate (HMCR), and the best parameters are exempted from the pitch adjustment rate (PAR). The objective function was set to either the cumulative reward or the terminal reward, depending on the environment. The parameters of the PPO algorithm and of the actor-critic network were optimized in five different environments. The results show that harmony search can optimize hyperparameters even in large, complex environments with substantial interactions, provided the reward function is well designed.
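The modification described in the abstract can be illustrated with a minimal Python sketch. It is one possible reading, not the authors' implementation: it assumes a generational scheme in which the best harmony is copied into the next generation unchanged, bypassing both HMCR and PAR, while every other harmony is improvised by the standard rules. The function names, the `bandwidth` parameter, and all default values are hypothetical, and `objective` stands in for a DeepRL training run that returns a cumulative or terminal reward.

```python
import random


def improvise(memory, bounds, hmcr, par, bandwidth, rng):
    """Build one new harmony with the standard HMCR / PAR rules."""
    harmony = []
    for d, (lo, hi) in enumerate(bounds):
        if rng.random() < hmcr:
            # Memory consideration: reuse a stored value for this dimension.
            value = rng.choice(memory)[1][d]
            if rng.random() < par:
                # Pitch adjustment: small perturbation scaled to the range.
                value += rng.uniform(-1.0, 1.0) * bandwidth * (hi - lo)
        else:
            # Random selection from the allowed range.
            value = rng.uniform(lo, hi)
        harmony.append(min(hi, max(lo, value)))
    return harmony


def elitist_harmony_search(objective, bounds, hms=8, hmcr=0.9, par=0.3,
                           bandwidth=0.05, generations=50, seed=0):
    """Maximize `objective` over box `bounds`; returns (score, vector, history)."""
    rng = random.Random(seed)
    # Harmony memory holds (score, vector) pairs.
    memory = []
    for _ in range(hms):
        vec = [rng.uniform(lo, hi) for lo, hi in bounds]
        memory.append((objective(vec), vec))

    best_history = []
    for _ in range(generations):
        best = max(memory, key=lambda entry: entry[0])
        # Assumed elitism step: the best harmony is carried over as-is,
        # so neither HMCR nor PAR can alter it.
        next_memory = [best]
        while len(next_memory) < hms:
            vec = improvise(memory, bounds, hmcr, par, bandwidth, rng)
            next_memory.append((objective(vec), vec))
        memory = next_memory
        best_history.append(max(memory, key=lambda entry: entry[0])[0])

    best_score, best_vec = max(memory, key=lambda entry: entry[0])
    return best_score, best_vec, best_history


if __name__ == "__main__":
    # Toy stand-in for a reward signal; a real run would train a PPO agent.
    def toy_reward(params):
        return -sum((x - 0.5) ** 2 for x in params)

    score, params, history = elitist_harmony_search(
        toy_reward, bounds=[(0.0, 1.0), (0.0, 1.0)])
    print(score, params)
```

Because the best harmony is never perturbed, the best score recorded per generation cannot decrease in this sketch. With a noisy objective such as a DeepRL training run, the stored score of the elite would be a single sample, so re-evaluating it would be a reasonable design variation.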
ISSN
2508-4003
Language
Korean
URI
https://aurora.ajou.ac.kr/handle/2018.oak/37833
https://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART002963515
DOI
https://doi.org/10.7315/CDE.2023.097
Type
Article


Related Researcher

Park, SangChul (박상철)
Department of Industrial Engineering
