Citation Export
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Oh, Sangyoon | - |
| dc.contributor.author | Choi, Jiheon | - |
| dc.contributor.author | Lee, Jaehyun | - |
| dc.contributor.author | Choo, Minsol | - |
| dc.contributor.author | Yoon, Taeyoung | - |
| dc.contributor.author | Kwon, Oh Kyoung | - |
| dc.date.issued | 2025-03-27 | - |
| dc.identifier.uri | https://aurora.ajou.ac.kr/handle/2018.oak/38559 | - |
| dc.identifier.uri | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105002271701&origin=inward | - |
| dc.description.abstract | High-performance computing (HPC) systems face complex resource management challenges as they accommodate diverse workloads, from traditional scientific simulations to emerging AI workflows. Accurate workload characterization is essential for optimal resource allocation, yet existing approaches often require extensive labeled data or rely on rigid heuristics that struggle to adapt to new workload patterns. This paper presents RF-AUS (Random Forest Adaptive Uncertainty Sampling), a novel active learning framework designed for HPC workload classification. Our proposed RF-AUS leverages ensemble-based uncertainty measurement, out-of-bag estimates, and feature importance weighting to achieve high classification performance with minimal labeled data. In addition, it employs a diversity-aware sampling strategy to balance exploration and exploitation, ensuring comprehensive coverage of the feature space while prioritizing the most informative instances. Using real-world workload data from NURION, a leading HPC system in South Korea, we demonstrate that RF-AUS achieves a comparable classification accuracy with 47.53% fewer labeled samples than conventional methods. RF-AUS shows robust performance across diverse applications, making it particularly well suited for production HPC environments where efficient resource utilization is crucial. | - |
| dc.description.sponsorship | The authors wish to acknowledge the use of Google Translate, Grammarly, Writefull, and ChatGPT in the writing of this paper. Google Translate was used to assist with translation of certain sentences from its original language, and Grammarly, Writefull, and ChatGPT were used to enhance the grammar, clarity, and style of the manuscript. The paper remains an accurate representation of the authors' underlying work and novel intellectual contributions. This work was supported by the National Research Foundation of Korea(NRF) grant funded by the Korea government(MSIT) (RS- 2023-00283799). The corresponding author is Sangyoon Oh. | - |
| dc.language.iso | eng | - |
| dc.publisher | Association for Computing Machinery, Inc | - |
| dc.subject.mesh | Active Learning | - |
| dc.subject.mesh | High performance computing systems | - |
| dc.subject.mesh | High-performance computing application characteristic prediction | - |
| dc.subject.mesh | High-performance computing applications | - |
| dc.subject.mesh | Labeled data | - |
| dc.subject.mesh | Performance | - |
| dc.subject.mesh | Performance computing | - |
| dc.subject.mesh | Random forests | - |
| dc.subject.mesh | Resource management | - |
| dc.subject.mesh | Uncertainty samplings | - |
| dc.title | When HPC Scheduling Meets Active Learning: Maximizing The Performance with Minimal Data | - |
| dc.type | Conference | - |
| dc.citation.conferenceDate | 2025.02.19.~2025.02.21. | - |
| dc.citation.conferenceName | 8th International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2025 | - |
| dc.citation.edition | Proceedings of International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2025 | - |
| dc.citation.endPage | 109 | - |
| dc.citation.startPage | 99 | - |
| dc.citation.title | Proceedings of International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2025 | - |
| dc.identifier.bibliographicCitation | Proceedings of International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2025, pp.99-109 | - |
| dc.identifier.doi | 10.1145/3712031.3712334 | - |
| dc.identifier.scopusid | 2-s2.0-105002271701 | - |
| dc.identifier.url | http://dl.acm.org/citation.cfm?id=3712031 | - |
| dc.subject.keyword | Active Learning | - |
| dc.subject.keyword | HPC Application Characteristic Prediction | - |
| dc.subject.keyword | Uncertainty Sampling | - |
| dc.type.other | Conference Paper | - |
| dc.description.isoa | false | - |
| dc.subject.subarea | Hardware and Architecture | - |
| dc.subject.subarea | Software | - |
| dc.subject.subarea | Theoretical Computer Science | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.