Towards an Energy Efficient Computing with Coordinated Performance-Aware Scheduling in Large Scale Data Clusters

Hamandawana, Prince; Mativenga, Ronnie; Kwon, Se Jin; Chung, Tae Sun

Publication Year: 2019-01-01

Publisher: Institute of Electrical and Electronics Engineers Inc.

Citation: IEEE Access, Vol.7, pp.140261-140277

Keyword: Distributed processing energy efficiency scheduling

Mesh Keyword: Communication overheads Distributed processing Energy efficient computing Large-scale clusters Processing performance Scheduling module Slow-start phase Task distribution

All Science Classification Codes (ASJC): Computer Science (all)Materials Science (all)Engineering (all)

Abstract: Many prior works have investigated on how to increase the job processing performance and energy efficient computing in large scale clusters. However, they employ serialized scheduling approaches encompassed with task straggler 'hunting' techniques which launches speculative tasks after detecting slow tasks. These slow tasks are detected through node instrumentation which collects system level information whilst tracking the task execution progress. Such approaches are however detrimental towards achieving maximum processing performance and preserving cluster energy as they increase communication overheads. In this paper, we observe that node instrumentation and serialized scheduling in existing works does not only degrade the job processing performance, but also increase cluster energy consumption. To alleviate this, we propose EPPADS, a light-weight scheduler which eradicates the need for instrumentation modules for job scheduling purposes. EPPADS schedules tasks in two stages, the slow-start phase (SSP) and accelerate phase (AccP). The SSP schedules initial tasks in the queue using baseline FIFO scheduling and records the initial execution times of the processing nodes, whilst tagging the effective and straggling nodes. The AccP uses the initial execution times to compute the processing nodes task distribution ratio of remaining tasks and schedules them in parallel using a single scheduling I/O, boosting up the processing performance. To amortize the computing energy costs, EPPADS implements a power management module that coordinates with the scheduling module and leverage on node tagging information, to place nodes in two different power transition pools, i.e., high and low state power pools. A single power transition signal per pool is then broadcasted to lower or raise the energy state in the low-power state pool and high-power state pool. Our evaluation using a Hadoop cluster shows that EPPADS achieves 30% and 22% performance improvement and 15% to 20% energy savings as compared to the FIFO and DynMon schedulers, respectively.

ISSN: 2169-3536

Language: eng

URI: https://dspace.ajou.ac.kr/dev/handle/2018.oak/31095

DOI: https://doi.org/10.1109/access.2019.2943632

Fulltext

Type: Article

Funding: This work was supported in part by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education under Grant NRF-2019R1F1A1058548 and Grant NRF-2017R1D1A3B04031440.

Show full item record

qrcode

트윗하기

Related Researcher

HAMANDAWANA PRINCEHAMANDAWANA, PRINCE: Department of Software and Computer Engineering

File Download

There are no files associated with this item.

Related Researcher

Total Views & Downloads

File Download