TY - GEN
T1 - Empirical Evaluation of ML Models for Per-Job Power Prediction
AU - Halder, Debajyoti
AU - Acharya, Manas
AU - Malsane, Aniket
AU - Gandhi, Anshul
AU - Zadok, Erez
N1 - Publisher Copyright:
© 2024 Copyright held by the owner/author(s).
PY - 2024/5/7
Y1 - 2024/5/7
N2 - Sustainability has become a critical focus area across the technology industry, most notably in cloud data centers. In such shared-use computing environments, there is a need to account for the power consumption of individual users. Prior work on power prediction of individual user jobs in shared environments has often focused on workloads that stress a single resource, such as CPU or DRAM. These works typically employ a specific machine learning (ML) model to train and test on the target workload for high accuracy. However, modern workloads in data centers can stress multiple resources simultaneously, and cannot be assumed to always be available for training. This paper empirically evaluates the performance of various ML models under different model settings and training data assumptions for the per-job power prediction problem using a range of workloads. Our evaluation results provide key insights into the efficacy of different ML models. For example, we find that linear ML models suffer from poor prediction accuracy (as much as 25% prediction error), especially for unseen workloads. Conversely, non-linear models, specifically XGBoost and Random Forest, provide reasonable accuracy (7 - 9% error). We also find that data-normalization and the power-prediction model formulation affect the accuracy of individual ML models in different ways.
AB - Sustainability has become a critical focus area across the technology industry, most notably in cloud data centers. In such shared-use computing environments, there is a need to account for the power consumption of individual users. Prior work on power prediction of individual user jobs in shared environments has often focused on workloads that stress a single resource, such as CPU or DRAM. These works typically employ a specific machine learning (ML) model to train and test on the target workload for high accuracy. However, modern workloads in data centers can stress multiple resources simultaneously, and cannot be assumed to always be available for training. This paper empirically evaluates the performance of various ML models under different model settings and training data assumptions for the per-job power prediction problem using a range of workloads. Our evaluation results provide key insights into the efficacy of different ML models. For example, we find that linear ML models suffer from poor prediction accuracy (as much as 25% prediction error), especially for unseen workloads. Conversely, non-linear models, specifically XGBoost and Random Forest, provide reasonable accuracy (7 - 9% error). We also find that data-normalization and the power-prediction model formulation affect the accuracy of individual ML models in different ways.
KW - co-executed workloads.
KW - ml models
KW - per-job power prediction
KW - sustainability
UR - https://www.scopus.com/pages/publications/85193976350
U2 - 10.1145/3629527.3651418
DO - 10.1145/3629527.3651418
M3 - Conference contribution
AN - SCOPUS:85193976350
T3 - ICPE 2024 - Companion of the 15th ACM/SPEC International Conference on Performance Engineering
SP - 181
EP - 188
BT - ICPE 2024 - Companion of the 15th ACM/SPEC International Conference on Performance Engineering
PB - Association for Computing Machinery, Inc
T2 - 15th ACM/SPEC International Conference on Performance Engineering, ICPE 2024
Y2 - 7 May 2024 through 11 May 2024
ER -