TY - GEN
T1 - AI-assisted Stochastic Optimization for GPU Data Centers Lifecycle Planning
AU - Nie, Chengyi
AU - Xing, Anna
AU - Latif, Imran
AU - Liu, Zhenhua
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s). Publication rights licensed to ACM.
PY - 2025/6/16
Y1 - 2025/6/16
N2 - The recent advancement in artificial intelligence (AI) boosts the demand for AI-related workloads, significantly increasing GPU deployment and associated power consumption in data centers. This trend accelerates hardware upgrades, introducing the need for GPU upgrades planning, reallocation, and retirement. Existing lifecycle planning methods typically rely on static provisioning or heuristic-based strategies, which make it challenging to address uncertainties in workload demand, hardware performance degradation, power limitations, etc. As a result, poor resource utilization and increased operating costs are common in data centers. We propose an AI-assisted stochastic optimization framework for GPU lifecycle planning in data centers. This framework utilizes large language models to generate predictive scenarios for future workload demands, hardware performance decay, etc., in different cases. These AI-generated insights are integrated into a stochastic optimization model, which helps stakeholders make decisions about the timing of upgrades and retirements of GPU and cooling systems in data centers.
AB - The recent advancement in artificial intelligence (AI) boosts the demand for AI-related workloads, significantly increasing GPU deployment and associated power consumption in data centers. This trend accelerates hardware upgrades, introducing the need for GPU upgrades planning, reallocation, and retirement. Existing lifecycle planning methods typically rely on static provisioning or heuristic-based strategies, which make it challenging to address uncertainties in workload demand, hardware performance degradation, power limitations, etc. As a result, poor resource utilization and increased operating costs are common in data centers. We propose an AI-assisted stochastic optimization framework for GPU lifecycle planning in data centers. This framework utilizes large language models to generate predictive scenarios for future workload demands, hardware performance decay, etc., in different cases. These AI-generated insights are integrated into a stochastic optimization model, which helps stakeholders make decisions about the timing of upgrades and retirements of GPU and cooling systems in data centers.
UR - https://www.scopus.com/pages/publications/105016353557
U2 - 10.1145/3679240.3735099
DO - 10.1145/3679240.3735099
M3 - Conference contribution
AN - SCOPUS:105016353557
T3 - E-ENERGY 2025 - Proceedings of the 2025 16th ACM International Conference on Future and Sustainable Energy Systems
SP - 870
EP - 873
BT - E-ENERGY 2025 - Proceedings of the 2025 16th ACM International Conference on Future and Sustainable Energy Systems
PB - Association for Computing Machinery, Inc
T2 - 16th ACM International Conference on Future and Sustainable Energy Systems, E-ENERGY 2025
Y2 - 17 June 2025 through 20 June 2025
ER -