AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
Article Link
Collect
Submit Manuscript
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Cover Article

Multi-objective cooling control optimization for air-liquid cooled data centers using TCN-BiGRU-Attention-based thermal prediction models

Jianpeng Lin1Wenjun Lin1Weiwei Lin1,2( )Tianyi Liu1Jiangtao Wang3Hongliang Jiang3
Department of Computer Science and Engineering, South China University of Technology, Guangzhou 510000, China
Peng Cheng Laboratory, Shenzhen 518066, China
Huawei Technologies Co., Ltd., Shenzhen 518066, China
Show Author Information

Abstract

With the breakthroughs in generative artificial intelligence (GAI) models, the vast computational demands are placing an unprecedented burden on the data center (DC) energy supply. The cooling system is the second major energy consumer in the DC, maintaining the safe and efficient operation of computing equipment. However, time-varying temperature gradients and power distribution pose a considerable challenge for efficient cooling management in DCs. For this problem, this work proposes a multi-objective cooling control optimization (MCCO) method to minimize cooling energy consumption while maximizing the rack cooling index (RCI) to ensure energy efficiency and security of hybrid-cooled DCs. The proposed method relies on high-fidelity models that characterize the dynamic thermal evolution and cooling power. Therefore, a novel network model (TCN-BiGRU-Attention) combining temporal convolutional network (TCN), bidirectional gated recurrent unit (BiGRU), and attention mechanism is designed to capture the features of multivariate time-series to predict temperature changes in thermal environments and cooling loops. Moreover, considering the complex heat transfer and operational characteristics of hybrid cooling systems, a machine learning (ML)-based power model is constructed to evaluate the holistic cooling power. Subsequently, the NSGA-Ⅱ algorithm formulates the optimal cooling decision based on the predicted thermal distribution and cooling power, realizing the trade-off between energy consumption and cooling effectiveness. The results of numerical experiments using Marconi 100 data traces suggest that the proposed MCCO significantly reduces cooling energy consumption in summer and winter while maintaining the RCI above 95%.

References

 

Asgari S, Moazamigoodarzi H, Tsai PJ, et al. (2021). Hybrid surrogate model for online temperature and pressure predictions in data centers. Future Generation Computer Systems, 114: 531–547.

 

ASHRAE (2021). Thermal Guidelines for Data Processing Environments. Atlanta, GA, USA: American Society of Heating, Refrigerating and Air-Conditioning Engineers.

 
Athavale J, Yoda M, Joshi Y (2018). Thermal modeling of data centers for control and energy usage optimization. In: Sparrow EM, Abraham JP, Gorman JM (Eds), Advances in Heat Transfer, Vol. 50. Amsterdam: Elsevier: 123–186.
 

Athavale J, Yoda M, Joshi Y (2019). Comparison of data driven modeling approaches for temperature prediction in data centers. International Journal of Heat and Mass Transfer, 135: 1039–1052.

 

Athavale J, Yoda M, Joshi Y (2021). Genetic algorithm based cooling energy optimization of data centers. International Journal of Numerical Methods for Heat & Fluid Flow, 31: 3148–3168.

 
Bai S, Kolter JZ, Koltun V (2018). An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv: 1803.01271.
 
Borghesi A, et al. (2023). M100 dataset. Available at https://zenodo.org/search?q=M100%20dataset&l=list&p=1&s=10&sort=bestmatch. Accessed 15 July 2024.
 

Cao Z, Zhou X, Hu H, et al. (2022). Toward a systematic survey for carbon neutral data centers. IEEE Communications Surveys & Tutorials, 24: 895–936.

 
Chen T, Guestrin C (2016). XGBoost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco California USA.
 

Chen S, Ding P, Zhou G, et al. (2023a). A novel machine learning-based model predictive control framework for improving the energy efficiency of air-conditioning systems. Energy and Buildings, 294: 113258.

 

Chen X, Tu R, Li M, et al. (2023b). Hot spot temperature prediction and operating parameter estimation of racks in data center using machine learning algorithms based on simulation data. Building Simulation, 16: 2159–2176.

 
Chung J, Gulcehre C, Cho K, et al. (2014). Empirical evaluation of gated recurrent neural networks on sequence modeling. Paper presented in NIPS 2014 Deep Learning and Representation Learning Workshop. arXiv: 1412.3555.
 

Deb K, Pratap A, Agarwal S, et al. (2002). A fast and elitist multiobjective genetic algorithm: NSGA-Ⅱ. IEEE Transactions on Evolutionary Computation, 6: 182–197.

 

Dinh HT, Kim D (2022). MILP-based imitation learning for HVAC control. IEEE Internet of Things Journal, 9: 6107–6120.

 
Dorogush AV, Ershov V, Gulin A (2018). CatBoost: Gradient boosting with categorical features support. Paper presented in NIPS 2018 ML Systems Workshop. arXiv: 1810.11363.
 

Du Y, Zhou Z, Yang X, et al. (2023). Dynamic thermal environment management technologies for data center: A review. Renewable and Sustainable Energy Reviews, 187: 113761.

 

Fang Q, Wang J, Gong Q, et al. (2017). Thermal-aware energy management of an HPC data center via two-time-scale control. IEEE Transactions on Industrial Informatics, 13: 2260–2269.

 

Fang Q, Zhou J, Wang S, et al. (2022). Control-oriented modeling and optimization for the temperature and airflow management in an air-cooled data-center. Neural Computing and Applications, 34: 5225–5240.

 

Habibi Khalaj A, Halgamuge SK (2017). A Review on efficient thermal management of air- and liquid-cooled data centers: From chip to the cooling system. Applied Energy, 205: 1165–1188.

 

Herrlin MK (2005). Rack cooling effectiveness in data centers and telecom central offices: The rack cooling index (RCI). ASHRAE Transactions, 111(2): 725–731.

 
IEA (2024). Electricity 2024: Analysis and Forecast to 2026. International Energy Agency. Available at https://www.iea.org/reports/electricity-2024. Accessed 15 July 2024.
 

Ilager S, Ramamohanarao K, Buyya R (2021). Thermal prediction for efficient energy management of clouds using machine learning. IEEE Transactions on Parallel and Distributed Systems, 32: 1044–1056.

 
Ke G, Meng Q, Finley T, et al. (2017). LightGBM: A highly efficient gradient boosting decision tree. In: Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA.
 
Lazic N, Boutilier C, Lu T, et al. (2018). Data center cooling using model-predictive control. In: Proceedings of the 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montréal, Canada.
 

Li Y, Wen Y, Tao D, et al. (2020). Transforming cooling optimization for green data center via deep reinforcement learning. IEEE Transactions on Cybernetics, 50: 2002–2013.

 

Li X, Li M, Zhang Y, et al. (2024). Rack-level cooling technologies for data centers—A comprehensive review. Journal of Building Engineering, 90: 109535.

 

Lin J, Lin W, Lin W, et al. (2022). Thermal prediction for Air-cooled data center using data Driven-based model. Applied Thermal Engineering, 217: 119207.

 

Lin J, Lin W, Huang H, et al. (2024a). Thermal modeling and thermal-aware energy saving methods for cloud data centers: a review. IEEE Transactions on Sustainable Computing, 9: 571–590.

 

Lin W, Lin J, Peng Z, et al. (2024b). A systematic review of green-aware management techniques for sustainable data center. Sustainable Computing: Informatics and Systems, 42: 100989.

 

Lionello M, Rampazzo M, Beghi A, et al. (2020). Graph-based modelling and simulation of liquid immersion cooling systems. Energy, 207: 118238.

 

Masanet E, Shehabi A, Lei N, et al. (2020). Recalibrating global data center energy-use estimates. Science, 367: 984–986.

 

MirhoseiniNejad S, Badawy G, Down DG (2021a). A data-driven, multi-setpoint model predictive thermal control system for data centers. Journal of Network and Systems Management, 29: 7.

 

MirhoseiniNejad S, Badawy G, Down DG (2021b). Holistic thermal-aware workload management and infrastructure control for heterogeneous data centers using machine learning. Future Generation Computer Systems, 118: 208–218.

 

Nogrady B (2021). China launches world’s largest carbon market: But is it ambitious enough? Nature, 595: 637.

 

Tang Q, Gupta SKS, Varsamopoulos G (2008). Energy-efficient thermal-aware task scheduling for homogeneous high-performance computing data centers: a cyber-physical approach. IEEE Transactions on Parallel and Distributed Systems, 19: 1458–1472.

 
Tian M, Vishwanath A, Venkataramani G, et al. (2020). SpinSmart: Exploring optimal server fan speeds to improve overall system energy consumption. In: Proceedings of the 11th ACM International Conference on Future Energy Systems.
 

Van Le D, Wang R, Liu Y, et al. (2021). Deep reinforcement learning for tropical air free-cooled data center control. ACM Transactions on Sensor Networks, 17: 1–28.

 
Vaswani A, Shazeer N, Parmar N, et al. (2017). Attention is all you need. In: Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA.
 

Wang H, Chen X, Vital N, et al. (2024a). Energy optimization for HVAC systems in multi-VAV open offices: A deep reinforcement learning approach. Applied Energy, 356: 122354.

 

Wang R, Cao Z, Zhou X, et al. (2024b). Green data center cooling control via physics-guided safe reinforcement learning. ACM Transactions on Cyber-Physical Systems, 8: 1–26.

 

Xiong Q, Li Z, Cai W, et al. (2023). Model free optimization of building cooling water systems with refined action space. Building Simulation, 16: 615–627.

 

Zhang Q, Meng Z, Hong X, et al. (2021). A survey on data center cooling systems: Technology, power consumption modeling and control strategy optimization. Journal of Systems Architecture, 119: 102253.

Building Simulation
Pages 2145-2161
Cite this article:
Lin J, Lin W, Lin W, et al. Multi-objective cooling control optimization for air-liquid cooled data centers using TCN-BiGRU-Attention-based thermal prediction models. Building Simulation, 2024, 17(12): 2145-2161. https://doi.org/10.1007/s12273-024-1185-7

19

Views

0

Crossref

0

Web of Science

0

Scopus

0

CSCD

Altmetrics

Received: 19 June 2024
Revised: 17 August 2024
Accepted: 30 August 2024
Published: 21 October 2024
© Tsinghua University Press 2024
Return