AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
Article Link
Collect
Submit Manuscript
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Research Article

Model free optimization of building cooling water systems with refined action space

Qiaofeng Xiong1Zhengwei Li1,2( )Wenxia Cai1Zhechao Wang1
School of Mechanical Engineering Tongji University, Shanghai, China
Key Laboratory of Performance Evolution and Control for Engineering Structures of Ministry of Education, Tongji University, Shanghai, China
Show Author Information

Abstract

Deep Q Network (DQN) is an efficient model-free optimization method, and has the potential to be used in building cooling water systems. However, due to the high dimension of actions, this method requires a complex neural network. Therefore, both the required number of training samples and the length of convergence period are barriers for real application. Furthermore, penalty function based exploration may lead to unsafe actions, causing the application of this optimization method even more difficult. To solve these problems, an approach to limit the action space within a safe area is proposed in this paper. First of all, the action space for cooling towers and pumps are separated into two sub-regions. Secondly, for each type of equipment, the action space is further divided into safe and unsafe regions. As a result, the convergence speed is significantly improved. Compared with the traditional DQN method in a simulation environment validated by real data, the proposed method is able to save the convergence time by 1 episode (one cooling season). The results in this paper suggest that, the proposed DQN method can achieve a much quicker learning speed without any undesired consequences, and therefore is more suitable to be used in projects without pre-learning stage.

References

 

Ahn KU, Park CS (2020). Application of deep Q-networks for model-free optimal control balancing between different HVAC systems. Science and Technology for the Built Environment, 26: 61–74.

 

Biemann M, Scheller F, Liu X, et al. (2021). Experimental evaluation of model-free reinforcement learning algorithms for continuous HVAC control. Applied Energy, 298: 117164.

 

Du Y, Li F, Munk J, et al. (2021a). Multi-task deep reinforcement learning for intelligent multi-zone residential HVAC control. Electric Power Systems Research, 192: 106959.

 

Du Y, Zandi H, Kotevska O, et al. (2021b). Intelligent multi-zone residential HVAC control strategy based on deep reinforcement learning. Applied Energy, 281: 116117.

 

Friedman JH (2001) Greedy function approximation: A gradient boosting machine. The Annals of Statistics, 29: 1189–1232.

 

Guo Q, Qi X, Wei Z, et al. (2019). Modeling and characteristic analysis of fouling in a wet cooling tower based on wavelet neural networks. Applied Thermal Engineering, 152: 907–916.

 

Kang WH, Yoon Y, Lee JH, et al. (2021). In-situ application of an ANN algorithm for optimized chilled and condenser water temperatures set-point during cooling operation. Energy and Buildings, 233: 110666.

 

Hou J, Xu P, Lu X, et al. (2018). Implementation of expansion planning in existing district energy system: A case study in China. Applied Energy, 211: 269–281.

 

Jiang Z, Risbeck MJ, Ramamurti V, et al. (2021). Building HVAC control with reinforcement learning for reduction of energy cost and demand charge. Energy and Buildings, 239: 110833.

 
Ke G, Meng Q, Finley T, et al. (2017). LightGBM: A highly efficient gradient boosting decision tree. In: Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS'17).
 

Li W, Xu P, Lu X, et al. (2016). Electricity demand response in China: Status, feasible market schemes and pilots. Energy, 114: 981–994.

 

Li J, Li Z (2020). Model-based optimization of free cooling switchover temperature and cooling tower approach temperature for data center cooling system with water-side economizer. Energy and Buildings, 227: 110407.

 

Li S, Pan Y, Wang Q, et al. (2022). A non-cooperative game-based distributed optimization method for chiller plant control. Building Simulation, 15: 1015–1034.

 

Ma K, Liu M, Zhang J (2021). Online optimization method of cooling water system based on the heat transfer model for cooling tower. Energy, 231: 120896.

 

Mnih V, Kavukcuoglu K, Silver D, et al. (2015). Human-level control through deep reinforcement learning. Nature, 518: 529–533.

 

Pang Z, O'Neill Z, Li Y, et al. (2020). The role of sensitivity analysis in the building performance analysis: A critical review. Energy and Buildings, 209: 109659.

 

Pérez-Lombard L, Ortiz J, Pout C (2008). A review on buildings energy consumption information. Energy and Buildings, 40: 394–398.

 

Qiu S, Li Z, Li Z, et al. (2020a). Model-free control method based on reinforcement learning for building cooling water systems: Validation by measured data-based simulation. Energy and Buildings, 218: 110055.

 

Qiu S, Li Z, Li Z, et al. (2020b). Model-free optimal chiller loading method based on Q-learning. Science and Technology for the Built Environment, 26: 1100–1116.

 

Qiu S, Li Z, Fan D, et al. (2022). Chilled water temperature resetting using model-free reinforcement learning: Engineering application. Energy and Buildings, 255: 111694.

 

Swider DJ (2003). A comparison of empirically based steady-state models for vapor-compression liquid chillers. Applied Thermal Engineering, 23: 539–556.

 

Wang S, Ma Z (2008). Supervisory and optimal control of building HVAC systems: A review. HVAC&R Research, 14: 3–32.

 

Yuan X, Pan Y, Yang J, et al. (2021). Study on the application of reinforcement learning in the operation optimization of HVAC system. Building Simulation, 14: 75–87.

 

Zhang X, Li Z, Li Z, et al. (2022). Differential pressure reset strategy based on reinforcement learning for chilled water systems. Building Simulation, 15: 233–248.

 

Zhao T, Zhou Y, Zhang J, et al. (2021). Online differential pressure reset method with adaptive adjustment algorithm for variable chilled water flow control in central air-conditioning systems. Building Simulation, 14: 1407–1422.

Building Simulation
Pages 615-627
Cite this article:
Xiong Q, Li Z, Cai W, et al. Model free optimization of building cooling water systems with refined action space. Building Simulation, 2023, 16(4): 615-627. https://doi.org/10.1007/s12273-022-0956-2

474

Views

5

Crossref

4

Web of Science

4

Scopus

0

CSCD

Altmetrics

Received: 07 July 2022
Revised: 16 October 2022
Accepted: 26 October 2022
Published: 07 December 2022
© Tsinghua University Press 2022
Return