Model free optimization of building cooling water systems with refined action space

Qiaofeng Xiong; Zhengwei Li; Wenxia Cai; Zhechao Wang

doi:10.1007/s12273-022-0956-2


Research Article


Qiaofeng Xiong¹, Zhengwei Li¹,², Wenxia Cai¹, Zhechao Wang¹

¹ School of Mechanical Engineering, Tongji University, Shanghai, China

² Key Laboratory of Performance Evolution and Control for Engineering Structures of Ministry of Education, Tongji University, Shanghai, China


Abstract

Deep Q Network (DQN) is an efficient model-free optimization method with the potential to be used in building cooling water systems. However, because the action space is high-dimensional, the method requires a complex neural network. Consequently, both the number of training samples required and the length of the convergence period are barriers to real-world application. Furthermore, penalty-function-based exploration may lead to unsafe actions, making the method even more difficult to apply. To solve these problems, this paper proposes an approach that limits the action space to a safe area. First, the action space is separated into two sub-regions, one for cooling towers and one for pumps. Second, for each type of equipment, the action space is further divided into safe and unsafe regions. As a result, the convergence speed is significantly improved. Compared with the traditional DQN method in a simulation environment validated by real data, the proposed method shortens the convergence time by one episode (one cooling season). The results suggest that the proposed DQN method learns much faster without any undesired consequences, and is therefore better suited to projects without a pre-learning stage.
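The idea of restricting exploration to a safe region can be illustrated with a minimal sketch. It is not the authors' implementation: the frequency levels, the function names `select_safe_action` and `choose_setpoints`, and the treatment of the two sub-regions as two independent Q-value vectors are all assumptions made for illustration. The sketch shows epsilon-greedy selection in which unsafe actions are masked out before either exploring or exploiting, so no penalty term is needed to discourage them.

```python
import random

# Hypothetical discretised frequency set-points in Hz (not taken from the paper).
TOWER_FREQS = [30, 35, 40, 45, 50]
PUMP_FREQS = [35, 40, 45, 50]


def select_safe_action(q_values, safe_mask, epsilon, rng=random):
    """Epsilon-greedy choice restricted to the actions flagged as safe."""
    safe_indices = [i for i, is_safe in enumerate(safe_mask) if is_safe]
    if not safe_indices:
        raise ValueError("no safe action available")
    if rng.random() < epsilon:
        # Explore, but only inside the safe region.
        return rng.choice(safe_indices)
    # Exploit: best Q-value among the safe actions only.
    return max(safe_indices, key=lambda i: q_values[i])


def choose_setpoints(q_tower, q_pump, tower_mask, pump_mask,
                     epsilon=0.1, rng=random):
    """Pick one tower and one pump set-point from separate sub-spaces.

    Assumption: each equipment type has its own Q-value vector, so the
    sub-spaces are searched independently rather than as one joint space.
    """
    tower_idx = select_safe_action(q_tower, tower_mask, epsilon, rng)
    pump_idx = select_safe_action(q_pump, pump_mask, epsilon, rng)
    return TOWER_FREQS[tower_idx], PUMP_FREQS[pump_idx]


if __name__ == "__main__":
    q_tower = [9.0, 1.0, 2.0, 3.0, 0.5]
    q_pump = [5.0, 4.0, 1.0, 0.0]
    tower_mask = [False, True, True, True, True]  # lowest tower level unsafe
    pump_mask = [False, False, True, True]        # two lowest pump levels unsafe
    print(choose_setpoints(q_tower, q_pump, tower_mask, pump_mask, epsilon=0.0))
```

In this toy setting, the two sub-spaces together need 5 + 4 = 9 Q-values rather than the 5 × 4 = 20 of a joint action space, which hints at why a smaller network and fewer samples might suffice. How the safe masks are derived in practice (for example from equipment limits) is outside the scope of the sketch.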

Keywords

cooling tower; building cooling water system; cooling water pump; DQN controller; convergence speed


Building Simulation

Volume 16, Issue 4, April 2023

Pages 615-627

DOI: 10.1007/s12273-022-0956-2

Cite this article:

Xiong Q, Li Z, Cai W, et al. Model free optimization of building cooling water systems with refined action space. Building Simulation, 2023, 16(4): 615-627. https://doi.org/10.1007/s12273-022-0956-2


Received: 07 July 2022

Revised: 16 October 2022

Accepted: 26 October 2022

Published: 07 December 2022