Large language models (LLMs) are gaining attention due to their potential to enhance efficiency and sustainability in the building domain, a critical area for reducing global carbon emissions. Built on transformer architectures, LLMs excel at text generation and data analysis, enabling applications such as automated energy model generation, energy management optimization, and fault detection and diagnosis. These models can potentially streamline complex workflows, enhance decision-making, and improve energy efficiency. However, integrating LLMs into building energy systems poses challenges, including high computational demands, data preparation costs, and the need for domain-specific customization. This perspective paper explores the role of LLMs in the building energy system sector, highlighting their potential applications and limitations. We propose a development roadmap built on in-context learning, domain-specific fine-tuning, retrieval augmented generation, and multimodal integration to enhance LLMs’ customization and practical use in this field. This paper aims to spark ideas for bridging the gap between LLMs capabilities and practical building applications, offering insights into the future of LLM-driven methods in building energy applications.
Decardi-Nelson B, Alshehri AS, Ajagekar A, et al. (2024). Generative AI and process systems engineering: The next frontier. Computers & Chemical Engineering, 187: 108723.
Jiang G, Ma Z, Zhang L, et al. (2024). EPlus-LLM: A large language model-based computing platform for automated building energy modeling. Applied Energy, 367: 123431.
Jose S, Nguyen KTP, Medjaher K, et al. (2024). Advancing multimodal diagnostics: Integrating industrial textual data and domain knowledge with large language models. Expert Systems with Applications, 255: 124603.
Livne M, Miftahutdinov Z, Tutubalina E, et al. (2024). nach0: Multimodal natural and chemical languages foundation model. Chemical Science, 15: 8380–8389.
Lu J, Tian X, Zhang C, et al. (2024). Evaluation of large language models (LLMs) on the mastery of knowledge and skills in the heating, ventilation and air conditioning (HVAC) industry. Energy and Built Environment, https://doi.org/10.1016/j.enbenv.2024.03.010
O’Brien W, Wagner A, Schweiker M, et al. (2020). Introducing IEA EBC annex 79: Key challenges and opportunities in the field of occupant-centric building design and operation. Building and Environment, 178: 106738.
Pan S, Luo L, Wang Y, et al. (2024). Unifying large language models and knowledge graphs: A roadmap. IEEE Transactions on Knowledge and Data Engineering, 36: 3580–3599.
Pradhan O, Wen J, Hälleberg D, et al. (2024). Evaluation of data imputation approaches for multi-stream building systems data 1. Science and Technology for the Built Environment, 30: 1035–1048.
Pu H, Yang X, Li J, et al. (2024). AutoRepo: A general framework for multimodal LLM-based automated construction reporting. Expert Systems with Applications, 255: 124601.
Sun Y, Haghighat F, Fung BCM (2022). Trade-off between accuracy and fairness of data-driven building and indoor environment models: A comparative study of pre-processing methods. Energy, 239: 122273.
Wang L, Ma C, Feng X, et al. (2024). A survey on large language model based autonomous agents. Frontiers of Computer Science, 18: 186345.
Wu T, Ling Q (2024). STELLM: Spatio-temporal enhanced pre-trained large language model for wind speed forecasting. Applied Energy, 375: 124034.
Yao Y, Duan J, Xu K, et al. (2024). A survey on large language model (LLM) security and privacy: The Good, The Bad, and The Ugly. High-Confidence Computing, 4: 100211.
Zhang L, Chen Z (2024). Large language model-based interpretable machine learning control in building energy systems. Energy and Buildings, 313: 114278.
Zhang C, Lu J, Zhao Y (2024a). Generative pre-trained transformers (GPT)-based automated data mining for building energy management: Advantages, limitations and the future. Energy and Built Environment, 5: 143–169.
Zhang C, Zhang J, Zhao Y, et al. (2024b). Automated data mining framework for building energy conservation aided by generative pre-trained transformers (GPT). Energy and Buildings, 305: 113877.
Zhang L, Chen Z, Ford V (2024c). Advancing building energy modeling with large language models: Exploration and case studies. Energy and Buildings, 323: 114788.
Zhang J, Zhang C, Lu J, et al. (2025a). Domain-specific large language models for fault diagnosis of heating, ventilation, and air conditioning systems by labeled-data-supervised fine-tuning. Applied Energy, 377: 124378.
Zhang L, Ford V, Chen Z, et al. (2025b). Automatic building energy model development and debugging using large language models agentic workflow. Energy and Buildings, 327: 115116.
Zhang Y, Wang D, Wang G, et al. (2025c). Data-driven building load prediction and large language models: Comprehensive overview. Energy and Buildings, 326: 115001.
Zhao H, Chen H, Yang F, et al. (2024). Explainability for large language models: A survey. ACM Transactions on Intelligent Systems and Technology, 15: 1–38.