Construction site safety is a paramount concern, given the high rate of accidents and fatalities in the sector. This study introduces a novel approach to analyzing construction accident reports by employing advanced large language models (LLMs), specifically generative pre-trained transformer (GPT)-3.5, GPT-4.0, Gemini Pro, and large language model Meta artificial intelligence (AI) (LLaMA) 3.1. Our research focuses on the classification of key attributes in accident reports: root cause, injury cause, affected body part, severity, and accident time. The results reveal that GPT-4.0 achieves significantly higher accuracy across most attributes. Gemini Pro demonstrates superior performance in the “injury cause” classification, while LLaMA 3.1 excels in classifying “severity” and “root cause”. GPT-3.5, although lagging behind GPT-4.0, exhibits commendable accuracy. The insights gained from this study are vital for the construction industry, as they indicate the potential for developing more precise and effective safety measures. These findings could lead to a reduction in the frequency and severity of accidents, thereby enhancing worker safety.
A. J. P. Tixier, M. R. Hallowell, B. Rajagopalan, et al. Automated content analysis for construction safety: A natural language processing system to extract precursors and outcomes from unstructured injury reports. Autom Constr, 2016, 62: 45–56.
Y. M. Goh, C. U. Ubeynarayana. Construction accident narrative classification: An evaluation of text mining techniques. Accid Anal Prev, 2017, 108: 122–130.
M. Y. Cheng, D. Kusoemo, R. A. Gosno. Text mining-based construction site accident classification using hybrid supervised machine learning. Autom Constr, 2020, 118: 103265.
F. Zhang. A hybrid structured deep neural network with Word2Vec for construction accident causes classification. Int J Constr Manage, 2022, 22: 1120–1140.
M. Alkaissy, M. Arashpour, E. M. Golafshani, et al. Enhancing construction safety: Machine learning-based classification of injury types. Saf Sci, 2023, 162: 106102.
X. X. Luo, X. C. Li, X. F. Song, et al. Convolutional neural network algorithm-based novel automatic text classification framework for construction accident reports. J Constr Eng Manage, 2023, 149: 04023128.
K. Kowsari, K. J. Meimandi, M. Heidarysafa, et al. Text classification algorithms: A survey. Information, 2019, 10: 150.
S. V. Balkus, D. H. Yan. Improving short text classification with augmented data using GPT-3. Nat Lang Eng, 2023, 30: 1–30.
X. Han, W. L. Zhao, N. Ding, et al. PTR: Prompt tuning with rules for text classification. AI Open, 2022, 3: 182–192.
M. Shanahan. Talking about large language models. Commun ACM, 2024, 67: 68–79.
C. Raffel, N. Shazeer, A. Roberts, et al. Exploring the limits of transfer learning with a unified text-to-text transformer. J Mach Learn Res, 2020, 21: 1–67.