Massive-Scale construction dataset synthesis through Stable Diffusion for Machine learning training

Keyword: EfficientNet Object detection OpenPose Stable diffusion Task estimation Vision-based activity recognition

Mesh Keyword: Activity recognition Convolutional neural network Efficientnet Objects detection Openpose Stable diffusion Synthesized images Task estimation Vision based Vision-based activity recognition

All Science Classification Codes (ASJC): Information Systems Artificial Intelligence

Abstract: Advancements of artificial intelligence (AI)-driven image generation provide opportunities to address a problem in machine learning applications that have suffered from a lack of domain-specific training data. This study explores the feasibility of employing synthesized images (SIs) generated through Stable Diffusion as training data for construction. This study aims to examine the potential of Stable Diffusion in construction, and the performance of convolutional neural network (CNN) models trained exclusively on SIs. A total of 82.01% of images synthesized are suitable for representing construction tasks. The CNN model trained on preprocessed SIs (with context-based labeling results) exhibited a classification accuracy of 89.09%. The CNN model trained solely on raw SIs (synthesized images without context-based labeling results) achieved a successful classification rate of 86.51% for the images. This study presents the viability of SIs as a training dataset and introduces context-based labeling through object detection techniques, enhancing the performance of estimation models.

Funding: This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) and the Ministry of Education (NRF-2022R1F1A1072450). In addition, this study was also supported by a grant (RS-2024-00143493) from Digital-Based Building Construction and Safety Supervision Technology Research Program funded by Ministry of Land, Infrastructure and Transport of Korean Government.

qrcode