Massive-Scale construction dataset synthesis through Stable Diffusion for Machine learning training

Hong, Sungkook; Choi, Byungjoo; Ham, Youngjib; Jeon, Jung Ho; Kim, Hyunsoo

DC Field	Value	Language
dc.contributor.author	Hong, Sungkook	-
dc.contributor.author	Choi, Byungjoo	-
dc.contributor.author	Ham, Youngjib	-
dc.contributor.author	Jeon, Jung Ho	-
dc.contributor.author	Kim, Hyunsoo	-
dc.date.issued	2024-10-01	-
dc.identifier.issn	1474-0346	-
dc.identifier.uri	https://dspace.ajou.ac.kr/dev/handle/2018.oak/34505	-
dc.description.abstract	Advances in artificial intelligence (AI)-driven image generation offer a way to address a persistent problem in machine learning applications: the lack of domain-specific training data. This study explores the feasibility of using synthesized images (SIs) generated through Stable Diffusion as training data for construction. It examines the potential of Stable Diffusion in construction and the performance of convolutional neural network (CNN) models trained exclusively on SIs. A total of 82.01% of the synthesized images are suitable for representing construction tasks. The CNN model trained on preprocessed SIs (with context-based labeling results) achieved a classification accuracy of 89.09%. The CNN model trained solely on raw SIs (synthesized images without context-based labeling results) correctly classified 86.51% of the images. This study demonstrates the viability of SIs as a training dataset and introduces context-based labeling through object detection techniques, which enhances the performance of estimation models.	-
dc.description.sponsorship	This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) and the Ministry of Education (NRF-2022R1F1A1072450). This study was also supported by a grant (RS-2024-00143493) from the Digital-Based Building Construction and Safety Supervision Technology Research Program funded by the Ministry of Land, Infrastructure and Transport of the Korean Government.	-
dc.language.iso	eng	-
dc.publisher	Elsevier Ltd	-
dc.subject.mesh	Activity recognition	-
dc.subject.mesh	Convolutional neural network	-
dc.subject.mesh	Efficientnet	-
dc.subject.mesh	Objects detection	-
dc.subject.mesh	Openpose	-
dc.subject.mesh	Stable diffusion	-
dc.subject.mesh	Synthesized images	-
dc.subject.mesh	Task estimation	-
dc.subject.mesh	Vision based	-
dc.subject.mesh	Vision-based activity recognition	-
dc.title	Massive-Scale construction dataset synthesis through Stable Diffusion for Machine learning training	-
dc.type	Article	-
dc.citation.title	Advanced Engineering Informatics	-
dc.citation.volume	62	-
dc.identifier.bibliographicCitation	Advanced Engineering Informatics, Vol.62	-
dc.identifier.doi	10.1016/j.aei.2024.102866	-
dc.identifier.scopusid	2-s2.0-85205855354	-
dc.identifier.url	https://www.sciencedirect.com/science/journal/14740346	-
dc.subject.keyword	EfficientNet	-
dc.subject.keyword	Object detection	-
dc.subject.keyword	OpenPose	-
dc.subject.keyword	Stable diffusion	-
dc.subject.keyword	Task estimation	-
dc.subject.keyword	Vision-based activity recognition	-
dc.description.isoa	false	-
dc.subject.subarea	Information Systems	-
dc.subject.subarea	Artificial Intelligence	-

Related Researcher

Choi, Byungjoo 최병주: Department of Architecture

File Download

There are no files associated with this item.
