Ajou University repository

Massive-Scale construction dataset synthesis through Stable Diffusion for Machine learning training
  • Hong, Sungkook ;
  • Choi, Byungjoo ;
  • Ham, Youngjib ;
  • Jeon, Jung Ho ;
  • Kim, Hyunsoo
Citations

SCOPUS

4

Citation Export

DC Field Value Language
dc.contributor.authorHong, Sungkook-
dc.contributor.authorChoi, Byungjoo-
dc.contributor.authorHam, Youngjib-
dc.contributor.authorJeon, Jung Ho-
dc.contributor.authorKim, Hyunsoo-
dc.date.issued2024-10-01-
dc.identifier.issn1474-0346-
dc.identifier.urihttps://dspace.ajou.ac.kr/dev/handle/2018.oak/34505-
dc.description.abstractAdvancements of artificial intelligence (AI)-driven image generation provide opportunities to address a problem in machine learning applications that have suffered from a lack of domain-specific training data. This study explores the feasibility of employing synthesized images (SIs) generated through Stable Diffusion as training data for construction. This study aims to examine the potential of Stable Diffusion in construction, and the performance of convolutional neural network (CNN) models trained exclusively on SIs. A total of 82.01% of images synthesized are suitable for representing construction tasks. The CNN model trained on preprocessed SIs (with context-based labeling results) exhibited a classification accuracy of 89.09%. The CNN model trained solely on raw SIs (synthesized images without context-based labeling results) achieved a successful classification rate of 86.51% for the images. This study presents the viability of SIs as a training dataset and introduces context-based labeling through object detection techniques, enhancing the performance of estimation models.-
dc.description.sponsorshipThis research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) and the Ministry of Education (NRF-2022R1F1A1072450). In addition, this study was also supported by a grant (RS-2024-00143493) from Digital-Based Building Construction and Safety Supervision Technology Research Program funded by Ministry of Land, Infrastructure and Transport of Korean Government.-
dc.language.isoeng-
dc.publisherElsevier Ltd-
dc.subject.meshActivity recognition-
dc.subject.meshConvolutional neural network-
dc.subject.meshEfficientnet-
dc.subject.meshObjects detection-
dc.subject.meshOpenpose-
dc.subject.meshStable diffusion-
dc.subject.meshSynthesized images-
dc.subject.meshTask estimation-
dc.subject.meshVision based-
dc.subject.meshVision-based activity recognition-
dc.titleMassive-Scale construction dataset synthesis through Stable Diffusion for Machine learning training-
dc.typeArticle-
dc.citation.titleAdvanced Engineering Informatics-
dc.citation.volume62-
dc.identifier.bibliographicCitationAdvanced Engineering Informatics, Vol.62-
dc.identifier.doi10.1016/j.aei.2024.102866-
dc.identifier.scopusid2-s2.0-85205855354-
dc.identifier.urlhttps://www.sciencedirect.com/science/journal/14740346-
dc.subject.keywordEfficientNet-
dc.subject.keywordObject detection-
dc.subject.keywordOpenPose-
dc.subject.keywordStable diffusion-
dc.subject.keywordTask estimation-
dc.subject.keywordVision-based activity recognition-
dc.description.isoafalse-
dc.subject.subareaInformation Systems-
dc.subject.subareaArtificial Intelligence-
Show simple item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Choi, Byungjoo  Image
Choi, Byungjoo 최병주
Department of Architecture
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.