Ajou University repository

Augmented reality, deep learning and vision-language query system for construction worker safetyoa mark
  • Chen, Haosen ;
  • Hou, Lei ;
  • Wu, Shaoze ;
  • Zhang, Guomin ;
  • Zou, Yang ;
  • Moon, Sungkon ;
  • Bhuiyan, Muhammed
Citations

SCOPUS

21

Citation Export

DC Field Value Language
dc.contributor.authorChen, Haosen-
dc.contributor.authorHou, Lei-
dc.contributor.authorWu, Shaoze-
dc.contributor.authorZhang, Guomin-
dc.contributor.authorZou, Yang-
dc.contributor.authorMoon, Sungkon-
dc.contributor.authorBhuiyan, Muhammed-
dc.date.issued2024-01-01-
dc.identifier.issn0926-5805-
dc.identifier.urihttps://dspace.ajou.ac.kr/dev/handle/2018.oak/33763-
dc.description.abstractLow situational awareness contributes to safety incidents in construction. Existing Deep Learning (DL)-based applications lack the capability to provide context-specific and interactive feedback that is essential for workers to fully understand their surrounding environments. This paper proposes the Visual Construction Safety Query (VCSQ) system. The system encompasses real-time Image Captioning (IC), safety-centric Visual Question Answering (VQA), and keyword-based Image-Text Retrieval (ITR), integrated with head-mounted Augmented Reality (AR) devices. System validation includes benchmarks and real-world images. The ITR module posted high recall rates of 0.801 and 0.835 for Recall@5 and @10. The VQA module achieved an 89.7% accuracy rate, and the IC module had a SPICE score of 0.449. Feasibility tests and surveys confirmed the system's practical advantages in different construction scenarios. This study establishes an integration roadmap adaptable to future advancements in interactive DL and immersive AR.-
dc.language.isoeng-
dc.publisherElsevier B.V.-
dc.subject.meshConstruction safety-
dc.subject.meshConstruction workers-
dc.subject.meshDeep learning-
dc.subject.meshImage captioning-
dc.subject.meshImage texts-
dc.subject.meshLanguage model-
dc.subject.meshQuery systems-
dc.subject.meshQuestion Answering-
dc.subject.meshText retrieval-
dc.subject.meshVision-language model-
dc.titleAugmented reality, deep learning and vision-language query system for construction worker safety-
dc.typeArticle-
dc.citation.titleAutomation in Construction-
dc.citation.volume157-
dc.identifier.bibliographicCitationAutomation in Construction, Vol.157-
dc.identifier.doi10.1016/j.autcon.2023.105158-
dc.identifier.scopusid2-s2.0-85175417154-
dc.identifier.urlhttps://www.journals.elsevier.com/automation-in-construction-
dc.subject.keywordAugmented reality-
dc.subject.keywordConstruction safety-
dc.subject.keywordDeep learning-
dc.subject.keywordVision-language models-
dc.description.isoatrue-
dc.subject.subareaControl and Systems Engineering-
dc.subject.subareaCivil and Structural Engineering-
dc.subject.subareaBuilding and Construction-
Show simple item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Moon, Sung Kon Image
Moon, Sung Kon문성곤
Department of Civil Systems Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.