Ajou University repository

Text-free diffusion inpainting using reference images for enhanced visual fidelityoa mark
Citations

SCOPUS

0

Citation Export

Publication Year
2024-10-01
Publisher
Elsevier B.V.
Citation
Pattern Recognition Letters, Vol.186, pp.221-228
Keyword
Diffusion modelsImage generationImage inpaintingImage manipulationSubject-driven generation
Mesh Keyword
Diffusion modelFree diffusionImage diffusionImage generationsImage InpaintingImage manipulationInpaintingReference imageSubject-driven generationVisual fidelity
All Science Classification Codes (ASJC)
SoftwareSignal ProcessingComputer Vision and Pattern RecognitionArtificial Intelligence
Abstract
This paper presents a novel approach to subject-driven image generation that addresses the limitations of traditional text-to-image diffusion models. Our method generates images using reference images without relying on language-based prompts. We introduce a visual detail preserving module that captures intricate details and textures, addressing overfitting issues associated with limited training samples. The model's performance is further enhanced through a modified classifier-free guidance technique and feature concatenation, enabling the natural positioning and harmonization of subjects within diverse scenes. Quantitative assessments using CLIP, DINO and Quality scores (QS), along with a user study, demonstrate the superior quality of our generated images. Our work highlights the potential of pre-trained models and visual patch embeddings in subject-driven editing, balancing diversity and fidelity in image generation tasks. Our implementation is available at https://github.com/8eomio/Subject-Inpainting.
ISSN
0167-8655
Language
eng
URI
https://dspace.ajou.ac.kr/dev/handle/2018.oak/34536
DOI
https://doi.org/10.1016/j.patrec.2024.10.009
Fulltext

Type
Article
Funding
This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2024-2021-0-02051), the Artificial Intelligence Convergence Innovation Human Resources Development (IITP-2024-RS2023-00255968) grant, and Grant RS-2021-II212068 (Artificial Intelligence Innovation Hub), supervised by the Institute for Information & Communications Technology Planning & Evaluation (IITP), and also by the National Research Foundation of Korea(NRF) grant (No. NRF2022R1A2C1007434).
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Sohn, Kyung-Ah Image
Sohn, Kyung-Ah손경아
Department of Software and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.