Ajou University repository

Personalizing Diffusion Inpainting Model with Text-Free Finetuning
  • 김범조
Citations

SCOPUS

0

Citation Export

Advisor
Kyung-Ah Sohn
Affiliation
아주대학교 대학원
Department
일반대학원 인공지능학과
Publication Year
2024-02
Publisher
The Graduate School, Ajou University
Keyword
Diffusion ModelImage GenerationImage InpaintingImage manipulationSubject-Driven Generation
Description
학위논문(석사)--인공지능학과,2024. 2
Abstract
This thesis introduces a novel approach to subject-driven image generation, advancing the field by overcoming the limitations of traditional text-to-image diffusion models. Our method employs a model that generates images from reference images without the need for language-based prompts. By integrating our proposed module named as visual detail preserving module, the model captures intricate visual details and textures of subjects, addressing the common challenge of overfitting associated with a limited number of training samples. We further refine the model's performance through a modified classifier-free guidance technique and feature concatenation, enabling the generation of images where subjects are naturally positioned and harmonized within diverse scene contexts. Quantitative assessments using CLIP and DINO scores, complemented by a user study, demonstrate our model's superiority in fidelity, editability, and overall quality of generated images. Our contributions not only show the potential of leveraging pre-trained models and visual patch embeddings in subject-driven editing but also highlight the balance between diversity and fidelity in image generation tasks. Keywords: Diffusion Model, Image Generation, Image Inpainting, Subject-Driven Generation, Image Manipulation
Language
eng
URI
https://aurora.ajou.ac.kr/handle/2018.oak/38947
Journal URL
https://dcoll.ajou.ac.kr/dcollection/common/orgView/000000033534
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Total Views & Downloads

File Download

  • There are no files associated with this item.