Ajou University repository

Data Augmentation for Low-Level Vision: CutBlur and Mixture-of-Augmentation
Citations

SCOPUS

6

Citation Export

Publication Year
2024-06-01
Publisher
Springer
Citation
International Journal of Computer Vision, Vol.132, pp.2041-2059
Keyword
Data augmentationGeneralizationImage denoisingImage restorationImage super-resolutionJPEG artifact removalMultiple degradations restoration
Mesh Keyword
'currentArtifact removalData augmentationGeneralisationImage super resolutionsJPEG artifact removalLow-level visionMultiple degradation restorationMultiple degradationsPerformance
All Science Classification Codes (ASJC)
SoftwareComputer Vision and Pattern RecognitionArtificial Intelligence
Abstract
Data augmentation (DA) is an effective way to improve the performance of deep networks. Unfortunately, current methods are mostly developed for high-level vision tasks (eg, image classification) and few are studied for low-level (eg, image restoration). In this paper, we provide a comprehensive analysis of the existing DAs in the frequency domain. We find that the methods that largely manipulate the spatial information can hinder the image restoration process and hurt the performance. Based on our analyses, we propose CutBlur and mixture-of-augmentation (MoA). CutBlur cuts a low-quality patch and pastes it to the corresponding high-quality image region, or vice versa. The key intuition is to provide enough DA effect while keeping the pixel distribution intact. This characteristic of CutBlur enables a model to learn not only “how” but also “where” to reconstruct an image. Eventually, the model understands “how much” to restore given pixels, which allows it to generalize better to unseen data distributions. We further improve the restoration performance by MoA that incorporates the curated list of DAs. We demonstrate the effectiveness of our methods by conducting extensive experiments on several low-level vision tasks on both single or a mixture of distortion tasks. Our results show that CutBlur and MoA consistently and significantly improve the performance especially when the model size is big and the data is collected under real-world environments. Our code is available at https://github.com/clovaai/cutblur.
Language
eng
URI
https://dspace.ajou.ac.kr/dev/handle/2018.oak/33874
DOI
https://doi.org/10.1007/s11263-023-01970-z
Fulltext

Type
Article
Funding
This work was by the Korea Research Institute for Defence Technology Planning and Advancement (KRIT) grant funded by the Korea government (DAPA) in 2022 (KRIT-CT-22-037, SAR Image Super-Resolution for Improving of Target Identification Performance, 50%), National Research Foundation of Korea Grants funded by the Korea government (MSIT) (No. NRF-2019R1A2C1006608, 5%, No. 2.220574.01, 15%), Institute of Information & communications Technology Planning & Evaluation(IITP) Grants funded by the ITRC (Information Technology Research Center) support program (IITP-2020-2018-0-01431, 5%) and MSIT No.2020-0-01336 5%, Artificial Intelligence Graduate School Program (UNIST), 5%, No.2021-0-02068, Artificial Intelligence Innovation Hub, 5%, No.2022-0-00959, (Part 2) Few-Shot Learning of Causal Inference in Vision and Language for Decision Making, 5%, No.2022-0-00264, Comprehensive Video Understanding and Generation with Knowledge-based Deep Logic Neural Network, 5%).
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Sohn, Kyung-Ah Image
Sohn, Kyung-Ah손경아
Department of Software and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.