Data Augmentation for Low-Level Vision: CutBlur and Mixture-of-Augmentation

Ahn, Namhyuk; Yoo, Jaejun; Sohn, Kyung Ah

Publication Year: 2024-06-01

Publisher: Springer

Citation: International Journal of Computer Vision, Vol.132, pp.2041-2059

Keyword: Data augmentation Generalization Image denoising Image restoration Image super-resolution JPEG artifact removal Multiple degradations restoration

Mesh Keyword: 'current Artifact removal Data augmentation Generalisation Image super resolutions JPEG artifact removal Low-level vision Multiple degradation restoration Multiple degradations Performance

All Science Classification Codes (ASJC): Software Computer Vision and Pattern Recognition Artificial Intelligence

Abstract: Data augmentation (DA) is an effective way to improve the performance of deep networks. Unfortunately, current methods are mostly developed for high-level vision tasks (eg, image classification) and few are studied for low-level (eg, image restoration). In this paper, we provide a comprehensive analysis of the existing DAs in the frequency domain. We find that the methods that largely manipulate the spatial information can hinder the image restoration process and hurt the performance. Based on our analyses, we propose CutBlur and mixture-of-augmentation (MoA). CutBlur cuts a low-quality patch and pastes it to the corresponding high-quality image region, or vice versa. The key intuition is to provide enough DA effect while keeping the pixel distribution intact. This characteristic of CutBlur enables a model to learn not only “how” but also “where” to reconstruct an image. Eventually, the model understands “how much” to restore given pixels, which allows it to generalize better to unseen data distributions. We further improve the restoration performance by MoA that incorporates the curated list of DAs. We demonstrate the effectiveness of our methods by conducting extensive experiments on several low-level vision tasks on both single or a mixture of distortion tasks. Our results show that CutBlur and MoA consistently and significantly improve the performance especially when the model size is big and the data is collected under real-world environments. Our code is available at https://github.com/clovaai/cutblur.

Language: eng

URI: https://dspace.ajou.ac.kr/dev/handle/2018.oak/33874

DOI: https://doi.org/10.1007/s11263-023-01970-z

Fulltext

Type: Article

Funding: This work was by the Korea Research Institute for Defence Technology Planning and Advancement (KRIT) grant funded by the Korea government (DAPA) in 2022 (KRIT-CT-22-037, SAR Image Super-Resolution for Improving of Target Identification Performance, 50%), National Research Foundation of Korea Grants funded by the Korea government (MSIT) (No. NRF-2019R1A2C1006608, 5%, No. 2.220574.01, 15%), Institute of Information & communications Technology Planning & Evaluation(IITP) Grants funded by the ITRC (Information Technology Research Center) support program (IITP-2020-2018-0-01431, 5%) and MSIT No.2020-0-01336 5%, Artificial Intelligence Graduate School Program (UNIST), 5%, No.2021-0-02068, Artificial Intelligence Innovation Hub, 5%, No.2022-0-00959, (Part 2) Few-Shot Learning of Causal Inference in Vision and Language for Decision Making, 5%, No.2022-0-00264, Comprehensive Video Understanding and Generation with Knowledge-based Deep Logic Neural Network, 5%).

Show full item record

qrcode

트윗하기

Related Researcher

Sohn, Kyung-Ah손경아: Department of Software and Computer Engineering

File Download

There are no files associated with this item.

Related Researcher

Total Views & Downloads

File Download