Ajou University repository

Decomposing texture and semantic for out-of-distribution detection
Citations (SCOPUS)
1
Publication Year
2024-03-15
Publisher
Elsevier Ltd
Citation
Expert Systems with Applications, Vol.238
Keyword
Feature representation; Out-of-distribution detection; Robustness
Mesh Keyword
AI systems; Data distribution; Detection methods; Detection tasks; Feature representation; Out-of-distribution detection; Real data sets; Robustness; Training data; Training dataset
All Science Classification Codes (ASJC)
Engineering (all); Computer Science Applications; Artificial Intelligence
Abstract
The out-of-distribution (OOD) detection task treats samples that follow the training data distribution as in-distribution (ID) and samples from any other distribution as OOD. OOD detection has made significant progress in recent years, since many studies have observed that a distribution mismatch between training and real-world datasets can severely degrade the reliability of AI systems. Nevertheless, the lack of a precise interpretation of the ID limits the application of OOD detection methods to real-world systems. To tackle this, we decompose the definition of the ID into texture and semantics, motivated by the demands of real-world scenarios. We also design new benchmarks that measure the robustness OOD detection methods should have. The proposed benchmarks verify not only the precision but also the robustness of detection models. Measuring both factors is crucial in OOD detection because they indicate different traits of a model. For instance, precision is relevant to detecting minor cracks in the conveyor belt of a smart factory, whereas robustness pertains to maintaining performance under diverse weather conditions, as required by autonomous driving. To balance OOD detection performance and robustness, our method takes a divide-and-conquer approach: the proposed model first handles the texture and semantic components separately and then fuses them. This design is empirically validated on a series of benchmarks, including both the proposed ones and their conventional counterparts. By decomposing the previously unclear definition of the ID into texture and semantic components, our approach better suits the demands of a reliable machine learning system, which requires robustness and consistent performance across varied scenarios. Unlike prior works, our approach does not rely on any extra datasets or labels, which prevents the proposed framework from depending on a particular dataset distribution.
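The abstract describes scoring texture and semantics separately and fusing the results, without stating the fusion rule. The sketch below is one possible reading, not the authors' implementation: it assumes each component yields a per-sample score where higher means "more in-distribution", and it fuses the two with a convex weight. The names `fuse_scores`, `is_ood`, `lam`, and `threshold`, as well as the toy scores, are hypothetical and do not come from the record.

```python
def fuse_scores(texture_scores, semantic_scores, lam=0.5):
    """Convex combination of two per-sample ID scores.

    Assumption (not from the source): lam weights the texture score and
    (1 - lam) weights the semantic score.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    if len(texture_scores) != len(semantic_scores):
        raise ValueError("score lists must have the same length")
    return [lam * t + (1.0 - lam) * s
            for t, s in zip(texture_scores, semantic_scores)]


def is_ood(texture_scores, semantic_scores, threshold, lam=0.5):
    """Flag a sample as OOD when its fused score falls below the threshold."""
    fused = fuse_scores(texture_scores, semantic_scores, lam)
    return [score < threshold for score in fused]


# Toy scores, made up for illustration only.
texture = [0.9, 0.2, 0.8]
semantic = [0.8, 0.9, 0.1]

# Equal weighting: only the third sample (low semantic score) is flagged.
flags_balanced = is_ood(texture, semantic, threshold=0.5, lam=0.5)

# Texture-only weighting: only the second sample (low texture score) is flagged.
flags_texture = is_ood(texture, semantic, threshold=0.5, lam=1.0)
```

Under this assumed rule, shifting `lam` toward 1 makes the detector sensitive to texture changes, while shifting it toward 0 makes it rely on semantics alone, which is one way to express the trade-off between precision and robustness that the abstract mentions.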
ISSN
0957-4174
Language
eng
URI
https://dspace.ajou.ac.kr/dev/handle/2018.oak/33706
DOI
https://doi.org/10.1016/j.eswa.2023.121829
Type
Article
Funding
This research was supported by the Institute of Information & Communications Technology Planning & Evaluation (IITP), funded by the Ministry of Science and ICT of the Korea Government (MSIT), under the Artificial Intelligence Convergence Innovation Human Resources Development grant (IITP-2023-RS-2023-00255968) and under Grant 2021-0-02068 (Artificial Intelligence Innovation Hub), and also by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (NRF-2022R1A2C1007434).
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Sohn, Kyung-Ah (손경아)
Department of Software and Computer Engineering
