Analyzing to discover origins of CNNs and ViT architectures in medical images

Oh, Seungmin; Kim, Namkug; Ryu, Jongbin

DC Field	Value	Language
dc.contributor.author	Oh, Seungmin	-
dc.contributor.author	Kim, Namkug	-
dc.contributor.author	Ryu, Jongbin	-
dc.date.issued	2024-12-01	-
dc.identifier.issn	2045-2322	-
dc.identifier.uri	https://dspace.ajou.ac.kr/dev/handle/2018.oak/34135	-
dc.description.abstract	In this paper, we introduce in-depth the analysis of CNNs and ViT architectures in medical images, with the goal of providing insights into subsequent research direction. In particular, the origins of deep neural networks should be explainable for medical images, but there has been a paucity of studies on such explainability in the aspect of deep neural network architectures. Therefore, we investigate the origin of model performance, which is the clue to explaining deep neural networks, focusing on the two most relevant architectures, such as CNNs and ViT. We give four analyses, including (1) robustness in a noisy environment, (2) consistency in translation invariance property, (3) visual recognition with obstructed images, and (4) acquired features from shape or texture so that we compare origins of CNNs and ViT that cause the differences of visual recognition performance. Furthermore, the discrepancies between medical and generic images are explored regarding such analyses. We discover that medical images, unlike generic ones, exhibit class-sensitive. Finally, we propose a straightforward ensemble method based on our analyses, demonstrating that our findings can help build follow-up studies. Our analysis code will be publicly available.	-
dc.description.sponsorship	This work was supported in part by the National Research Foundation of Korea (NRF) by Institute of Information and Communications Technology Planning and Evaluation (IITP) grant funded by the Korea Government (MSIT) (Artificial Intelligence Innovation Hub) under Grant 2021-0-02068, under Grant NRF-2021R1F1A1062807, Korea Health Technology R &D Project (KHIDI), funded by the MOHW under Grant RS-2023-00266038, and under the Artificial Intelligence Convergence Innovation Human Resources Development (RS-2023-00255968) Grant.	-
dc.language.iso	eng	-
dc.publisher	Nature Research	-
dc.subject.mesh	Neural Networks, Computer	-
dc.title	Analyzing to discover origins of CNNs and ViT architectures in medical images	-
dc.type	Article	-
dc.citation.title	Scientific Reports	-
dc.citation.volume	14	-
dc.identifier.bibliographicCitation	Scientific Reports, Vol.14	-
dc.identifier.doi	10.1038/s41598-024-58382-3	-
dc.identifier.pmid	38627477	-
dc.identifier.scopusid	2-s2.0-85190499013	-
dc.identifier.url	https://www.nature.com/srep/	-
dc.description.isoa	true	-
dc.subject.subarea	Multidisciplinary	-

Show simple item record

qrcode

트윗하기

Related Researcher

Ryu, Jongbin유종빈: Department of Software and Computer Engineering

File Download

There are no files associated with this item.

Related Researcher

Total Views & Downloads

File Download