Citation Export
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Kang, Hankyul | - |
| dc.contributor.author | Ryu, Jongbin | - |
| dc.date.issued | 2025-01-01 | - |
| dc.identifier.uri | https://aurora.ajou.ac.kr/handle/2018.oak/38562 | - |
| dc.identifier.uri | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105003625128&origin=inward | - |
| dc.description.abstract | In neural networks, recognizing visual patterns is challenging because global average pooling disregards local patterns and solely relies on over-concentrated activation. Global average pooling enforces the network to learn objects regardless of their location, so features tend to be activated only in specific regions. To support this claim, we provide a novel analysis of the problems that over-concentration brings about in networks with extensive experiments. We analyze the over-concentration through problems arising from feature variance and dead neurons that are not activated. Based on our analysis, we introduce a multi-token attention pooling layer to alleviate the over-concentration problem. Our attention-pooling layer captures broad-sight local patterns by learning multiple tokens with the proposed distillation algorithm. It resolves the high bias and high variance errors of learned multi-tokens, which is crucial when aggregating local patterns with multi-tokens. Our method applies to various vision tasks and network architectures such as CNN, ViT, and MLP-Mixer. The proposed method improves baselines with few extra resources, and a network employing our pooling method works favorably against state-of-the-art networks. We open-source the code at https://github.com/Lab-LVM/imagenet-models. | - |
| dc.description.sponsorship | This paper was supported in part by the ETRI Grant funded by Korean Government (Fundamental Technology Research for Human-Centric Autonomous Intelligent Systems) under Grant 24ZB1200, Artificial Intelligence Innovation Hub (RS-2021-II212068), Artificial Intelligence Convergence Innovation Human Resources Development (IITP-2024-RS-2023-00255968), and the NRF Grant (RS-2024-00356486). | - |
| dc.language.iso | eng | - |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
| dc.subject.mesh | Bias and variance | - |
| dc.subject.mesh | Bias-and-variance-error | - |
| dc.subject.mesh | Learn+ | - |
| dc.subject.mesh | Local patterns | - |
| dc.subject.mesh | Multi tokens | - |
| dc.subject.mesh | Multi-token attention pooling | - |
| dc.subject.mesh | Neural-networks | - |
| dc.subject.mesh | Over-concentration | - |
| dc.subject.mesh | Variance error | - |
| dc.subject.mesh | Visual pattern | - |
| dc.title | Enriching Local Patterns with Multi-Token Attention for Broad-Sight Neural Networks | - |
| dc.type | Conference | - |
| dc.citation.conferenceDate | 2025.02.28.~2025.03.04. | - |
| dc.citation.conferenceName | 2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025 | - |
| dc.citation.edition | Proceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025 | - |
| dc.citation.endPage | 8279 | - |
| dc.citation.startPage | 8270 | - |
| dc.citation.title | Proceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025 | - |
| dc.identifier.bibliographicCitation | Proceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025, pp.8270-8279 | - |
| dc.identifier.doi | 10.1109/wacv61041.2025.00802 | - |
| dc.identifier.scopusid | 2-s2.0-105003625128 | - |
| dc.identifier.url | http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=10943266 | - |
| dc.subject.keyword | bias-and-variance-error | - |
| dc.subject.keyword | multi-token attention pooling | - |
| dc.subject.keyword | over-concentration | - |
| dc.type.other | Conference Paper | - |
| dc.subject.subarea | Artificial Intelligence | - |
| dc.subject.subarea | Computer Science Applications | - |
| dc.subject.subarea | Computer Vision and Pattern Recognition | - |
| dc.subject.subarea | Human-Computer Interaction | - |
| dc.subject.subarea | Modeling and Simulation | - |
| dc.subject.subarea | Radiology, Nuclear Medicine and Imaging | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.