Ajou University repository

Enriching Local Patterns with Multi-Token Attention for Broad-Sight Neural Networks

DC Field | Value | Language
dc.contributor.author | Kang, Hankyul | -
dc.contributor.author | Ryu, Jongbin | -
dc.date.issued | 2025-01-01 | -
dc.identifier.uri | https://aurora.ajou.ac.kr/handle/2018.oak/38562 | -
dc.identifier.uri | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105003625128&origin=inward | -
dc.description.abstract | In neural networks, recognizing visual patterns is challenging because global average pooling disregards local patterns and solely relies on over-concentrated activation. Global average pooling enforces the network to learn objects regardless of their location, so features tend to be activated only in specific regions. To support this claim, we provide a novel analysis of the problems that over-concentration brings about in networks with extensive experiments. We analyze the over-concentration through problems arising from feature variance and dead neurons that are not activated. Based on our analysis, we introduce a multi-token attention pooling layer to alleviate the over-concentration problem. Our attention-pooling layer captures broad-sight local patterns by learning multiple tokens with the proposed distillation algorithm. It resolves the high bias and high variance errors of learned multi-tokens, which is crucial when aggregating local patterns with multi-tokens. Our method applies to various vision tasks and network architectures such as CNN, ViT, and MLP-Mixer. The proposed method improves baselines with few extra resources, and a network employing our pooling method works favorably against state-of-the-art networks. We open-source the code at https://github.com/Lab-LVM/imagenet-models. | -
dc.description.sponsorship | This paper was supported in part by the ETRI Grant funded by Korean Government (Fundamental Technology Research for Human-Centric Autonomous Intelligent Systems) under Grant 24ZB1200, Artificial Intelligence Innovation Hub (RS-2021-II212068), Artificial Intelligence Convergence Innovation Human Resources Development (IITP-2024-RS-2023-00255968), and the NRF Grant (RS-2024-00356486). | -
dc.language.iso | eng | -
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | -
dc.subject.mesh | Bias and variance | -
dc.subject.mesh | Bias-and-variance-error | -
dc.subject.mesh | Learn+ | -
dc.subject.mesh | Local patterns | -
dc.subject.mesh | Multi tokens | -
dc.subject.mesh | Multi-token attention pooling | -
dc.subject.mesh | Neural-networks | -
dc.subject.mesh | Over-concentration | -
dc.subject.mesh | Variance error | -
dc.subject.mesh | Visual pattern | -
dc.title | Enriching Local Patterns with Multi-Token Attention for Broad-Sight Neural Networks | -
dc.type | Conference | -
dc.citation.conferenceDate | 2025.02.28.~2025.03.04. | -
dc.citation.conferenceName | 2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025 | -
dc.citation.edition | Proceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025 | -
dc.citation.endPage | 8279 | -
dc.citation.startPage | 8270 | -
dc.citation.title | Proceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025 | -
dc.identifier.bibliographicCitation | Proceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025, pp.8270-8279 | -
dc.identifier.doi | 10.1109/wacv61041.2025.00802 | -
dc.identifier.scopusid | 2-s2.0-105003625128 | -
dc.identifier.url | http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=10943266 | -
dc.subject.keyword | bias-and-variance-error | -
dc.subject.keyword | multi-token attention pooling | -
dc.subject.keyword | over-concentration | -
dc.type.other | Conference Paper | -
dc.subject.subarea | Artificial Intelligence | -
dc.subject.subarea | Computer Science Applications | -
dc.subject.subarea | Computer Vision and Pattern Recognition | -
dc.subject.subarea | Human-Computer Interaction | -
dc.subject.subarea | Modeling and Simulation | -
dc.subject.subarea | Radiology, Nuclear Medicine and Imaging | -
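
Illustrative note on the abstract: the record contrasts global average pooling with a multi-token attention pooling layer. The sketch below is not the authors' implementation (that is in the repository linked in the abstract) and omits the proposed distillation algorithm entirely. It only shows, under simple assumptions, how several query tokens could each attend over local feature positions instead of averaging them uniformly. The function names, the scaled dot-product scoring, and the final averaging of the per-token outputs are all hypothetical choices made for illustration.

```python
import math

def global_average_pool(features):
    """Average N position vectors (each of length D) into one D-vector."""
    n = len(features)
    dim = len(features[0])
    return [sum(f[d] for f in features) / n for d in range(dim)]

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def multi_token_attention_pool(features, tokens):
    """Hypothetical sketch: each of K tokens attends over all N positions.

    Assumption (not from the paper): scores are scaled dot products, and
    the K pooled vectors are averaged into a single D-vector.
    """
    dim = len(features[0])
    scale = 1.0 / math.sqrt(dim)
    pooled = []
    for token in tokens:
        # One attention score per spatial position for this token.
        scores = [scale * sum(t * x for t, x in zip(token, f)) for f in features]
        weights = softmax(scores)
        # Attention-weighted sum of the position vectors.
        pooled.append(
            [sum(w * f[d] for w, f in zip(weights, features)) for d in range(dim)]
        )
    k = len(pooled)
    return [sum(p[d] for p in pooled) / k for d in range(dim)]

# Toy input: 4 positions, 2 channels; 2 made-up tokens.
features = [[1.0, 0.0], [0.0, 1.0], [0.0, 0.0], [1.0, 1.0]]
tokens = [[2.0, 0.0], [0.0, 2.0]]
gap = global_average_pool(features)
mtap = multi_token_attention_pool(features, tokens)
```

In this toy setup, a token of all zeros yields uniform attention weights, so the pooled output reduces to global average pooling; non-zero tokens shift weight toward the positions they match.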

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Ryu, Jongbin (유종빈)
Department of Software and Computer Engineering
