Ajou University repository

Gramian Attention Heads are Strong yet Efficient Vision Learnersoa mark
Citations

SCOPUS

3

Citation Export

Publication Year
2023-01-01
Journal
Proceedings of the IEEE International Conference on Computer Vision
Publisher
Institute of Electrical and Electronics Engineers Inc.
Citation
Proceedings of the IEEE International Conference on Computer Vision, pp.5818-5828
Mesh Keyword
Architecture designsBuilding blockesChannel expansionsGramiansLearn+matrixNovel architecturePerformanceState of the artTrade off
All Science Classification Codes (ASJC)
SoftwareComputer Vision and Pattern Recognition
Abstract
We introduce a novel architecture design that enhances expressiveness by incorporating multiple head classifiers (i.e., classification heads) instead of relying on channel expansion or additional building blocks. Our approach employs attention-based aggregation, utilizing pairwise feature similarity to enhance multiple lightweight heads with minimal resource overhead. We compute the Gramian matrices to reinforce class tokens in an attention layer for each head. This enables the heads to learn more discriminative representations, enhancing their aggregation capabilities. Furthermore, we propose a learning algorithm that encourages heads to complement each other by reducing correlation for aggregation. Our models eventually surpass state-of-the-art CNNs and ViTs regarding the accuracy-throughput trade-off on ImageNet-1K and deliver remarkable performance across various downstream tasks, such as COCO object instance segmentation, ADE20k semantic segmentation, and fine-grained visual classification datasets. The effectiveness of our framework is substantiated by practical experimental results and further underpinned by generalization error bound. We release the code publicly at: https://github.com/Lab-LVM/imagenet-models.
ISSN
1550-5499
Language
eng
URI
https://aurora.ajou.ac.kr/handle/2018.oak/36949
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85188239686&origin=inward
DOI
https://doi.org/10.1109/iccv51070.2023.00537
Journal URL
http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000149
Type
Conference
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Ryu, Jongbin Image
Ryu, Jongbin유종빈
Department of Software and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.