Ajou University repository

Knowledge distillation for semantic segmentation using channel and spatial correlations and adaptive cross entropyoa mark
Citations

SCOPUS

31

Citation Export

Publication Year
2020-08-02
Publisher
MDPI AG
Citation
Sensors (Switzerland), Vol.20, pp.1-19
Keyword
Adaptive cross entropy lossChannel and spatial correlation lossKnowledge distillationSemantic segmentation
Mesh Keyword
Computational resourcesDistillation methodMobile applicationsProbability mapsSemantic segmentationSpatial correlationsSpatial informationsStudent network
All Science Classification Codes (ASJC)
Analytical ChemistryInformation SystemsAtomic and Molecular Physics, and OpticsBiochemistryInstrumentationElectrical and Electronic Engineering
Abstract
In this paper, we propose an efficient knowledge distillation method to train light networks using heavy networks for semantic segmentation. Most semantic segmentation networks that exhibit good accuracy are based on computationally expensive networks. These networks are not suitable for mobile applications using vision sensors, because computational resources are limited in these environments. In this view, knowledge distillation, which transfers knowledge from heavy networks acting as teachers to light networks as students, is suitable methodology. Although previous knowledge distillation approaches have been proven to improve the performance of student networks, most methods have some limitations. First, they tend to use only the spatial correlation of feature maps and ignore the relational information of their channels. Second, they can transfer false knowledge when the results of the teacher networks are not perfect. To address these two problems, we propose two loss functions: a channel and spatial correlation (CSC) loss function and an adaptive cross entropy (ACE) loss function. The former computes the full relationship of both the channel and spatial information in the feature map, and the latter adaptively exploits one-hot encodings using the ground truth labels and the probability maps predicted by the teacher network. To evaluate our method, we conduct experiments on scene parsing datasets: Cityscapes and Camvid. Our method presents significantly better performance than previous methods.
ISSN
1424-8220
Language
eng
URI
https://dspace.ajou.ac.kr/dev/handle/2018.oak/31474
DOI
https://doi.org/10.3390/s20164616
Fulltext

Type
Article
Funding
Funding: This work was supported by the Ministry of Science and ICT (MSIT), South Korea, under the Information Technology Research Center (ITRC) Support Program supervised by the Institute for Information and Communications Technology Promotion (IITP) under Grant IITP-2020-2018-0-01424.This work was supported by the Ministry of Science and ICT (MSIT), South Korea, under the Information Technology Research Center (ITRC) Support Program supervised by the Institute for Information and Communications Technology Promotion (IITP) under Grant IITP-2020-2018-0-01424.
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Heo,Yong Seok  Image
Heo,Yong Seok 허용석
Department of Electrical and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.