Learning Neural Networks without Lazy Weights

Lee, Dong Gi; Cho, Junhee; Kim, Myungjun; Park, Sunghong; Shin, Hyunjung

DC Field	Value	Language
dc.contributor.author	Lee, Dong Gi	-
dc.contributor.author	Cho, Junhee	-
dc.contributor.author	Kim, Myungjun	-
dc.contributor.author	Park, Sunghong	-
dc.contributor.author	Shin, Hyunjung	-
dc.date.issued	2022-01-01	-
dc.identifier.uri	https://aurora.ajou.ac.kr/handle/2018.oak/36787	-
dc.identifier.uri	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85127599667&origin=inward	-
dc.description.abstract	Various approaches have been suggested for the regularization of neural networks, including the well-known Dropout and Dropconnect, which are simple and efficient to implement and therefore have been widely used. However, there is a risk of loss of well-trained weights when dropping nodes or weights randomly. In this paper, we propose a regularization method that preserves well-trained weights and removes poorly trained weights. This was motivated by the observation that the trained weights become further trained. We define these as eager weights whereas the opposite as lazy weights. On every weight update, the distribution of the changes in weight values is examined, and the lazy weights are removed layer-wise. The results demonstrate that the proposed method has a faster convergence rate, avoids overfitting, and outperforms competing methods on the classification of benchmark datasets.	-
dc.description.sponsorship	ACKNOWLEDGMENT The authors would like to gratefully acknowledge supported from the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2021R1A2C2003474), BK21 FOUR program of the National Research Foundation of Korea funded by the Ministry of Education (NRF5199991014091) and the Ajou University research fund.	-
dc.language.iso	eng	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.subject.mesh	Deep learning	-
dc.subject.mesh	Layer-wise	-
dc.subject.mesh	Learning neural networks	-
dc.subject.mesh	Neural-networks	-
dc.subject.mesh	Overfitting	-
dc.subject.mesh	Regularisation	-
dc.subject.mesh	Regularization methods	-
dc.subject.mesh	Simple++	-
dc.subject.mesh	Weight update	-
dc.subject.mesh	Weight values	-
dc.title	Learning Neural Networks without Lazy Weights	-
dc.type	Conference	-
dc.citation.conferenceDate	2022.01.17.~2022.01.20.	-
dc.citation.conferenceName	2022 IEEE International Conference on Big Data and Smart Computing, BigComp 2022	-
dc.citation.edition	Proceedings - 2022 IEEE International Conference on Big Data and Smart Computing, BigComp 2022	-
dc.citation.endPage	87	-
dc.citation.startPage	82	-
dc.citation.title	Proceedings - 2022 IEEE International Conference on Big Data and Smart Computing, BigComp 2022	-
dc.identifier.bibliographicCitation	Proceedings - 2022 IEEE International Conference on Big Data and Smart Computing, BigComp 2022, pp.82-87	-
dc.identifier.doi	10.1109/bigcomp54360.2022.00026	-
dc.identifier.scopusid	2-s2.0-85127599667	-
dc.identifier.url	http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=9736461	-
dc.subject.keyword	Deep learning	-
dc.subject.keyword	Neural networks	-
dc.subject.keyword	Overfitting	-
dc.subject.keyword	Regularization	-
dc.type.other	Conference Paper	-
dc.description.isoa	false	-
dc.subject.subarea	Artificial Intelligence	-
dc.subject.subarea	Computer Science Applications	-
dc.subject.subarea	Computer Vision and Pattern Recognition	-
dc.subject.subarea	Information Systems and Management	-
dc.subject.subarea	Health Informatics	-

Show simple item record

qrcode

트윗하기

Related Researcher

Shin, HyunJung신현정: Department of Industrial Engineering

File Download

There are no files associated with this item.

Related Researcher

Total Views & Downloads

File Download