Anti-focal loss for speech recognition on small-scale datasets

Ren, Jiakai; Jin, Rize; Chung, Tae Sun

DC Field	Value	Language
dc.contributor.author	Ren, Jiakai	-
dc.contributor.author	Jin, Rize	-
dc.contributor.author	Chung, Tae Sun	-
dc.date.issued	2021-08-20	-
dc.identifier.uri	https://aurora.ajou.ac.kr/handle/2018.oak/36714	-
dc.identifier.uri	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85117960182&origin=inward	-
dc.description.abstract	Deep learning models with encoder-decoder architecture become popular in automatic speech recognition systems, due to their success in sequential prediction tasks. Recently, the conformer model has greatly improved the accuracy of speech recognition. However, similar to transformer models, its training relies on a large amount of data. This paper explores an efficient few-shot learning strategy. Specifically, a spec-augment approach is proposed to augment the speech dataset, then a novel loss function, anti-focal loss, is introduced to encourage fast convergence in a small-scale, unbalanced data setting. Extensive experiments on aishell-l dataset show that our model outperforms state-of-the-art approaches under limited support data, in terms of convergence speed and generalization ability.	-
dc.language.iso	eng	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.subject.mesh	Automatic speech recognition system	-
dc.subject.mesh	Encoder-decoder architecture	-
dc.subject.mesh	Few-shot learning	-
dc.subject.mesh	Learning models	-
dc.subject.mesh	Prediction tasks	-
dc.subject.mesh	Sequential prediction	-
dc.subject.mesh	Small scale	-
dc.subject.mesh	Small-scale data	-
dc.subject.mesh	Transformer	-
dc.subject.mesh	Transformer modeling	-
dc.title	Anti-focal loss for speech recognition on small-scale datasets	-
dc.type	Conference	-
dc.citation.conferenceDate	2021.08.20.~2021.08.22.	-
dc.citation.conferenceName	4th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2021	-
dc.citation.edition	2021 4th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2021	-
dc.citation.endPage	22	-
dc.citation.startPage	19	-
dc.citation.title	2021 4th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2021	-
dc.identifier.bibliographicCitation	2021 4th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2021, pp.19-22	-
dc.identifier.doi	10.1109/prai53619.2021.9550804	-
dc.identifier.scopusid	2-s2.0-85117960182	-
dc.identifier.url	http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=9550757	-
dc.subject.keyword	Few-shot learning	-
dc.subject.keyword	Small-scale data	-
dc.subject.keyword	Speech recognition	-
dc.subject.keyword	Transformer	-
dc.type.other	Conference Paper	-
dc.description.isoa	false	-
dc.subject.subarea	Artificial Intelligence	-
dc.subject.subarea	Computer Vision and Pattern Recognition	-

Show simple item record

qrcode

트윗하기

Related Researcher

Chung, Tae-Sun정태선: Department of Software and Computer Engineering

File Download

There are no files associated with this item.

Related Researcher

Total Views & Downloads

File Download