Ajou University repository

Anti-focal loss for speech recognition on small-scale datasets
Citations

SCOPUS

0

Citation Export

DC Field Value Language
dc.contributor.authorRen, Jiakai-
dc.contributor.authorJin, Rize-
dc.contributor.authorChung, Tae Sun-
dc.date.issued2021-08-20-
dc.identifier.urihttps://aurora.ajou.ac.kr/handle/2018.oak/36714-
dc.identifier.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85117960182&origin=inward-
dc.description.abstractDeep learning models with encoder-decoder architecture become popular in automatic speech recognition systems, due to their success in sequential prediction tasks. Recently, the conformer model has greatly improved the accuracy of speech recognition. However, similar to transformer models, its training relies on a large amount of data. This paper explores an efficient few-shot learning strategy. Specifically, a spec-augment approach is proposed to augment the speech dataset, then a novel loss function, anti-focal loss, is introduced to encourage fast convergence in a small-scale, unbalanced data setting. Extensive experiments on aishell-l dataset show that our model outperforms state-of-the-art approaches under limited support data, in terms of convergence speed and generalization ability.-
dc.language.isoeng-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.subject.meshAutomatic speech recognition system-
dc.subject.meshEncoder-decoder architecture-
dc.subject.meshFew-shot learning-
dc.subject.meshLearning models-
dc.subject.meshPrediction tasks-
dc.subject.meshSequential prediction-
dc.subject.meshSmall scale-
dc.subject.meshSmall-scale data-
dc.subject.meshTransformer-
dc.subject.meshTransformer modeling-
dc.titleAnti-focal loss for speech recognition on small-scale datasets-
dc.typeConference-
dc.citation.conferenceDate2021.8.20. ~ 2021.8.22.-
dc.citation.conferenceName4th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2021-
dc.citation.edition2021 4th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2021-
dc.citation.endPage22-
dc.citation.startPage19-
dc.citation.title2021 4th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2021-
dc.identifier.bibliographicCitation2021 4th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2021, pp.19-22-
dc.identifier.doi10.1109/prai53619.2021.9550804-
dc.identifier.scopusid2-s2.0-85117960182-
dc.identifier.urlhttp://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=9550757-
dc.subject.keywordFew-shot learning-
dc.subject.keywordSmall-scale data-
dc.subject.keywordSpeech recognition-
dc.subject.keywordTransformer-
dc.type.otherConference Paper-
dc.description.isoafalse-
dc.subject.subareaArtificial Intelligence-
dc.subject.subareaComputer Vision and Pattern Recognition-
Show simple item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Chung, Tae-Sun Image
Chung, Tae-Sun정태선
Department of Software and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.