Citation Export
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Ren, Jiakai | - |
dc.contributor.author | Jin, Rize | - |
dc.contributor.author | Chung, Tae Sun | - |
dc.date.issued | 2021-08-20 | - |
dc.identifier.uri | https://aurora.ajou.ac.kr/handle/2018.oak/36714 | - |
dc.identifier.uri | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85117960182&origin=inward | - |
dc.description.abstract | Deep learning models with encoder-decoder architecture become popular in automatic speech recognition systems, due to their success in sequential prediction tasks. Recently, the conformer model has greatly improved the accuracy of speech recognition. However, similar to transformer models, its training relies on a large amount of data. This paper explores an efficient few-shot learning strategy. Specifically, a spec-augment approach is proposed to augment the speech dataset, then a novel loss function, anti-focal loss, is introduced to encourage fast convergence in a small-scale, unbalanced data setting. Extensive experiments on aishell-l dataset show that our model outperforms state-of-the-art approaches under limited support data, in terms of convergence speed and generalization ability. | - |
dc.language.iso | eng | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.subject.mesh | Automatic speech recognition system | - |
dc.subject.mesh | Encoder-decoder architecture | - |
dc.subject.mesh | Few-shot learning | - |
dc.subject.mesh | Learning models | - |
dc.subject.mesh | Prediction tasks | - |
dc.subject.mesh | Sequential prediction | - |
dc.subject.mesh | Small scale | - |
dc.subject.mesh | Small-scale data | - |
dc.subject.mesh | Transformer | - |
dc.subject.mesh | Transformer modeling | - |
dc.title | Anti-focal loss for speech recognition on small-scale datasets | - |
dc.type | Conference | - |
dc.citation.conferenceDate | 2021.8.20. ~ 2021.8.22. | - |
dc.citation.conferenceName | 4th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2021 | - |
dc.citation.edition | 2021 4th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2021 | - |
dc.citation.endPage | 22 | - |
dc.citation.startPage | 19 | - |
dc.citation.title | 2021 4th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2021 | - |
dc.identifier.bibliographicCitation | 2021 4th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2021, pp.19-22 | - |
dc.identifier.doi | 10.1109/prai53619.2021.9550804 | - |
dc.identifier.scopusid | 2-s2.0-85117960182 | - |
dc.identifier.url | http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=9550757 | - |
dc.subject.keyword | Few-shot learning | - |
dc.subject.keyword | Small-scale data | - |
dc.subject.keyword | Speech recognition | - |
dc.subject.keyword | Transformer | - |
dc.type.other | Conference Paper | - |
dc.description.isoa | false | - |
dc.subject.subarea | Artificial Intelligence | - |
dc.subject.subarea | Computer Vision and Pattern Recognition | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.