Citation Export
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Li, Jiaxin | - |
dc.contributor.author | Jin, Rize | - |
dc.contributor.author | Paik, Joon Young | - |
dc.contributor.author | Chung, Tae Sun | - |
dc.date.issued | 2024-01-01 | - |
dc.identifier.uri | https://aurora.ajou.ac.kr/handle/2018.oak/37110 | - |
dc.identifier.uri | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85177459829&origin=inward | - |
dc.description.abstract | Machine translation requires that source and target sentences have identical semantics. Previous neural machine translation (NMT) models have implicitly achieved this requirement using cross-entropy loss. In this paper, we propose a sentence Semantic-aware Machine Translation model (SaMT) which explicitly addresses the issue of semantic similarity between sentences in translation. SaMT integrates a Sentence-Transformer into a Transformer-based encoder-decoder to estimate semantic similarity between source and target sentences. Our model enables translated sentences to maintain the semantics of source sentences, either by using the Sentence-Transformer alone or by including an additional linear layer in the decoder. To achieve high-quality translation, we employ vertical and horizontal feature fusion methods, which capture rich features from sentences during translation. Experimental results showed a BLEU score of 36.41 on the IWSLT2014 German→ English dataset, validating the efficacy of incorporating sentence-level semantic knowledge and using the two orthogonal fusion methods. Our code is available at https://github.com/aaa559/SaMT-master. | - |
dc.description.sponsorship | This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2021-0-02051) supervised by the IITP (Institute for Information & Communications Technology Planning & Evaluation). | - |
dc.language.iso | eng | - |
dc.publisher | Springer Science and Business Media Deutschland GmbH | - |
dc.subject.mesh | Cross entropy | - |
dc.subject.mesh | Entropy loss | - |
dc.subject.mesh | Fusion mechanism | - |
dc.subject.mesh | Machine translation models | - |
dc.subject.mesh | Machine translations | - |
dc.subject.mesh | Multi-branch attention | - |
dc.subject.mesh | Semantic similarity | - |
dc.subject.mesh | Semantic-aware | - |
dc.subject.mesh | Sentence semantic-aware | - |
dc.subject.mesh | Transformer | - |
dc.title | Neural Machine Translation with an Awareness of Semantic Similarity | - |
dc.type | Conference | - |
dc.citation.conferenceDate | 2023.11.15. ~ 2023.11.19. | - |
dc.citation.conferenceName | 20th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2023 | - |
dc.citation.edition | PRICAI 2023: Trends in Artificial Intelligence - 20th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2023, Proceedings | - |
dc.citation.endPage | 235 | - |
dc.citation.startPage | 223 | - |
dc.citation.title | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | - |
dc.citation.volume | 14326 LNAI | - |
dc.identifier.bibliographicCitation | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol.14326 LNAI, pp.223-235 | - |
dc.identifier.doi | 10.1007/978-981-99-7022-3_20 | - |
dc.identifier.scopusid | 2-s2.0-85177459829 | - |
dc.identifier.url | https://www.springer.com/series/558 | - |
dc.subject.keyword | Fusion Mechanism | - |
dc.subject.keyword | Multi-branch Attention | - |
dc.subject.keyword | Sentence Semantic-aware | - |
dc.subject.keyword | Transformer | - |
dc.type.other | Conference Paper | - |
dc.description.isoa | false | - |
dc.subject.subarea | Theoretical Computer Science | - |
dc.subject.subarea | Computer Science (all) | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.