Ajou University repository

Neural Machine Translation with an Awareness of Semantic Similarity
Citations

SCOPUS

0

Citation Export

DC Field Value Language
dc.contributor.authorLi, Jiaxin-
dc.contributor.authorJin, Rize-
dc.contributor.authorPaik, Joon Young-
dc.contributor.authorChung, Tae Sun-
dc.date.issued2024-01-01-
dc.identifier.urihttps://aurora.ajou.ac.kr/handle/2018.oak/37110-
dc.identifier.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85177459829&origin=inward-
dc.description.abstractMachine translation requires that source and target sentences have identical semantics. Previous neural machine translation (NMT) models have implicitly achieved this requirement using cross-entropy loss. In this paper, we propose a sentence Semantic-aware Machine Translation model (SaMT) which explicitly addresses the issue of semantic similarity between sentences in translation. SaMT integrates a Sentence-Transformer into a Transformer-based encoder-decoder to estimate semantic similarity between source and target sentences. Our model enables translated sentences to maintain the semantics of source sentences, either by using the Sentence-Transformer alone or by including an additional linear layer in the decoder. To achieve high-quality translation, we employ vertical and horizontal feature fusion methods, which capture rich features from sentences during translation. Experimental results showed a BLEU score of 36.41 on the IWSLT2014 German→ English dataset, validating the efficacy of incorporating sentence-level semantic knowledge and using the two orthogonal fusion methods. Our code is available at https://github.com/aaa559/SaMT-master.-
dc.description.sponsorshipThis research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2021-0-02051) supervised by the IITP (Institute for Information & Communications Technology Planning & Evaluation).-
dc.language.isoeng-
dc.publisherSpringer Science and Business Media Deutschland GmbH-
dc.subject.meshCross entropy-
dc.subject.meshEntropy loss-
dc.subject.meshFusion mechanism-
dc.subject.meshMachine translation models-
dc.subject.meshMachine translations-
dc.subject.meshMulti-branch attention-
dc.subject.meshSemantic similarity-
dc.subject.meshSemantic-aware-
dc.subject.meshSentence semantic-aware-
dc.subject.meshTransformer-
dc.titleNeural Machine Translation with an Awareness of Semantic Similarity-
dc.typeConference-
dc.citation.conferenceDate2023.11.15. ~ 2023.11.19.-
dc.citation.conferenceName20th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2023-
dc.citation.editionPRICAI 2023: Trends in Artificial Intelligence - 20th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2023, Proceedings-
dc.citation.endPage235-
dc.citation.startPage223-
dc.citation.titleLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)-
dc.citation.volume14326 LNAI-
dc.identifier.bibliographicCitationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol.14326 LNAI, pp.223-235-
dc.identifier.doi10.1007/978-981-99-7022-3_20-
dc.identifier.scopusid2-s2.0-85177459829-
dc.identifier.urlhttps://www.springer.com/series/558-
dc.subject.keywordFusion Mechanism-
dc.subject.keywordMulti-branch Attention-
dc.subject.keywordSentence Semantic-aware-
dc.subject.keywordTransformer-
dc.type.otherConference Paper-
dc.description.isoafalse-
dc.subject.subareaTheoretical Computer Science-
dc.subject.subareaComputer Science (all)-
Show simple item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Chung, Tae-Sun Image
Chung, Tae-Sun정태선
Department of Software and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.