Although many cross-lingual word embedding models exist for various languages, approaches that support cross-lingual word embedding between languages with different word order and different word origins are lacking. In this study, we address the problem of cross-lingual word embedding between Korean and English, two languages that differ in both word order and origin, and perform experiments to examine its performance. Cross-lingual models require different levels of supervision. When training across languages with different word order, it is essential to reduce preprocessing time. Therefore, we choose two cross-lingual models that require only sentence-level alignment for our experiments. Our results show that cross-lingual embedding for Korean and English is possible without word-level alignment. We also analyze which bilingual tasks each trained model is suited for by comparing the characteristics of each model's trained embeddings.
This research was supported by the MSIP (Ministry of Science, ICT & Future Planning), Korea, under the National Program for Excellence in SW, supervised by the IITP (Institute for Information & communications Technology Promotion) (R22151610020001002).