2018 World Congress in Computer Science, Computer Engineering and Applied Computing, CSCE 2018 - Proceedings of the 2018 International Conference on Artificial Intelligence, ICAI 2018
Publisher
CSREA Press
Citation
2018 World Congress in Computer Science, Computer Engineering and Applied Computing, CSCE 2018 - Proceedings of the 2018 International Conference on Artificial Intelligence, ICAI 2018, pp.468-471
Documents are often physically damaged in part. Even if you save a damaged document online, it will be hard to recover it. Therefore, this study aims to develop an artificial intelligence methodology that can guess and recover lost words in documents. This study demonstrates that the Skip-Gram model can be used to construct relationships between words using the word embedding technique, a simplified form of the neural network language model. It also shows that the relationship between words can be used to infer words to fit in a specific part of a lost document.
This research was partially supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2016R1D1A1B03934129) and partially supported by the MSIP (Ministry of Science and ICT) under ICT R&D program (2017-0-01672) supervised by the IITP (Institute for Information & communications Technology Promotion).