Citation Export
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lee, Dong Gi | - |
dc.contributor.author | Kim, Myungjun | - |
dc.contributor.author | Shin, Hyunjung | - |
dc.date.issued | 2022-01-01 | - |
dc.identifier.uri | https://aurora.ajou.ac.kr/handle/2018.oak/36790 | - |
dc.identifier.uri | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85127589291&origin=inward | - |
dc.description.abstract | With the advent of easy access to a tremendous amount of text data, various studies utilizing text mining have been conducted in the biomedical field. However, most are only concerned with retrieving information solely from the perspective of either diseases or drugs. Extending from such boundary, we propose an approach of embedding disease and drugs from biomedical literature, determining direct relationships between them, and identifying possibilities of drug repositioning. To embed both disease and drugs, we utilize the word2vec algorithm and generate embedded word vectors for each disease and drug. Then hierarchical clustering with Ward's method is applied for categorization. Moreover, we suggest an evaluation measure that compares clusters from the text data with results from the molecular biology level. The proposed method was applied to 17,606,652 MEDLINE abstracts and extracted 4,163 diseases and 3,930 drugs. By examining heterogeneous clusters in which both disease and drug exist, nine candidate drugs were deduced for each disease in combination with 79 diseases and 84 drugs. The results are expected to serve as a baseline for the preliminary selection of candidate drugs for drug repositioning. | - |
dc.description.sponsorship | ACKNOWLEDGMENT The authors would like to gratefully acknowledge supported from the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2021R1A2C2003474), BK21 FOUR program of the National Research Foundation of Korea funded by the Ministry of Education (NRF5199991014091) and the Ajou University research fund. | - |
dc.language.iso | eng | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.subject.mesh | Candidate drugs | - |
dc.subject.mesh | Clusterings | - |
dc.subject.mesh | Disease-drug clustering | - |
dc.subject.mesh | Drug repositioning | - |
dc.subject.mesh | Embeddings | - |
dc.subject.mesh | Text data | - |
dc.subject.mesh | Text-mining | - |
dc.subject.mesh | Word embedding | - |
dc.subject.mesh | Word representations | - |
dc.subject.mesh | Word2vec | - |
dc.title | Drug Repositioning with Disease-Drug Clusters from Word Representations | - |
dc.type | Conference | - |
dc.citation.conferenceDate | 2022.1.17. ~ 2022.1.20. | - |
dc.citation.conferenceName | 2022 IEEE International Conference on Big Data and Smart Computing, BigComp 2022 | - |
dc.citation.edition | Proceedings - 2022 IEEE International Conference on Big Data and Smart Computing, BigComp 2022 | - |
dc.citation.endPage | 189 | - |
dc.citation.startPage | 182 | - |
dc.citation.title | Proceedings - 2022 IEEE International Conference on Big Data and Smart Computing, BigComp 2022 | - |
dc.identifier.bibliographicCitation | Proceedings - 2022 IEEE International Conference on Big Data and Smart Computing, BigComp 2022, pp.182-189 | - |
dc.identifier.doi | 10.1109/bigcomp54360.2022.00043 | - |
dc.identifier.scopusid | 2-s2.0-85127589291 | - |
dc.identifier.url | http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=9736461 | - |
dc.subject.keyword | Disease-drug clustering | - |
dc.subject.keyword | Drug repositioning | - |
dc.subject.keyword | Text mining | - |
dc.subject.keyword | Word embedding | - |
dc.subject.keyword | Word2vec | - |
dc.type.other | Conference Paper | - |
dc.description.isoa | false | - |
dc.subject.subarea | Artificial Intelligence | - |
dc.subject.subarea | Computer Science Applications | - |
dc.subject.subarea | Computer Vision and Pattern Recognition | - |
dc.subject.subarea | Information Systems and Management | - |
dc.subject.subarea | Health Informatics | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.