Improvement of the KTDA Algorithm for the Visualization of Semantic Network
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 권순선 | - |
dc.contributor.author | 표성인 | - |
dc.date.accessioned | 2025-01-25T01:36:10Z | - |
dc.date.available | 2025-01-25T01:36:10Z | - |
dc.date.issued | 2023-02 | - |
dc.identifier.other | 32405 | - |
dc.identifier.uri | https://dspace.ajou.ac.kr/handle/2018.oak/24695 | - |
dc.description | 학위논문(석사)--수학과,2023. 2 | - |
dc.description.tableofcontents | 1. Introduction 1 <br>2. KTDA Algorithm 3 <br> 2.1 TF-IDF 4 <br> 2.2 Semantic Network 5 <br> 2.3 Limitation of KTDA Algorithm 6 <br>3. KTDA-N Algorithm 7 <br> 3.1 Sparsity Cutoff Setting 7 <br> 3.2 FDR Controlling in Multiple Testing of Correlations 7 <br> 3.3 LDA Topic Modeling 9 <br> 3.4 Handling Isolates 12 <br> 3.5 KTDA-N Algorithm 13 <br>4. Analysis 14 <br> 4.1 Thyroid Cancer Data 15 <br> 4.2 Lack of Nurse Data 20 <br>5. Conclusion 23 <br>6. References 24 <br>Appendix 27 | - |
dc.language.iso | eng | - |
dc.publisher | The Graduate School, Ajou University | - |
dc.rights | 아주대학교 논문은 저작권에 의해 보호받습니다. | - |
dc.title | Improvement of the KTDA Algorithm for the Visualization of Semantic Network | - |
dc.type | Thesis | - |
dc.contributor.affiliation | 아주대학교 대학원 | - |
dc.contributor.department | 일반대학원 수학과 | - |
dc.date.awarded | 2023-02 | - |
dc.description.degree | Master | - |
dc.identifier.localId | T000000032405 | - |
dc.identifier.url | https://dcoll.ajou.ac.kr/dcollection/common/orgView/000000032405 | - |
dc.subject.keyword | Correlation Testing | - |
dc.subject.keyword | Korean Text Data Analysis | - |
dc.subject.keyword | LDA Topic Modeling | - |
dc.subject.keyword | Semantic Network | - |
dc.subject.keyword | Sparsity Cutoff | - |
dc.subject.keyword | Text Mining | - |
dc.subject.keyword | Visualization | - |
dc.description.alternativeAbstract | Textual data differs in the analysis method depending on its domain or various characteristics. The Korean Text Data Analysis Algorithm was presented to provide a pipeline for statistical analysis of Korean text for the above reasons. However, in the process of dimension reduction and correlation cutting, a cutoff setting with insufficient statistical inference was accompanied. The dense visualization result also weaken the interpretabiltiy of the plot. To improve the algorithm, this study presented statistical inference for word-to-word relationships using FDR(False Discovery Rate) control and improved dimension reduction and visualization by applying sparsity cutoff setting and LDA(Latent Dirichlet Allocation). New algorithm is expected to improve the reliability and interpretation of the results of analysis. | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.