Improvement of the KTDA Algorithm for the Visualization of Semantic Network

DC Field Value Language
dc.contributor.advisor권순선-
dc.contributor.author표성인-
dc.date.accessioned2025-01-25T01:36:10Z-
dc.date.available2025-01-25T01:36:10Z-
dc.date.issued2023-02-
dc.identifier.other32405-
dc.identifier.urihttps://dspace.ajou.ac.kr/handle/2018.oak/24695-
dc.description학위논문(석사)--수학과,2023. 2-
dc.description.tableofcontents1. Introduction 1 <br>2. KTDA Algorithm 3 <br> 2.1 TF-IDF 4 <br> 2.2 Semantic Network 5 <br> 2.3 Limitation of KTDA Algorithm 6 <br>3. KTDA-N Algorithm 7 <br> 3.1 Sparsity Cutoff Setting 7 <br> 3.2 FDR Controlling in Multiple Testing of Correlations 7 <br> 3.3 LDA Topic Modeling 9 <br> 3.4 Handling Isolates 12 <br> 3.5 KTDA-N Algorithm 13 <br>4. Analysis 14 <br> 4.1 Thyroid Cancer Data 15 <br> 4.2 Lack of Nurse Data 20 <br>5. Conclusion 23 <br>6. References 24 <br>Appendix 27-
dc.language.isoeng-
dc.publisherThe Graduate School, Ajou University-
dc.rights아주대학교 논문은 저작권에 의해 보호받습니다.-
dc.titleImprovement of the KTDA Algorithm for the Visualization of Semantic Network-
dc.typeThesis-
dc.contributor.affiliation아주대학교 대학원-
dc.contributor.department일반대학원 수학과-
dc.date.awarded2023-02-
dc.description.degreeMaster-
dc.identifier.localIdT000000032405-
dc.identifier.urlhttps://dcoll.ajou.ac.kr/dcollection/common/orgView/000000032405-
dc.subject.keywordCorrelation Testing-
dc.subject.keywordKorean Text Data Analysis-
dc.subject.keywordLDA Topic Modeling-
dc.subject.keywordSemantic Network-
dc.subject.keywordSparsity Cutoff-
dc.subject.keywordText Mining-
dc.subject.keywordVisualization-
dc.description.alternativeAbstractTextual data differs in the analysis method depending on its domain or various characteristics. The Korean Text Data Analysis Algorithm was presented to provide a pipeline for statistical analysis of Korean text for the above reasons. However, in the process of dimension reduction and correlation cutting, a cutoff setting with insufficient statistical inference was accompanied. The dense visualization result also weaken the interpretabiltiy of the plot. To improve the algorithm, this study presented statistical inference for word-to-word relationships using FDR(False Discovery Rate) control and improved dimension reduction and visualization by applying sparsity cutoff setting and LDA(Latent Dirichlet Allocation). New algorithm is expected to improve the reliability and interpretation of the results of analysis.-
Appears in Collections:
Graduate School of Ajou University > Department of Mathematics > 3. Theses(Master)
Files in This Item:
There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse