Improvement of the KTDA Algorithm for the Visualization of Semantic Network

표성인

DC Field	Value	Language
dc.contributor.advisor	권순선	-
dc.contributor.author	표성인	-
dc.date.issued	2023-02	-
dc.identifier.other	32405	-
dc.identifier.uri	https://aurora.ajou.ac.kr/handle/2018.oak/24695	-
dc.description	학위논문(석사)--수학과,2023. 2	-
dc.description.tableofcontents	1. Introduction 1 <br>2. KTDA Algorithm 3 <br> 2.1 TF-IDF 4 <br> 2.2 Semantic Network 5 <br> 2.3 Limitation of KTDA Algorithm 6 <br>3. KTDA-N Algorithm 7 <br> 3.1 Sparsity Cutoff Setting 7 <br> 3.2 FDR Controlling in Multiple Testing of Correlations 7 <br> 3.3 LDA Topic Modeling 9 <br> 3.4 Handling Isolates 12 <br> 3.5 KTDA-N Algorithm 13 <br>4. Analysis 14 <br> 4.1 Thyroid Cancer Data 15 <br> 4.2 Lack of Nurse Data 20 <br>5. Conclusion 23 <br>6. References 24 <br>Appendix 27	-
dc.language.iso	eng	-
dc.publisher	The Graduate School, Ajou University	-
dc.rights	아주대학교 논문은 저작권에 의해 보호받습니다.	-
dc.title	Improvement of the KTDA Algorithm for the Visualization of Semantic Network	-
dc.type	Thesis	-
dc.contributor.affiliation	아주대학교 대학원	-
dc.contributor.department	일반대학원 수학과	-
dc.date.awarded	2023-02	-
dc.description.degree	Master	-
dc.identifier.url	https://dcoll.ajou.ac.kr/dcollection/common/orgView/000000032405	-
dc.subject.keyword	Correlation Testing	-
dc.subject.keyword	Korean Text Data Analysis	-
dc.subject.keyword	LDA Topic Modeling	-
dc.subject.keyword	Semantic Network	-
dc.subject.keyword	Sparsity Cutoff	-
dc.subject.keyword	Text Mining	-
dc.subject.keyword	Visualization	-
dc.description.alternativeAbstract	Textual data differs in the analysis method depending on its domain or various characteristics. The Korean Text Data Analysis Algorithm was presented to provide a pipeline for statistical analysis of Korean text for the above reasons. However, in the process of dimension reduction and correlation cutting, a cutoff setting with insufficient statistical inference was accompanied. The dense visualization result also weaken the interpretabiltiy of the plot. To improve the algorithm, this study presented statistical inference for word-to-word relationships using FDR(False Discovery Rate) control and improved dimension reduction and visualization by applying sparsity cutoff setting and LDA(Latent Dirichlet Allocation). New algorithm is expected to improve the reliability and interpretation of the results of analysis.	-

Show simple item record

qrcode

트윗하기

Total Views & Downloads

File Download

There are no files associated with this item.