Ajou University repository

Visualization algorithm based on FDR control testing for dimension reduction of textual data
Citations

SCOPUS

0

Citation Export

Publication Year
2025-04-11
Journal
Data Technologies and Applications
Publisher
Emerald Publishing
Citation
Data Technologies and Applications, Vol.59 No.2, pp.338-361
Keyword
Dimension reductionFalse discovery rate controlKorean text data analysisSemantic networkText miningVisualization
Mesh Keyword
Dimension reductionFalse discovery rateFalse discovery rate controlKorean text data analyzeRate controlsSemantics networksText dataText-miningTextual dataVisualization algorithms
All Science Classification Codes (ASJC)
Information SystemsLibrary and Information Sciences
Abstract
Purpose: Visualizing relations of textual data requires dimension reduction to increase the interpretability of output. However, traditional dimension reduction methods have some limitations, such as the loss of feature information during extraction or projection in dimension reduction and uncertain results due to the mixture of word labels. In this study, we develop the textual data visualization algorithm using statistical methods to present statistical inferences on the data. We also construct the algorithm in a way that the user can analyze textual data easily. Design/methodology/approach: Unstructured data, such as textual data, is sensitive to choosing analysis methods. In addition, textual data is generally large-sized and sparse. Considering such characteristics, we applied latent Dirichlet allocation to separate data to minimize the loss of information, and false discover rate (FDR) control to reduce dimension in a statistical way. Findings: The relation of textual data can be derived in a one-click way, and the output can be interpreted without background information, with separated topics. Originality/value: The algorithm is constructed based on the Korean language. However, any language can be used without linguistic information. This study can be an example of usage and flow, which using not well-known dimension reduction methods can replace traditional methods.
ISSN
2514-9318
Language
eng
URI
https://aurora.ajou.ac.kr/handle/2018.oak/38236
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105002314320&origin=inward
DOI
https://doi.org/10.1108/dta-04-2024-0373
Journal URL
https://www.emeraldinsight.com/loi/dta
Type
Article
Funding
Funding: This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2021R1A6A1A10044950 and NO.4299990414389, Ajou mathematical sciences team for future leaders).
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Ahn, Soohyun Image
Ahn, Soohyun안수현
Department of Mathematics
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.