Ajou University repository

Textual variations in social media text processing applications: challenges, solutions, and trendsoa mark
  • Khan, Jebran ;
  • Ahmad, Kashif ;
  • Jagatheesaperumal, Senthil Kumar ;
  • Sohn, Kyung Ah
Citations

SCOPUS

9

Citation Export

Publication Year
2025-03-01
Journal
Artificial Intelligence Review
Publisher
Springer Nature
Citation
Artificial Intelligence Review, Vol.58 No.3
Keyword
OOV wordsSocial mediaText normalizationText processing applicationsText variations
Mesh Keyword
Informal communicationOutof-vocabulary words (OOV)Pre-processing techniquesProcessing applicationsSocial mediaText NormalisationText processing applicationText variationText-processingWork analysis
All Science Classification Codes (ASJC)
Language and LinguisticsLinguistics and LanguageArtificial Intelligence
Abstract
Being an informal communication source, social media text is susceptible to several intentional and unintentional textual variations. These variations lead to various out-of-vocabulary (OOV) words, making social media text processing more challenging. This work analyses and discusses such challenges by providing a detailed overview of different sources of intentional and unintentional OOV words and associated challenges. We provide a detailed survey of pre-processing techniques, including traditional and application-specific methods proposed in the literature to handle intentional and unintentional textual variations, while highlighting their pros and cons. The paper analyses the implications of text normalization (standardization) in different social media text-processing applications. Moreover, the paper provides an overview of the recent challenges and trends in handling social media textual variations, and it is expected to provide a baseline for future research.
ISSN
1573-7462
Language
eng
URI
https://aurora.ajou.ac.kr/handle/2018.oak/38500
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85218162913&origin=inward
DOI
https://doi.org/10.1007/s10462-024-11071-z
Journal URL
https://www.springer.com/journal/10462
Type
Article
Funding
This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the Artificial Intelligence Convergence Innovation Human Resources Development (IITP-2024-RS2023-00255968) grant and Grant RS-2021-II212068 (Artificial Intelligence Innovation Hub), supervised by the Institute for Information & Communications Technology Planning & Evaluation (IITP), and also by the National Research Foundation of Korea(NRF) grant (No. NRF2022R1A2C1007434). The publication fee has been paid by the BK21 FOUR program of the NRF of Korea, funded by the Ministry of Education (NRF5199991014091).
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Sohn, Kyung-Ah Image
Sohn, Kyung-Ah손경아
Department of Software and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.