Ajou University repository

Textual variations in social media text processing applications: challenges, solutions, and trendsoa mark
  • Khan, Jebran ;
  • Ahmad, Kashif ;
  • Jagatheesaperumal, Senthil Kumar ;
  • Sohn, Kyung Ah
Citations

SCOPUS

9

Citation Export

DC Field Value Language
dc.contributor.authorKhan, Jebran-
dc.contributor.authorAhmad, Kashif-
dc.contributor.authorJagatheesaperumal, Senthil Kumar-
dc.contributor.authorSohn, Kyung Ah-
dc.date.issued2025-03-01-
dc.identifier.issn1573-7462-
dc.identifier.urihttps://aurora.ajou.ac.kr/handle/2018.oak/38500-
dc.identifier.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85218162913&origin=inward-
dc.description.abstractBeing an informal communication source, social media text is susceptible to several intentional and unintentional textual variations. These variations lead to various out-of-vocabulary (OOV) words, making social media text processing more challenging. This work analyses and discusses such challenges by providing a detailed overview of different sources of intentional and unintentional OOV words and associated challenges. We provide a detailed survey of pre-processing techniques, including traditional and application-specific methods proposed in the literature to handle intentional and unintentional textual variations, while highlighting their pros and cons. The paper analyses the implications of text normalization (standardization) in different social media text-processing applications. Moreover, the paper provides an overview of the recent challenges and trends in handling social media textual variations, and it is expected to provide a baseline for future research.-
dc.description.sponsorshipThis research was supported by the MSIT (Ministry of Science and ICT), Korea, under the Artificial Intelligence Convergence Innovation Human Resources Development (IITP-2024-RS2023-00255968) grant and Grant RS-2021-II212068 (Artificial Intelligence Innovation Hub), supervised by the Institute for Information & Communications Technology Planning & Evaluation (IITP), and also by the National Research Foundation of Korea(NRF) grant (No. NRF2022R1A2C1007434). The publication fee has been paid by the BK21 FOUR program of the NRF of Korea, funded by the Ministry of Education (NRF5199991014091).-
dc.language.isoeng-
dc.publisherSpringer Nature-
dc.subject.meshInformal communication-
dc.subject.meshOutof-vocabulary words (OOV)-
dc.subject.meshPre-processing techniques-
dc.subject.meshProcessing applications-
dc.subject.meshSocial media-
dc.subject.meshText Normalisation-
dc.subject.meshText processing application-
dc.subject.meshText variation-
dc.subject.meshText-processing-
dc.subject.meshWork analysis-
dc.titleTextual variations in social media text processing applications: challenges, solutions, and trends-
dc.typeArticle-
dc.citation.number3-
dc.citation.titleArtificial Intelligence Review-
dc.citation.volume58-
dc.identifier.bibliographicCitationArtificial Intelligence Review, Vol.58 No.3-
dc.identifier.doi10.1007/s10462-024-11071-z-
dc.identifier.scopusid2-s2.0-85218162913-
dc.identifier.urlhttps://www.springer.com/journal/10462-
dc.subject.keywordOOV words-
dc.subject.keywordSocial media-
dc.subject.keywordText normalization-
dc.subject.keywordText processing applications-
dc.subject.keywordText variations-
dc.type.otherArticle-
dc.identifier.pissn02692821-
dc.description.isoatrue-
dc.subject.subareaLanguage and Linguistics-
dc.subject.subareaLinguistics and Language-
dc.subject.subareaArtificial Intelligence-
Show simple item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Sohn, Kyung-Ah Image
Sohn, Kyung-Ah손경아
Department of Software and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.