Citation Export
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Khan, Jebran | - |
| dc.contributor.author | Ahmad, Kashif | - |
| dc.contributor.author | Jagatheesaperumal, Senthil Kumar | - |
| dc.contributor.author | Sohn, Kyung Ah | - |
| dc.date.issued | 2025-03-01 | - |
| dc.identifier.issn | 1573-7462 | - |
| dc.identifier.uri | https://aurora.ajou.ac.kr/handle/2018.oak/38500 | - |
| dc.identifier.uri | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85218162913&origin=inward | - |
| dc.description.abstract | Being an informal communication source, social media text is susceptible to several intentional and unintentional textual variations. These variations lead to various out-of-vocabulary (OOV) words, making social media text processing more challenging. This work analyses and discusses such challenges by providing a detailed overview of different sources of intentional and unintentional OOV words and associated challenges. We provide a detailed survey of pre-processing techniques, including traditional and application-specific methods proposed in the literature to handle intentional and unintentional textual variations, while highlighting their pros and cons. The paper analyses the implications of text normalization (standardization) in different social media text-processing applications. Moreover, the paper provides an overview of the recent challenges and trends in handling social media textual variations, and it is expected to provide a baseline for future research. | - |
| dc.description.sponsorship | This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the Artificial Intelligence Convergence Innovation Human Resources Development (IITP-2024-RS2023-00255968) grant and Grant RS-2021-II212068 (Artificial Intelligence Innovation Hub), supervised by the Institute for Information & Communications Technology Planning & Evaluation (IITP), and also by the National Research Foundation of Korea(NRF) grant (No. NRF2022R1A2C1007434). The publication fee has been paid by the BK21 FOUR program of the NRF of Korea, funded by the Ministry of Education (NRF5199991014091). | - |
| dc.language.iso | eng | - |
| dc.publisher | Springer Nature | - |
| dc.subject.mesh | Informal communication | - |
| dc.subject.mesh | Outof-vocabulary words (OOV) | - |
| dc.subject.mesh | Pre-processing techniques | - |
| dc.subject.mesh | Processing applications | - |
| dc.subject.mesh | Social media | - |
| dc.subject.mesh | Text Normalisation | - |
| dc.subject.mesh | Text processing application | - |
| dc.subject.mesh | Text variation | - |
| dc.subject.mesh | Text-processing | - |
| dc.subject.mesh | Work analysis | - |
| dc.title | Textual variations in social media text processing applications: challenges, solutions, and trends | - |
| dc.type | Article | - |
| dc.citation.number | 3 | - |
| dc.citation.title | Artificial Intelligence Review | - |
| dc.citation.volume | 58 | - |
| dc.identifier.bibliographicCitation | Artificial Intelligence Review, Vol.58 No.3 | - |
| dc.identifier.doi | 10.1007/s10462-024-11071-z | - |
| dc.identifier.scopusid | 2-s2.0-85218162913 | - |
| dc.identifier.url | https://www.springer.com/journal/10462 | - |
| dc.subject.keyword | OOV words | - |
| dc.subject.keyword | Social media | - |
| dc.subject.keyword | Text normalization | - |
| dc.subject.keyword | Text processing applications | - |
| dc.subject.keyword | Text variations | - |
| dc.type.other | Article | - |
| dc.identifier.pissn | 02692821 | - |
| dc.description.isoa | true | - |
| dc.subject.subarea | Language and Linguistics | - |
| dc.subject.subarea | Linguistics and Language | - |
| dc.subject.subarea | Artificial Intelligence | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.