Ajou University repository

Enhancing Voice Phishing Detection Using Multilingual Back-Translation and SMOTE: An Empirical Studyoa mark
Citations

SCOPUS

3

Citation Export

DC Field Value Language
dc.contributor.authorBoussougou, Milandu Keith Moussavou-
dc.contributor.authorHamandawana, Prince-
dc.contributor.authorPark, Dong Joo-
dc.date.issued2025-01-01-
dc.identifier.issn2169-3536-
dc.identifier.urihttps://aurora.ajou.ac.kr/handle/2018.oak/38549-
dc.identifier.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=86000720385&origin=inward-
dc.description.abstractWith the widespread global trend of voice phishing or vishing attacks, the development of effective detection models using artificial intelligence (AI) has been hindered by the lack of high-quality and large volumes of data. This lack of data reflecting a real vishing scenario often leads to imbalanced datasets and biased detection models. Therefore, we present in this paper a data augmentation (DA) method for expanding the imbalanced Korean call content vishing (KorCCVi) dataset to address the existing data asymmetry problem and enhance the performance of Korean vishing detection. The proposed approach for DA involves using the back-translation (BT) method with three different intermediate languages: English, Chinese, and Japanese. The proposed method offers several advantages over the traditional synthetic minority oversampling technique (SMOTE), which is the main technique used to compare with our multilingual BT approach. Using these two DA techniques, several machine learning (ML) and deep learning (DL) models were trained on the original imbalanced dataset, the dataset balanced with SMOTE and its variants, and the dataset augmented with our method. We analyzed the impact of these DA methods on the performance of the models, demonstrated the benefits of each approach, and suggested the most suitable approach. The performance of the trained models was evaluated using the accuracy, precision, recall, and F1-score metrics. The experimental results demonstrated that the proposed multilingual BT method effectively expands the dataset while preserving its contextual and linguistic characteristics. The average performance of the models revealed that those trained on the augmented dataset outperformed the other models. They achieved F1-scores of 98.91% for the back-translated data, 98.14% for the original data, and 97.23% for SMOTE.-
dc.language.isoeng-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.subject.meshBack translations-
dc.subject.meshData augmentation-
dc.subject.meshLanguage processing-
dc.subject.meshMachine-learning-
dc.subject.meshNatural language processing-
dc.subject.meshNatural languages-
dc.subject.meshPerformance-
dc.subject.meshPhishing-
dc.subject.meshSynthetic minority over-sampling techniques-
dc.subject.meshVoice phishing-
dc.titleEnhancing Voice Phishing Detection Using Multilingual Back-Translation and SMOTE: An Empirical Study-
dc.typeArticle-
dc.citation.endPage37965-
dc.citation.startPage37946-
dc.citation.titleIEEE Access-
dc.citation.volume13-
dc.identifier.bibliographicCitationIEEE Access, Vol.13, pp.37946-37965-
dc.identifier.doi10.1109/access.2025.3545250-
dc.identifier.scopusid2-s2.0-86000720385-
dc.identifier.urlhttp://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639-
dc.subject.keywordBack-translation-
dc.subject.keyworddata augmentation-
dc.subject.keywordmachine learning-
dc.subject.keywordnatural language processing-
dc.subject.keywordSMOTE-
dc.subject.keywordvoice phishing-
dc.type.otherArticle-
dc.identifier.pissn21693536-
dc.description.isoatrue-
dc.subject.subareaComputer Science (all)-
dc.subject.subareaMaterials Science (all)-
dc.subject.subareaEngineering (all)-
Show simple item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

HAMANDAWANA, PRINCE Image
HAMANDAWANA, PRINCEHAMANDAWANA PRINCE
Department of Software and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.