Ajou University repository

Enhancing Voice Phishing Detection Using Multilingual Back-Translation and SMOTE: An Empirical Studyoa mark
Citations

SCOPUS

3

Citation Export

Publication Year
2025-01-01
Journal
IEEE Access
Publisher
Institute of Electrical and Electronics Engineers Inc.
Citation
IEEE Access, Vol.13, pp.37946-37965
Keyword
Back-translationdata augmentationmachine learningnatural language processingSMOTEvoice phishing
Mesh Keyword
Back translationsData augmentationLanguage processingMachine-learningNatural language processingNatural languagesPerformancePhishingSynthetic minority over-sampling techniquesVoice phishing
All Science Classification Codes (ASJC)
Computer Science (all)Materials Science (all)Engineering (all)
Abstract
With the widespread global trend of voice phishing or vishing attacks, the development of effective detection models using artificial intelligence (AI) has been hindered by the lack of high-quality and large volumes of data. This lack of data reflecting a real vishing scenario often leads to imbalanced datasets and biased detection models. Therefore, we present in this paper a data augmentation (DA) method for expanding the imbalanced Korean call content vishing (KorCCVi) dataset to address the existing data asymmetry problem and enhance the performance of Korean vishing detection. The proposed approach for DA involves using the back-translation (BT) method with three different intermediate languages: English, Chinese, and Japanese. The proposed method offers several advantages over the traditional synthetic minority oversampling technique (SMOTE), which is the main technique used to compare with our multilingual BT approach. Using these two DA techniques, several machine learning (ML) and deep learning (DL) models were trained on the original imbalanced dataset, the dataset balanced with SMOTE and its variants, and the dataset augmented with our method. We analyzed the impact of these DA methods on the performance of the models, demonstrated the benefits of each approach, and suggested the most suitable approach. The performance of the trained models was evaluated using the accuracy, precision, recall, and F1-score metrics. The experimental results demonstrated that the proposed multilingual BT method effectively expands the dataset while preserving its contextual and linguistic characteristics. The average performance of the models revealed that those trained on the augmented dataset outperformed the other models. They achieved F1-scores of 98.91% for the back-translated data, 98.14% for the original data, and 97.23% for SMOTE.
ISSN
2169-3536
Language
eng
URI
https://aurora.ajou.ac.kr/handle/2018.oak/38549
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=86000720385&origin=inward
DOI
https://doi.org/10.1109/access.2025.3545250
Journal URL
http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
Type
Article
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

HAMANDAWANA, PRINCE Image
HAMANDAWANA, PRINCEHAMANDAWANA PRINCE
Department of Software and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.