Ajou University repository

IGRF-RFE: a hybrid feature selection method for MLP-based network intrusion detection on UNSW-NB15 datasetoa mark
  • Yin, Yuhua ;
  • Jang-Jaccard, Julian ;
  • Xu, Wen ;
  • Singh, Amardeep ;
  • Zhu, Jinting ;
  • Sabrina, Fariza ;
  • Kwak, Jin
Citations

SCOPUS

175

Citation Export

Publication Year
2023-12-01
Publisher
Springer Science and Business Media Deutschland GmbH
Citation
Journal of Big Data, Vol.10
Mesh Keyword
Feature selection methodsFeature subsetFilter methodHybrid feature selectionsInformation gainMultilayers perceptronsRandom forestsRecursive feature eliminationSubset searchesWrapper methods
All Science Classification Codes (ASJC)
Information SystemsHardware and ArchitectureComputer Networks and CommunicationsInformation Systems and Management
Abstract
The effectiveness of machine learning models can be significantly averse to redundant and irrelevant features present in the large dataset which can cause drastic performance degradation. This paper proposes IGRF-RFE: a hybrid feature selection method tasked for multi-class network anomalies using a multilayer perceptron (MLP) network. IGRF-RFE exploits the qualities of both a filter method for its speed and a wrapper method for its relevance search. In the first phase of our approach, we use a combination of two filter methods, information gain (IG) and random forest (RF) respectively, to reduce the feature subset search space. By combining these two filter methods, the influence of less important features but with the high-frequency values selected by IG is more effectively managed by RF resulting in more relevant features to be included in the feature subset search space. In the second phase of our approach, we use a machine learning-based wrapper method that provides a recursive feature elimination (RFE) to further reduce feature dimensions while taking into account the relevance of similar features. Our experimental results obtained based on the UNSW-NB15 dataset confirmed that our proposed method can improve the accuracy of anomaly detection as it can select more relevant features while reducing the feature space. The results show that the feature is reduced from 42 to 23 while the multi-classification accuracy of MLP is improved from 82.25% to 84.24%.
ISSN
2196-1115
Language
eng
URI
https://dspace.ajou.ac.kr/dev/handle/2018.oak/33234
DOI
https://doi.org/10.1186/s40537-023-00694-8
Fulltext

Type
Article
Funding
This work is supported by the Cyber Security Research Programme-Artificial Intelligence for Automating Response to Threats from the Ministry of Business, Innovation, and Employment (MBIE) of New Zealand as a part of the Catalyst Strategy Funds under the Grant Number MAUX1912.
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

KWAK, JIN Image
KWAK, JIN곽진
Department of Cyber Security
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.