Title: A random forest algorithm under the ensemble approach for feature selection and classification

Authors: Ankit Kharwar; Devendra Thakor

Addresses: Computer Engineering, Chhotubhai Gopalbhai Patel Institute of Technology, Uka Tarsadia University, Bardoli, Gujarat, India ' Computer Engineering, Chhotubhai Gopalbhai Patel Institute of Technology, Uka Tarsadia University, Bardoli, Gujarat, India

Abstract: Over the years, research analysts have proposed diverse intrusion detection systems' (IDS) tactics to manage the increasing number and complexity of computer threats. IDS takes all the data over the network and analyses the data using machine learning for finding the attacks. It is tough to find attacks on the network because it contains fewer records than standard data. It is significantly challenging to design an IDS for high accuracy. It also foregrounds different feature selection methods to select the best feature subset. We use the random forest feature importance for finding the best features. Single classifiers can mislead the find result, so we use random forest as classification with the help of best features. The proposed model is assessed on standard datasets like KDD'99, NSL-KDD, and UNSW-NB15. The experimental result shows that the proposed model outperforms the existing methods in terms of accuracy, detection rate, and false alarm rate.

Keywords: intrusion detection; anomaly detection; machine learning; ensemble methods; random forest; feature selection; feature importance; classification; cybersecurity; network security.

DOI: 10.1504/IJCNDS.2023.131737

International Journal of Communication Networks and Distributed Systems, 2023 Vol.29 No.4, pp.426 - 447

Received: 19 Apr 2022
Accepted: 12 May 2022

Published online: 30 Jun 2023 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article