Improving AdaBoost-based Intrusion Detection System (IDS) Performance on CIC IDS 2017 Dataset
Open Access
- 1 March 2019
- journal article
- research article
- Published by IOP Publishing in Journal of Physics: Conference Series
- Vol. 1192 (1), 012018
- https://doi.org/10.1088/1742-6596/1192/1/012018
Abstract
This paper considers the use of the Synthetic Minority Oversampling Technique (SMOTE), Principal Component Analysis (PCA), and Ensemble Feature Selection (EFS) to improve the performance of an AdaBoost-based Intrusion Detection System (IDS) on the recent and challenging CIC IDS 2017 dataset [1]. Previous research [1] proposed an AdaBoost classifier to cope with the new dataset. However, due to several problems, such as imbalanced training data and inappropriate selection of classification methods, its performance is still inferior. In this research, we aim to construct an intrusion detection approach with improved performance. SMOTE is selected to handle the imbalance of the training data. Moreover, PCA and EFS are applied as feature selection methods to select important attributes from the new dataset. The evaluation results show that the proposed AdaBoost classifier using PCA and SMOTE yields an Area Under the Receiver Operating Characteristic curve (AUROC) of 92%, and the AdaBoost classifier using EFS and SMOTE produces an accuracy, precision, recall, and F1 score of 81.83%, 81.83%, 100%, and 90.01%, respectively.
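As a rough illustration of the oversampling idea behind SMOTE, the sketch below creates synthetic minority-class points by interpolating between a minority sample and one of its nearest minority-class neighbours. It is not the authors' implementation: the function name `smote_like_oversample`, its parameters, and the toy data are hypothetical, and a real experiment would more likely use an established library.

```python
import math
import random


def smote_like_oversample(minority, n_new, k=3, seed=0):
    """Return n_new synthetic points, each interpolated between a minority
    sample and one of its k nearest minority-class neighbours."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest neighbours of `base` within the minority class,
        # excluding `base` itself.
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Toy minority class: hypothetical 2-D feature vectors, not CIC IDS 2017 data.
minority_samples = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1), (1.1, 1.3)]
new_points = smote_like_oversample(minority_samples, n_new=6)
```

Because every synthetic point lies on a line segment between two existing minority samples, the new points stay inside the region already occupied by the minority class instead of duplicating samples exactly.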
This publication has 16 references indexed in Scilit:
- Toward Generating a New Intrusion Detection Dataset and Intrusion Traffic Characterization. Published by INSTICC, 2018
- EFS: an ensemble feature selection tool implemented as R-package and web-application. BioData Mining, 2017
- An improved adaboost algorithm for imbalanced data based on weighted KNN. Published by Institute of Electrical and Electronics Engineers (IEEE), 2017
- Ensemble SVM classifiers based on PCA and LDA for IDS. Published by Institute of Electrical and Electronics Engineers (IEEE), 2016
- Intrusion detection model based on ensemble learning for U2R and R2L attacks. Published by Institute of Electrical and Electronics Engineers (IEEE), 2015
- Intrusion Detection Using Random Forests Classifier with SMOTE and Feature Reduction. Published by Institute of Electrical and Electronics Engineers (IEEE), 2013
- Intrusion Detection using Naive Bayes Classifier with Feature Reduction. Procedia Technology, 2012
- What is principal component analysis? Nature Biotechnology, 2008
- An empirical comparison of supervised learning algorithms. Published by Association for Computing Machinery (ACM), 2006
- A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting. Journal of Computer and System Sciences, 1997