Optimization of Hyperparameters and Feature Selection for Random Forests and Support Vector Machines by Artificial Bee Colony Algorithm
- 1 March 2019
- journal article
- Published by Japanese Society for Artificial Intelligence in Transactions of the Japanese Society for Artificial Intelligence
- Vol. 34 (2), G-I36_1-I36_1
- https://doi.org/10.1527/tjsai.g-i36
Abstract
Hyperparameters optimization for learning algorithms and feature selection from given data are key issues in machine learning, would greatly affect classification accuracy. Random forests and support vector machines are among some of the most popular learning algorithms. Random forests that have relatively few hyperparameters, can perform more accurate classification by optimizing these parameters without requirement of feature selection. Same as random forests, support vector machines also have a few hyperparameters. However, whether or not to perform feature selection at the same time as optimization of these parameters greatly affects classification accuracy. Usually, grid search method is used to optimize hyperparameters. However, since this search method is performed on predetermined grids, the detailed optimization cannot be realized. In this paper, we thus introduce an artificial bee colony (ABC) algorithm to optimize hyperparameters and to perform more accurate feature selection. ABC algorithm is one of the swarm intelligence algorithms used to solve optimization problems which is inspired by the foraging behaviour of the honey bees. Using KDD Cup 1999 Data that is a benchmark of network intrusion detection classification, experimental results demonstrate the effectiveness of our method. The proposed method is superior in classification accuracies to existing methods for the same data, where swarm intelligence is used to hyperparameters optimization and feature selection. Our method also shows better performance than classification accuracies of random forests and SVM that are learned using default parameters values provided by scikit-learn, an open source machine learning library for Python.Keywords
This publication has 14 references indexed in Scilit:
- An intelligent algorithm with feature selection and decision rules applied to anomaly intrusion detectionApplied Soft Computing, 2012
- Swarm intelligence in intrusion detection: A surveyComputers & Security, 2011
- The use of computational intelligence in intrusion detection systems: A reviewApplied Soft Computing, 2010
- A comparative study of Artificial Bee Colony algorithmApplied Mathematics and Computation, 2009
- Particle swarm optimization for parameter determination and feature selection of support vector machinesExpert Systems with Applications, 2008
- Random-Forests-Based Network Intrusion Detection SystemsIEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), 2008
- On the performance of artificial bee colony (ABC) algorithmApplied Soft Computing, 2008
- A new optimizer using particle swarm theoryPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Random ForestsMachine Learning, 2001
- Support-vector networksMachine Learning, 1995