Analysis of Machine learning Techniques Used in Behavior-Based Malware Detection

Top Cited Papers

1 December 2010

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 201-203
https://doi.org/10.1109/act.2010.33

Abstract

The increase of malware that are exploiting the Internet daily has become a serious threat. The manual heuristic inspection of malware analysis is no longer considered effective and efficient compared against the high spreading rate of malware. Hence, automated behavior-based malware detection using machine learning techniques is considered a profound solution. The behavior of each malware on an emulated (sandbox) environment will be automatically analyzed and will generate behavior reports. These reports will be preprocessed into sparse vector models for further machine learning (classification). The classifiers used in this research are k-Nearest Neighbors (kNN), Naïve Bayes, J48 Decision Tree, Support Vector Machine (SVM), and Multilayer Perceptron Neural Network (MlP). Based on the analysis of the tests and experimental results of all the 5 classifiers, the overall best performance was achieved by J48 decision tree with a recall of 95.9%, a false positive rate of 2.4%, a precision of 97.3%, and an accuracy of 96.8%. In summary, it can be concluded that a proof-of-concept based on automatic behavior-based malware analysis and the use of machine learning techniques could detect malware quite effectively and efficiently.

Keywords

This publication has 2 references indexed in Scilit:

The WEKA data mining software
ACM SIGKDD Explorations Newsletter, 2009
Mining specifications of malicious behavior
Published by Association for Computing Machinery (ACM) ,2007

Cited by 179 articles