A new semantic attribute deep learning with a linguistic attribute hierarchy for spam detection

Open Access

1 May 2017

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 3862-3869
https://doi.org/10.1109/ijcnn.2017.7966343

Abstract

The massive increase of spam is posing a very serious threat to email and SMS, which have become an important means of communication. Not only do spams annoy users, but they also become a security threat. Machine learning techniques have been widely used for spam detection. In this paper, we propose another form of deep learning, a linguistic attribute hierarchy, embedded with linguistic decision trees, for spam detection, and examine the effect of semantic attributes on the spam detection, represented by the linguistic attribute hierarchy. A case study on the SMS message database from the UCI machine learning repository has shown that a linguistic attribute hierarchy embedded with linguistic decision trees provides a transparent approach to in-depth analysing attribute impact on spam detection. This approach can not only efficiently tackle `curse of dimensionality' in spam detection with massive attributes, but also improve the performance of spam detection when the semantic attributes are constructed to a proper hierarchy.

Keywords

This publication has 8 references indexed in Scilit:

Incremental information gain analysis of input attribute impact on RBF-kernel SVM spam detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2016
Survey of review spam detection using machine learning techniques
Journal of Big Data, 2015
A cascade of linguistic CMAC neural networks for decision making
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2015
Improving Knowledge Based Spam Detection Methods: The Effect of Malicious Related Features in Imbalance Data Distribution
International Journal of Communications, Network and System Sciences, 2015
An Analysis of Machine Learning Methods for Spam Host Detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2012
Contributions to the study of SMS spam filtering
Published by Association for Computing Machinery (ACM) ,2011
Beyond PageRank
Published by Association for Computing Machinery (ACM) ,2006
Induction of decision trees
Machine Learning, 1986

Cited by 14 articles