A new semantic attribute deep learning with a linguistic attribute hierarchy for spam detection
Open Access
- 1 May 2017
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 3862-3869
- https://doi.org/10.1109/ijcnn.2017.7966343
Abstract
The massive increase in spam poses a serious threat to email and SMS, which have become important means of communication. Spam not only annoys users but also poses a security threat. Machine learning techniques have been widely used for spam detection. In this paper, we propose another form of deep learning, a linguistic attribute hierarchy embedded with linguistic decision trees, for spam detection, and examine the effect of semantic attributes, as represented by the linguistic attribute hierarchy, on spam detection. A case study on the SMS message database from the UCI machine learning repository shows that a linguistic attribute hierarchy embedded with linguistic decision trees provides a transparent approach to in-depth analysis of attribute impact on spam detection. This approach can not only efficiently tackle the 'curse of dimensionality' in spam detection with massive attributes, but also improve the performance of spam detection when the semantic attributes are organized into a proper hierarchy.