An Intelligent Outlier Detection Method With One Class Support Tucker Machine and Genetic Algorithm Toward Big Sensor Data in Internet of Things
- 9 August 2018
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Industrial Electronics
- Vol. 66 (6), 4672-4683
- https://doi.org/10.1109/tie.2018.2860568
Abstract
Various types of sensor data can be collected by IoT. Each sensor node has spatial attributes and may also be associated with a large number of measurement data that evolve over time; therefore, these high-dimensional sensor data are inherently large scale. Detecting outliers in large-scale IoT sensor data is a challenging task. Most existing anomaly detection methods are based on a vector representation. However, large-scale IoT sensor data have characteristics that make tensor methods more efficient for extracting information. The vector-based methods can destroy original structural information and correlation within large-scale sensor data, resulting in the problem of the “curse of dimensionality,” and some outliers hence cannot be detected. In this paper, we propose a one-class support Tucker machine (OCSTuM) and an OCSTuM based on tensor Tucker factorization and a genetic algorithm called GA-OCSTuM. These methods extend one-class support vector machines to tensor space. OCSTuM and GA-OCSTuM are unsupervised anomaly detection approaches for big sensor data. They retain the structural information of data while improving the accuracy and efficiency of anomaly detection. The experimental evaluations on real datasets demonstrate that our proposed method improves the accuracy and efficiency of anomaly detection while retaining the intrinsic structure of big sensor data.Keywords
Funding Information
- National Key R&D Program of China (2016YFC0201400)
- Provincial Key R&D Program of Zhejiang Province (2017C03019)
- National Natural Science Foundation of China (U1509217)
- International Science and Technology Cooperation Program of Zhejiang Province for Joint Research in High-tech Industry (2016C54007)
- Hunan Provincial Natural Science Foundation (2017JJ3252)
This publication has 24 references indexed in Scilit:
- A Low-Rank Approximation-Based Transductive Support Tensor Machine for Semisupervised ClassificationIEEE Transactions on Image Processing, 2015
- Limestone: High-throughput candidate phenotype generation via tensor factorizationJournal of Biomedical Informatics, 2014
- DuSK: A Dual Structure-preserving Kernel for Supervised Tensor Learning with Applications to NeuroimagesProceedings of the 2020 Siam International Conference on Data Mining, 2014
- Three-way compositional analysis of water quality monitoring dataEnvironmental and Ecological Statistics, 2014
- Quality relevant nonlinear batch process performance monitoring using a kernel based multiway non-Gaussian latent subspace projection approachJournal of Process Control, 2014
- Higher rank Support Tensor Machines for visual recognitionPattern Recognition, 2012
- A kernel-based framework to tensorial data analysisNeural Networks, 2011
- Tensor Decompositions and ApplicationsSIAM Review, 2009
- Supervised tensor learningKnowledge and Information Systems, 2007
- A GA-based feature selection and parameters optimizationfor support vector machinesExpert Systems with Applications, 2006