Classifying skewed data streams based on reusing data
- 1 October 2010
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 4, V4-90-V4-93
- https://doi.org/10.1109/iccasm.2010.5620201
Abstract
Current research community on data streams mining focuses on mining balanced data streams. However, the skewed class distribution appears in many data streams applications. In this paper, we introduce the method of discovering concept drifting on skewed data streams and propose an algorithm for classifying skewed data streams based on reusing data, RDFCSDS (Reuse Data for Classifying Skewed Data Streams). We evaluate RDFCSDS algorithm on Moving Hyperplane data set. The experiment results show that the sampling method based on reusing data works better than the simple sampling method and cluster sampling method on skewed data streams with concept drifting.Keywords
This publication has 11 references indexed in Scilit:
- OcVFDTPublished by Association for Computing Machinery (ACM) ,2009
- Positive Unlabeled Learning for Data Stream ClassificationPublished by Society for Industrial & Applied Mathematics (SIAM) ,2009
- Mining Data Streams with Skewed Distribution by Static Classifier EnsemblePublished by Springer Science and Business Media LLC ,2009
- One-Class Classification of Text Streams with Concept DriftPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2008
- Classifying Data Streams with Skewed Class Distributions and Concept DriftsIEEE Internet Computing, 2008
- A General Framework for Mining Concept-Drifting Data Streams with Skewed DistributionsPublished by Society for Industrial & Applied Mathematics (SIAM) ,2007
- An introduction to ROC analysisPattern Recognition Letters, 2005
- Systematic data selection to mine concept-drifting data streamsPublished by Association for Computing Machinery (ACM) ,2004
- A study of the behavior of several methods for balancing machine learning training dataACM SIGKDD Explorations Newsletter, 2004
- EditorialACM SIGKDD Explorations Newsletter, 2004