Stream Classification with Recurring and Novel Class Detection Using Class-Based Ensemble
- 1 December 2012
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Concept-evolution has recently received a lot of attention in the context of mining data streams. Concept-evolution occurs when a new class evolves in the stream. Although many recent studies address this issue, most of them do not consider the scenario of recurring classes in the stream. A class is called recurring if it appears in the stream, disappears for a while, and then reappears again. Existing data stream classification techniques either misclassify the recurring class instances as another class, or falsely identify the recurring classes as novel. This increases the prediction error of the classifiers, and in some cases causes unnecessary waste in memory and computational resources. In this paper we address the recurring class issue by proposing a novel "class-based" ensemble technique, which substitutes the traditional "chunk-based" ensemble approaches and correctly distinguishes between a recurring class and a novel one. We analytically and experimentally confirm the superiority of our method over state-of-the-art techniques.Keywords
This publication has 15 references indexed in Scilit:
- Detecting Recurring and Novel Classes in Concept-Drifting Data StreamsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- Enabling fast prediction for ensemble models on data streamsPublished by Association for Computing Machinery (ACM) ,2011
- Classifier and Cluster Ensembles for Mining Concept Drifting Data StreamsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2010
- Classification and Novel Class Detection in Concept-Drifting Data Streams under Time ConstraintsIEEE Transactions on Knowledge and Data Engineering, 2010
- Mining Data Streams with Labeled and Unlabeled Training ExamplesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2009
- Mining Concept-Drifting and Noisy Data Streams Using Ensemble ClassifiersPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2009
- Adapted One-versus-All Decision Trees for Data Stream ClassificationIEEE Transactions on Knowledge and Data Engineering, 2008
- Cluster-based novel concept detection in data streams applied to intrusion detection in computer networksPublished by Association for Computing Machinery (ACM) ,2008
- Mining time-changing data streamsPublished by Association for Computing Machinery (ACM) ,2001
- Learning in the presence of concept drift and hidden contextsMachine Learning, 1996