A New Method for Data Stream Mining Based on the Misclassification Error
- 16 July 2014
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Neural Networks and Learning Systems
- Vol. 26 (5), 1048-1059
- https://doi.org/10.1109/tnnls.2014.2333557
Abstract
In this paper, a new method for constructing decision trees for stream data is proposed. First a new splitting criterion based on the misclassification error is derived. A theorem is proven showing that the best attribute computed in considered node according to the available data sample is the same, with some high probability, as the attribute derived from the whole infinite data stream. Next this result is combined with the splitting criterion based on the Gini index. It is shown that such combination provides the highest accuracy among all studied algorithms.Keywords
Funding Information
- Foundation for Polish Science Team Programme through the European Union, European Regional Development Fund, Operational Programme Innovative Economy 2007-2013
This publication has 25 references indexed in Scilit:
- COMPOSE: A Semisupervised Learning Framework for Initially Labeled Nonstationary Streaming DataIEEE Transactions on Neural Networks and Learning Systems, 2013
- PANFIS: A Novel Incremental Learning MachineIEEE Transactions on Neural Networks and Learning Systems, 2013
- A survey on learning from data streams: current and future trendsProgress in Artificial Intelligence, 2012
- Handling drifts and shifts in on-line data streams with evolving fuzzy systemsApplied Soft Computing, 2011
- Stream mining: a novel architecture for ensemble-based classificationKnowledge and Information Systems, 2011
- A hybrid decision tree training method using data streamsKnowledge and Information Systems, 2010
- Ambiguous decision trees for mining concept-drifting data streamsPattern Recognition Letters, 2009
- Evolving fuzzy classifiers using different model architecturesFuzzy Sets and Systems, 2008
- Mining data streamsACM SIGMOD Record, 2005
- Evolving fuzzy neural networks for supervised/unsupervised online knowledge-based learningIEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2001