Learning Classifier System Based on Mean of Reward

Abstract

This paper focuses on the generalization of classifiers in noisy problems and aims at construction learning classifier system (LCS) that can acquire the optimal classifier subset by dynamically determining the classifier generalization criteria. In this paper, an accuracy-based LCS (XCS) that uses the mean of the reward (XCS-MR) is introduced, which can correctly identify classifiers as either accurate or inaccurate for noisy problems, and investigates its effectiveness when used for several noisy problems. Applying XCS and an XCS based on the variance of reward (XCS-VR) as the conventional LCSs, along with XCS-MR, to noisy 11-multiplexer problems where the reward value changes according to a Gaussian distribution, Cauchy distribution, and lognormal distribution revealed the following: (1) XCS-VR and XCS-MR could select the correct action for every type of reward distribution; (2) XCS-MR could appropriately generalize the classifiers with the smallest amount of data; and (3) XCS-MR could acquire the optimal classifier subset in every trial for every type of reward distribution.

Keywords

This publication has 8 references indexed in Scilit:

Variance-based Learning Classifier System without Convergence of Reward Estimation
Published by Association for Computing Machinery (ACM) ,2016
Rule reduction by selection strategy in XCS with adaptive action map
Evolutionary Intelligence, 2015
Towards generalization by identification-based XCS in multi-steps problem
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2011
Toward a Theory of Generalization and Learning in XCS
IEEE Transactions on Evolutionary Computation, 2004
Strength or Accuracy: Credit Assignment in Learning Classifier Systems
Published by Springer Science and Business Media LLC ,2004
An Analysis of Generalization in the XCS Classifier System
Evolutionary Computation, 1999
Classifier Fitness Based on Accuracy
Evolutionary Computation, 1995
Learning to predict by the methods of temporal differences
Machine Learning, 1988

Cited by 2 articles