Use of contextual information for feature ranking and discretization

1 January 1997

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Knowledge and Data Engineering

Vol. 9 (5), 718-730
https://doi.org/10.1109/69.634751

Abstract

Deriving classification rules or decision trees from examples is an important problem. When there are too many features, discarding weak features before the derivation process is highly desirable. When there are numeric features, they need to be discretized for the rule generation. We present a new approach to these problems. Traditional techniques make use of feature merits based on either the information theoretic, or the statistical correlation between each feature and the class. We instead assign merits to features by finding each feature's "obligation" to the class discrimination in the context of other features. The merits are then used to rank the features, select a feature subset, and discretize the numeric variables. Experience with benchmark example sets demonstrates that the new approach is a powerful alternative to the traditional methods. This paper concludes by posing some new technical issues that arise from this approach.

Keywords

This publication has 14 references indexed in Scilit:

Overcoming the Myopia of Inductive Learning Algorithms with RELIEFF
Applied Intelligence, 1997
Supervised and Unsupervised Discretization of Continuous Features
Published by Elsevier BV ,1995
Learning Boolean concepts in the presence of many irrelevant features
Artificial Intelligence, 1994
Optimized rule induction
IEEE Expert, 1993
Feature selection based on the structural indices of categories
Pattern Recognition, 1993
Consecutive interval query and dynamic programming on intervals
Lecture Notes in Computer Science, 1993
Dynamic programming on intervals
Lecture Notes in Computer Science, 1991
The CN2 induction algorithm
Machine Learning, 1989
Exploratory Projection Pursuit
Journal of the American Statistical Association, 1987
Graph-Based Algorithms for Boolean Function Manipulation
IEEE Transactions on Computers, 1986

Cited by 72 articles