Multiclass From Binary: Expanding One-Versus-All, One-Versus-One and ECOC-Based Approaches
- 6 August 2013
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Neural Networks and Learning Systems
- Vol. 25 (2), 289-302
- https://doi.org/10.1109/tnnls.2013.2274735
Abstract
Recently, there has been a lot of success in the development of effective binary classifiers. Although many statistical classification techniques have natural multiclass extensions, some, such as the support vector machines, do not. The existing techniques for mapping multiclass problems onto a set of simpler binary classification problems run into serious efficiency problems when there are hundreds or even thousands of classes, and these are the scenarios where this paper's contributions shine. We introduce the concept of correlation and joint probability of base binary learners. We learn these properties during the training stage, group the binary leaner's based on their independence and, with a Bayesian approach, combine the results to predict the class of a new instance. Finally, we also discuss two additional strategies: one to reduce the number of required base learners in the multiclass classification, and another to find new base learners that might best complement the existing set. We use these two new procedures iteratively to complement the initial solution and improve the overall performance. This paper has two goals: finding the most discriminative binary classifiers to solve a multiclass problem and keeping up the efficiency, i.e., small number of base learners. We validate and compare the method with a diverse set of methods of the literature in several public available datasets that range from small (10 to 26 classes) to large multiclass problems (1000 classes) always using simple reproducible scenarios.Keywords
Funding Information
- São Paulo Research Foundation¿FAPESP (2010/05647-4)
- National Counsel of Technological and Scientific Development¿CNPq (307018/2010-5, 304352/2012-8)
- Microsoft
This publication has 39 references indexed in Scilit:
- An extensive experimental comparison of methods for multi-label learningPattern Recognition, 2012
- Classification of DNA sequences using Bloom filtersBioinformatics, 2010
- Pattern Recognition and Machine LearningPublished by Springer Science and Business Media LLC ,2006
- New Results on Error Correcting Output Codes of Kernel MachinesIEEE Transactions on Neural Networks, 2004
- Coding and decoding strategies for multi-class learning problemsInformation Fusion, 2003
- Normalized cuts and image segmentationIeee Transactions On Pattern Analysis and Machine Intelligence, 2000
- The Error Coding Method and PICTsJournal of Computational and Graphical Statistics, 1998
- No free lunch theorems for optimizationIEEE Transactions on Evolutionary Computation, 1997
- Support-vector networksMachine Learning, 1995
- Individual Comparisons by Ranking MethodsBiometrics Bulletin, 1945