An Optimal Text Categorization Algorithm Based on SVM
- 1 June 2006
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE) in 2006 International Conference on Communications, Circuits and Systems
- Vol. 3, 2137-2140
- https://doi.org/10.1109/icccas.2006.284921
Abstract
Text categorization is the process of assigning documents to a set of previously fixed categories. In this paper we develop an optimal SVM algorithm for text classification via multiple optimal strategies, such as a novel importance weight definition, the feature selection using the likelihood ratio for binomial distribution, the optimal parameter settings, etc. Comparison between our method and other conventional text classification algorithms is conducted on Reuter and TREC corpora. The experimental results indicate that our proposed algorithm yields much better performance than other conventional algorithmsKeywords
This publication has 10 references indexed in Scilit:
- Machine learning in automated text categorizationACM Computing Surveys, 2002
- Robust Classification for Imprecise EnvironmentsMachine Learning, 2001
- A re-examination of text categorization methodsPublished by Association for Computing Machinery (ACM) ,1999
- An Evaluation of Statistical Approaches to Text CategorizationInformation Retrieval Journal, 1999
- Inductive learning algorithms and representations for text categorizationPublished by Association for Computing Machinery (ACM) ,1998
- Naive (Bayes) at forty: The independence assumption in information retrievalLecture Notes in Computer Science, 1998
- Feature selection, perception learning, and a usability case study for text categorizationPublished by Association for Computing Machinery (ACM) ,1997
- The Nature of Statistical Learning TheoryPublished by Springer Science and Business Media LLC ,1995
- An example-based mapping method for text categorization and retrievalACM Transactions on Information Systems, 1994
- Automated learning of decision rules for text categorizationACM Transactions on Information Systems, 1994