Developing Two Different Novel Techniques for Arabic Text Stemming
Open Access
- 1 January 2019
- journal article
- Published by Scientific Research Publishing, Inc. in Intelligent Information Management
- Vol. 11 (01), 1-23
- https://doi.org/10.4236/iim.2019.111001
Abstract
Stemming is used to produce stem or root of words. The process is vital to different research fields such as text mining, sentiment analysis, and text categorization, etc. Several techniques have been proposed to stemming Arabic text and among them, Khoja and light-10 stemmers are the most widely used. In this paper, we propose and evaluate two different stemming techniques to Arabic that are based on light stemming techniques. The new stemmers are compared to best reported light stemmer, which is light-10. Results and experiments, which were conducted using standard collections, reveal that The proposed stemmers yield 5.13% and 13.1% improvement in retrieval performance over light 10 with 0.369 average precision and 0.397, respectively and the improvement is statistically significant.Keywords
This publication has 1 reference indexed in Scilit:
- Dictionary-based techniques for cross-language information retrievalInformation Processing & Management, 2005