Arabic dialect sentiment analysis with ZERO effort. \\ Case study: Algerian dialect
Open Access
- 15 May 2020
- journal article
- research article
- Published by IBERAMIA: Sociedad Iberoamericana de Inteligencia Artificial in INTELIGENCIA ARTIFICIAL
- Vol. 23 (65), 124-135
- https://doi.org/10.4114/intartif.vol23iss65pp124-135
Abstract
This paper presents an analytic study showing that it is entirely possible to analyze the sentiment of an Arabic dialect without constructing any resources. The idea of this work is to use the resources dedicated to a given dialect \textit{X} for analyzing the sentiment of another dialect \textit{Y}. The unique condition is to have \textit{X} and \textit{Y} in the same category of dialects. We apply this idea on Algerian dialect, which is a Maghrebi Arabic dialect that suffers from limited available tools and other handling resources required for automatic sentiment analysis. To do this analysis, we rely on Maghrebi dialect resources and two manually annotated sentiment corpus for respectively Tunisian and Moroccan dialect. We also use a large corpus for Maghrebi dialect. We use a state-of-the-art system and propose a new deep learning architecture for automatically classify the sentiment of Arabic dialect (Algerian dialect). Experimental results show that F1-score is up to 83% and it is achieved by Multilayer Perceptron (MLP) with Tunisian corpus and with Long short-term memory (LSTM) with the combination of Tunisian and Moroccan. An improvement of 15% compared to its closest competitor was observed through this study. Ongoing work is aimed at manually constructing an annotated sentiment corpus for Algerian dialect and comparing the resultsKeywords
This publication has 7 references indexed in Scilit:
- Identification of genetic markers associated with milk production traits in Chinese Holstein cattle based on post genome-wide association studiesAnimal Biotechnology, 2019
- Using Word Embedding and Ensemble Learning for Highly Imbalanced Data Sentiment Analysis in Short Arabic TextProcedia Computer Science, 2017
- AraSenTi-Tweet: A Corpus for Arabic Sentiment Analysis of Saudi TweetsProcedia Computer Science, 2017
- Arabic Dialect IdentificationComputational Linguistics, 2014
- Supervised Sequence Labelling with Recurrent Neural NetworksPublished by Springer Science and Business Media LLC ,2012
- OCA: Opinion corpus for ArabicJournal of the American Society for Information Science and Technology, 2011
- A Survey on Transfer LearningIEEE Transactions on Knowledge and Data Engineering, 2009