Arabic dialect sentiment analysis with ZERO effort. \\ Case study: Algerian dialect

Open Access

15 May 2020

journal article
research article
Published by IBERAMIA: Sociedad Iberoamericana de Inteligencia Artificial in INTELIGENCIA ARTIFICIAL

Vol. 23 (65), 124-135
https://doi.org/10.4114/intartif.vol23iss65pp124-135

Abstract

This paper presents an analytic study showing that it is entirely possible to analyze the sentiment of an Arabic dialect without constructing any resources. The idea of this work is to use the resources dedicated to a given dialect \textit{X} for analyzing the sentiment of another dialect \textit{Y}. The unique condition is to have \textit{X} and \textit{Y} in the same category of dialects. We apply this idea on Algerian dialect, which is a Maghrebi Arabic dialect that suffers from limited available tools and other handling resources required for automatic sentiment analysis. To do this analysis, we rely on Maghrebi dialect resources and two manually annotated sentiment corpus for respectively Tunisian and Moroccan dialect. We also use a large corpus for Maghrebi dialect. We use a state-of-the-art system and propose a new deep learning architecture for automatically classify the sentiment of Arabic dialect (Algerian dialect). Experimental results show that F1-score is up to 83% and it is achieved by Multilayer Perceptron (MLP) with Tunisian corpus and with Long short-term memory (LSTM) with the combination of Tunisian and Moroccan. An improvement of 15% compared to its closest competitor was observed through this study. Ongoing work is aimed at manually constructing an annotated sentiment corpus for Algerian dialect and comparing the results

Keywords

This publication has 7 references indexed in Scilit:

Identification of genetic markers associated with milk production traits in Chinese Holstein cattle based on post genome-wide association studies
Animal Biotechnology, 2019
Using Word Embedding and Ensemble Learning for Highly Imbalanced Data Sentiment Analysis in Short Arabic Text
Procedia Computer Science, 2017
AraSenTi-Tweet: A Corpus for Arabic Sentiment Analysis of Saudi Tweets
Procedia Computer Science, 2017
Arabic Dialect Identification
Computational Linguistics, 2014
Supervised Sequence Labelling with Recurrent Neural Networks
Published by Springer Science and Business Media LLC ,2012
OCA: Opinion corpus for Arabic
Journal of the American Society for Information Science and Technology, 2011
A Survey on Transfer Learning
IEEE Transactions on Knowledge and Data Engineering, 2009

Cited by 5 articles