Sentiment Analysis of Emirati Dialect
Open Access
- 17 May 2022
- journal article
- research article
- Published by MDPI AG in Big Data and Cognitive Computing
- Vol. 6 (2), 57
- https://doi.org/10.3390/bdcc6020057
Abstract
Recently, extensive studies and research in the Arabic Natural Language Processing (ANLP) field have been conducted for text classification and sentiment analysis. Moreover, the number of studies that target Arabic dialects has also increased. In this research paper, we constructed the first manually annotated dataset of the Emirati dialect for the Instagram platform. The constructed dataset consisted of more than 70,000 comments, mostly written in the Emirati dialect. We annotated the comments in the dataset based on text polarity, dividing them into positive, negative, and neutral categories, and the number of annotated comments was 70,000. Moreover, the dataset was also annotated for the dialect type, categorized into the Emirati dialect, Arabic dialects, and MSA. Preprocessing and TF-IDF features extraction approaches were applied to the constructed Emirati dataset to prepare the dataset for the sentiment analysis experiment and improve its classification performance. The sentiment analysis experiment was carried out on both balanced and unbalanced datasets using several machine learning classifiers. The evaluation metrics of the sentiment analysis experiments were accuracy, recall, precision, and f-measure. The results reported that the best accuracy result was 80.80%, and it was achieved when the ensemble model was applied for the sentiment classification of the unbalanced dataset.Keywords
This publication has 44 references indexed in Scilit:
- Geomatic Approaches for Modeling Land Change Scenarios. An IntroductionPublished by Springer Science and Business Media LLC ,2017
- An Enhanced Approach for Arabic Sentiment AnalysisInternational Journal of Artificial Intelligence & Applications, 2017
- Using Objective Words in the Reviews to Improve the Colloquial Arabic Sentiment AnalysisInternational Journal on Natural Language Computing, 2017
- Arabic Tweets Sentimental Analysis Using Machine LearningLecture Notes in Computer Science, 2017
- Sentiment Analysis of Tunisian Dialects: Linguistic Ressources and ExperimentsPublished by Association for Computational Linguistics (ACL) ,2017
- Using Word Embedding and Ensemble Learning for Highly Imbalanced Data Sentiment Analysis in Short Arabic TextProcedia Computer Science, 2017
- Arabic tweets sentiment analysis – a hybrid schemeJournal of Information Science, 2016
- Hierarchical Classifiers for Multi-Way Sentiment Analysis of Arabic ReviewsInternational Journal of Advanced Computer Science and Applications, 2016
- Sentiment analysis for dialectical ArabicPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015
- A Weighted Voting Classifier Based on Differential EvolutionAbstract and Applied Analysis, 2014