MULDASA - Research Portal | Lancaster University

Home > Research > Publications & Outputs > MULDASA

Computing and Communications

Electronic data

applsci-12-03806
Rights statement: Copyright: © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Final published version, 591 KB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Text available via DOI:

https://doi.org/10.3390/app12083806
Final published version
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Keywords

Arabic, NLP, Sentiment analysis, Arabic NLP, corpus linguistics, computational linguistics, natural language processing, machine learning, deep learning

View graph of relations

MULDASA: Multifactor Lexical Sentiment Analysis of Social-Media Content in Nonstandard Arabic Social Media

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Ghadah Alwakid
Taha Osman
Mahmoud El-Haj
Saad Alanazi
Mamoona Humayun
Najm Us Sama

More...

Article number	3806
<mark>Journal publication date</mark>	9/04/2022
<mark>Journal</mark>	Applied Sciences
Issue number	8
Volume	12
Number of pages	18
Publication Status	Published
<mark>Original language</mark>	English

Abstract

The semantically complicated Arabic natural vocabulary, and the shortage of available techniques and skills to capture Arabic emotions from text hinder Arabic sentiment analysis (ASA). Evaluating Arabic idioms that do not follow a conventional linguistic framework, such as contemporary standard Arabic (MSA), complicates an incredibly difficult procedure. Here, we define a novel lexical sentiment analysis approach for studying Arabic language tweets (TTs) from specialized digital media platforms. Many elements comprising emoji, intensifiers, negations, and other nonstandard expressions such as supplications, proverbs, and interjections are incorporated into the MULDASA algorithm to enhance the precision of opinion classifications. Root words in multidialectal sentiment LX are associated with emotions found in the content under study via a simple stemming procedure. Furthermore, a feature–sentiment correlation procedure is incorporated into the proposed technique to exclude viewpoints expressed that seem to be irrelevant to the area of concern. As part of our research into Saudi Arabian employability, we compiled a large sample of TTs in 6 different Arabic dialects. This research shows that this sentiment categorization method is useful, and that using all of the characteristics listed earlier improves the ability to accurately classify people’s feelings. The classification accuracy of the proposed algorithm improved from 83.84% to 89.80%. Our approach also outperformed two existing research projects that employed a lexical approach for the sentiment analysis of Saudi dialects

Research

Electronic data

Links

Text available via DOI:

Keywords

MULDASA: Multifactor Lexical Sentiment Analysis of Social-Media Content in Nonstandard Arabic Social Media

Abstract

Quick Links

Connect With Us

Faculties & Depts

Contact Us