Home > Research > Publications & Outputs > MULDASA

Electronic data

  • applsci-12-03806

    Rights statement: Copyright: © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

    Final published version, 591 KB, PDF document

    Available under license: CC BY: Creative Commons Attribution 4.0 International License

Links

Text available via DOI:

View graph of relations

MULDASA: Multifactor Lexical Sentiment Analysis of Social-Media Content in Nonstandard Arabic Social Media

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

MULDASA: Multifactor Lexical Sentiment Analysis of Social-Media Content in Nonstandard Arabic Social Media. / Alwakid, Ghadah; Osman, Taha; El-Haj, Mahmoud et al.
In: Applied Sciences, Vol. 12, No. 8, 3806, 09.04.2022.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

Alwakid, G, Osman, T, El-Haj, M, Alanazi, S, Humayun, M & Us Sama, N 2022, 'MULDASA: Multifactor Lexical Sentiment Analysis of Social-Media Content in Nonstandard Arabic Social Media', Applied Sciences, vol. 12, no. 8, 3806. https://doi.org/10.3390/app12083806

APA

Alwakid, G., Osman, T., El-Haj, M., Alanazi, S., Humayun, M., & Us Sama, N. (2022). MULDASA: Multifactor Lexical Sentiment Analysis of Social-Media Content in Nonstandard Arabic Social Media. Applied Sciences, 12(8), Article 3806. https://doi.org/10.3390/app12083806

Vancouver

Alwakid G, Osman T, El-Haj M, Alanazi S, Humayun M, Us Sama N. MULDASA: Multifactor Lexical Sentiment Analysis of Social-Media Content in Nonstandard Arabic Social Media. Applied Sciences. 2022 Apr 9;12(8):3806. doi: 10.3390/app12083806

Author

Alwakid, Ghadah ; Osman, Taha ; El-Haj, Mahmoud et al. / MULDASA : Multifactor Lexical Sentiment Analysis of Social-Media Content in Nonstandard Arabic Social Media. In: Applied Sciences. 2022 ; Vol. 12, No. 8.

Bibtex

@article{04f1a82e877047cfadac6b5392684543,
title = "MULDASA: Multifactor Lexical Sentiment Analysis of Social-Media Content in Nonstandard Arabic Social Media",
abstract = "The semantically complicated Arabic natural vocabulary, and the shortage of available techniques and skills to capture Arabic emotions from text hinder Arabic sentiment analysis (ASA). Evaluating Arabic idioms that do not follow a conventional linguistic framework, such as contemporary standard Arabic (MSA), complicates an incredibly difficult procedure. Here, we define a novel lexical sentiment analysis approach for studying Arabic language tweets (TTs) from specialized digital media platforms. Many elements comprising emoji, intensifiers, negations, and other nonstandard expressions such as supplications, proverbs, and interjections are incorporated into the MULDASA algorithm to enhance the precision of opinion classifications. Root words in multidialectal sentiment LX are associated with emotions found in the content under study via a simple stemming procedure. Furthermore, a feature–sentiment correlation procedure is incorporated into the proposed technique to exclude viewpoints expressed that seem to be irrelevant to the area of concern. As part of our research into Saudi Arabian employability, we compiled a large sample of TTs in 6 different Arabic dialects. This research shows that this sentiment categorization method is useful, and that using all of the characteristics listed earlier improves the ability to accurately classify people{\textquoteright}s feelings. The classification accuracy of the proposed algorithm improved from 83.84% to 89.80%. Our approach also outperformed two existing research projects that employed a lexical approach for the sentiment analysis of Saudi dialects",
keywords = "Arabic, NLP, Sentiment analysis, Arabic NLP, corpus linguistics, computational linguistics, natural language processing, machine learning, deep learning",
author = "Ghadah Alwakid and Taha Osman and Mahmoud El-Haj and Saad Alanazi and Mamoona Humayun and {Us Sama}, Najm",
year = "2022",
month = apr,
day = "9",
doi = "10.3390/app12083806",
language = "English",
volume = "12",
journal = "Applied Sciences",
issn = "2076-3417",
publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",
number = "8",

}

RIS

TY - JOUR

T1 - MULDASA

T2 - Multifactor Lexical Sentiment Analysis of Social-Media Content in Nonstandard Arabic Social Media

AU - Alwakid, Ghadah

AU - Osman, Taha

AU - El-Haj, Mahmoud

AU - Alanazi, Saad

AU - Humayun, Mamoona

AU - Us Sama, Najm

PY - 2022/4/9

Y1 - 2022/4/9

N2 - The semantically complicated Arabic natural vocabulary, and the shortage of available techniques and skills to capture Arabic emotions from text hinder Arabic sentiment analysis (ASA). Evaluating Arabic idioms that do not follow a conventional linguistic framework, such as contemporary standard Arabic (MSA), complicates an incredibly difficult procedure. Here, we define a novel lexical sentiment analysis approach for studying Arabic language tweets (TTs) from specialized digital media platforms. Many elements comprising emoji, intensifiers, negations, and other nonstandard expressions such as supplications, proverbs, and interjections are incorporated into the MULDASA algorithm to enhance the precision of opinion classifications. Root words in multidialectal sentiment LX are associated with emotions found in the content under study via a simple stemming procedure. Furthermore, a feature–sentiment correlation procedure is incorporated into the proposed technique to exclude viewpoints expressed that seem to be irrelevant to the area of concern. As part of our research into Saudi Arabian employability, we compiled a large sample of TTs in 6 different Arabic dialects. This research shows that this sentiment categorization method is useful, and that using all of the characteristics listed earlier improves the ability to accurately classify people’s feelings. The classification accuracy of the proposed algorithm improved from 83.84% to 89.80%. Our approach also outperformed two existing research projects that employed a lexical approach for the sentiment analysis of Saudi dialects

AB - The semantically complicated Arabic natural vocabulary, and the shortage of available techniques and skills to capture Arabic emotions from text hinder Arabic sentiment analysis (ASA). Evaluating Arabic idioms that do not follow a conventional linguistic framework, such as contemporary standard Arabic (MSA), complicates an incredibly difficult procedure. Here, we define a novel lexical sentiment analysis approach for studying Arabic language tweets (TTs) from specialized digital media platforms. Many elements comprising emoji, intensifiers, negations, and other nonstandard expressions such as supplications, proverbs, and interjections are incorporated into the MULDASA algorithm to enhance the precision of opinion classifications. Root words in multidialectal sentiment LX are associated with emotions found in the content under study via a simple stemming procedure. Furthermore, a feature–sentiment correlation procedure is incorporated into the proposed technique to exclude viewpoints expressed that seem to be irrelevant to the area of concern. As part of our research into Saudi Arabian employability, we compiled a large sample of TTs in 6 different Arabic dialects. This research shows that this sentiment categorization method is useful, and that using all of the characteristics listed earlier improves the ability to accurately classify people’s feelings. The classification accuracy of the proposed algorithm improved from 83.84% to 89.80%. Our approach also outperformed two existing research projects that employed a lexical approach for the sentiment analysis of Saudi dialects

KW - Arabic

KW - NLP

KW - Sentiment analysis

KW - Arabic NLP

KW - corpus linguistics

KW - computational linguistics

KW - natural language processing

KW - machine learning

KW - deep learning

U2 - 10.3390/app12083806

DO - 10.3390/app12083806

M3 - Journal article

VL - 12

JO - Applied Sciences

JF - Applied Sciences

SN - 2076-3417

IS - 8

M1 - 3806

ER -