Home > Research > Publications & Outputs > An approach for pronunciation classification of...

Links

Text available via DOI:

View graph of relations

An approach for pronunciation classification of classical arabic phonemes using deep learning

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published
  • Amna Asif
  • Hamid Mukhtar
  • Fatimah Alqadheeb
  • Hafiz Farooq Ahmad
  • Abdulaziz Alhumam
Close
Article number238
<mark>Journal publication date</mark>1/01/2022
<mark>Journal</mark>Applied Sciences (Switzerland)
Issue number1
Volume12
Publication StatusPublished
<mark>Original language</mark>English

Abstract

A mispronunciation of Arabic short vowels can change the meaning of a complete sentence. For this reason, both the students and teachers of Classical Arabic (CA) are required extra practice for correcting students’ pronunciation of Arabic short vowels. That makes the teaching and learning task cumbersome for both parties. An intelligent process of students’ evaluation can make learning and teaching easier for both students and teachers. Given that online learning has become a norm these days, modern learning requires assessment by virtual teachers. In our case, the task is about recognizing the exact pronunciation of Arabic alphabets according to the standards. A major challenge in the recognition of precise pronunciation of Arabic alphabets is the correct identification of a large number of short vowels, which cannot be dealt with using traditional statistical audio processing techniques and machine learning models. Therefore, we developed a model that classifies Arabic short vowels using Deep Neural Networks (DNN). The model is constructed from scratch by: (i) collecting a new audio dataset, (ii) developing a neural network architecture, and (iii) optimizing and fine‐tuning the developed model through several iterations to achieve high classification accuracy. Given a set of unseen audio samples of uttered short vowels, our proposed model has reached the testing accuracy of 95.77%. We can say that our results can be used by the experts and researchers for building better intelligent learning support systems in Arabic speech processing.