Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review
Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review
}
TY - GEN
T1 - Correct pronunciation detection for classical Arabic phonemes using deep learning
AU - Alqadheeb, Fatimah
AU - Asif, Amna
AU - Ahmad, Hafiz Farooq
PY - 2021/5/24
Y1 - 2021/5/24
N2 - The pronunciation of the Arabic language is required all articulatory phonetics organs to formulate the correct sounds of a word. It is challenging for non-native Arabic speakers to learn to recite the Holy Quran with correct “Tajweed” rules and pronunciation. The limited contributions are made in the development of classic Arabic short vowels dataset that hinder the development of speech recognition system to facilitate the learner during their Holy Quran learning process. Therefore, it is required to have a collection of audio classic Arabic datasets that can help speech recognition and mispronouncing detection of the classic Arabic speech. In this paper, we aim to collect the classical Arabic alphabet with short vowels. Short vowels are an essential part of the Arabic language. One word of the Arabic language consists of at least one or two short vowels. First, we start with requirement gathering for the classical Arabic short vowels. Our primary focus is to record and process the collected audio dataset. The first release of the audio dataset collected consists of 2892 Arabic alphabet short vowels. A significant effort is applied in preprocessing of the dataset consisting of 84 classes of the Arabic alphabet short vowels. Then, the dataset is tested using a sequential convolution neural network (CNN) on 312 phonemes of the collected Arabic Alphabet /a/ "Alif" with short vowels. The result shows that CNN gives high testing accuracy of 100% and a loss of 0.27.
AB - The pronunciation of the Arabic language is required all articulatory phonetics organs to formulate the correct sounds of a word. It is challenging for non-native Arabic speakers to learn to recite the Holy Quran with correct “Tajweed” rules and pronunciation. The limited contributions are made in the development of classic Arabic short vowels dataset that hinder the development of speech recognition system to facilitate the learner during their Holy Quran learning process. Therefore, it is required to have a collection of audio classic Arabic datasets that can help speech recognition and mispronouncing detection of the classic Arabic speech. In this paper, we aim to collect the classical Arabic alphabet with short vowels. Short vowels are an essential part of the Arabic language. One word of the Arabic language consists of at least one or two short vowels. First, we start with requirement gathering for the classical Arabic short vowels. Our primary focus is to record and process the collected audio dataset. The first release of the audio dataset collected consists of 2892 Arabic alphabet short vowels. A significant effort is applied in preprocessing of the dataset consisting of 84 classes of the Arabic alphabet short vowels. Then, the dataset is tested using a sequential convolution neural network (CNN) on 312 phonemes of the collected Arabic Alphabet /a/ "Alif" with short vowels. The result shows that CNN gives high testing accuracy of 100% and a loss of 0.27.
KW - Audio processing
KW - Classic Arabic audio dataset
KW - Classical Arabic
KW - Deep learning
U2 - 10.1109/WIDSTAIF52235.2021.9430236
DO - 10.1109/WIDSTAIF52235.2021.9430236
M3 - Conference contribution/Paper
AN - SCOPUS:85107494930
SN - 9781665449496
T3 - 2021 International Conference of Women in Data Science at Taif University, WiDSTaif 2021
BT - 2021 International Conference of Women in Data Science at Taif University, WiDSTaif 2021
PB - IEEE
T2 - 2021 International Conference of Women in Data Science at Taif University, WiDSTaif 2021
Y2 - 30 March 2021 through 31 March 2021
ER -