Accepted author manuscript, 407 KB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License
Final published version, 450 KB, PDF document
Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License
Final published version
Licence: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License
Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review
Developing an Arabic Infectious Disease Ontology to Include Non-Standard Terminology. / Alsudias, Lama; Rayson, Paul.
12th International Conference on Language Resources and Evaluation: LREC2020. Paris : European Language Resources Association (ELRA), 2020. p. 4842-4850.Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review
}
TY - GEN
T1 - Developing an Arabic Infectious Disease Ontology to Include Non-Standard Terminology
AU - Alsudias, Lama
AU - Rayson, Paul
PY - 2020/5/11
Y1 - 2020/5/11
N2 - Building ontologies is a crucial part of the semantic web endeavour. In recent years, research interest has grown rapidly in supporting languages such as Arabic in NLP in general but there has been very little research on medical ontologies for Arabic. We present a new Arabic ontology in the infectious disease domain to support various important applications including the monitoring of infectious disease spread via social media. This ontology meaningfully integrates the scientific vocabularies of infectious diseases with their informal equivalents. We use ontology learning strategies with manual checking to build the ontology. We applied three statistical methods for term extraction from selected Arabic infectious diseases articles: TF-IDF, C-value, and YAKE. We also conducted a study, by consulting around 100 individuals, to discover the informal terms related to infectious diseases in Arabic. In future work, we will automatically extract the relations for infectious disease concepts but for now these are manually created. We report two complementary experiments to evaluate the ontology. First, a quantitative evaluation of the term extraction results and an additional qualitative evaluation by a domain expert.
AB - Building ontologies is a crucial part of the semantic web endeavour. In recent years, research interest has grown rapidly in supporting languages such as Arabic in NLP in general but there has been very little research on medical ontologies for Arabic. We present a new Arabic ontology in the infectious disease domain to support various important applications including the monitoring of infectious disease spread via social media. This ontology meaningfully integrates the scientific vocabularies of infectious diseases with their informal equivalents. We use ontology learning strategies with manual checking to build the ontology. We applied three statistical methods for term extraction from selected Arabic infectious diseases articles: TF-IDF, C-value, and YAKE. We also conducted a study, by consulting around 100 individuals, to discover the informal terms related to infectious diseases in Arabic. In future work, we will automatically extract the relations for infectious disease concepts but for now these are manually created. We report two complementary experiments to evaluate the ontology. First, a quantitative evaluation of the term extraction results and an additional qualitative evaluation by a domain expert.
M3 - Conference contribution/Paper
SN - 9791095546344
SP - 4842
EP - 4850
BT - 12th International Conference on Language Resources and Evaluation
PB - European Language Resources Association (ELRA)
CY - Paris
T2 - The 12th Edition of the Language Resources and Evaluation Conference (LREC2020)
Y2 - 11 May 2020 through 16 May 2020
ER -