Final published version, 288 KB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License
Research output: Working paper › Preprint
}
TY - UNPB
T1 - Transformer-based Detection of Multiword Expressions in Flower and Plant Names
AU - Premasiri, Damith
AU - Haddad, Amal Haddad
AU - Ranasinghe, Tharindu
AU - Mitkov, Ruslan
N1 - Submitted to The 5th Workshop on Multi-word Units in Machine Translation and Translation Technology at Europhras2022
PY - 2022/9/20
Y1 - 2022/9/20
AB - A multiword expression (MWE) is a sequence of words that collectively conveys a meaning that cannot be derived from its individual words. Processing MWEs is crucial in many natural language processing (NLP) applications, including machine translation and terminology extraction. Detecting MWEs in different domains is therefore an important research topic. In this paper, we explore state-of-the-art neural transformers for detecting MWEs in flower and plant names. We evaluate different transformer models on a dataset created from the Encyclopedia of Plants and Flower. We empirically show that transformer models outperform previous neural models based on long short-term memory (LSTM).
KW - cs.CL
M3 - Preprint
BT - Transformer-based Detection of Multiword Expressions in Flower and Plant Names
PB - arXiv
ER -