Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Chapter
Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Chapter
}
TY - CHAP
T1 - Semantic textual similarity based on deep learning: Can it improve matching and retrieval for Translation Memory tools?
AU - Ranasinghe, Tharindu
AU - Mitkov, Ruslan
AU - Orasan, Constantin
AU - Quintana, Rocío Caro
PY - 2021/12/8
Y1 - 2021/12/8
N2 - This study proposes an original methodology to underpin the operation of new generation Translation Memory (TM) systems where the translations to be retrieved from the TM database are matched not on the basis of Levenshtein (edit) distance but by employing innovative Natural Language Processing (NLP) and Deep Learning (DL) techniques. Three DL sentence encoders were experimented with to retrieve TM matches in English-Spanish sentence pairs from the DGT TM dataset. Each sentence encoder was compared with Okapi which uses edit distance to retrieve the best match. 1 The automatic evaluation shows the benefit of the DL technology for TM matching and holds promise for the implementation of the TM tool itself, which is our next project.
AB - This study proposes an original methodology to underpin the operation of new generation Translation Memory (TM) systems where the translations to be retrieved from the TM database are matched not on the basis of Levenshtein (edit) distance but by employing innovative Natural Language Processing (NLP) and Deep Learning (DL) techniques. Three DL sentence encoders were experimented with to retrieve TM matches in English-Spanish sentence pairs from the DGT TM dataset. Each sentence encoder was compared with Okapi which uses edit distance to retrieve the best match. 1 The automatic evaluation shows the benefit of the DL technology for TM matching and holds promise for the implementation of the TM tool itself, which is our next project.
KW - Deep learning
KW - Machine translation
KW - Okapi
KW - Semantic similarity
KW - Textual similarity
KW - Translation memory
U2 - 10.1075/btl.158.04ran
DO - 10.1075/btl.158.04ran
M3 - Chapter
T3 - Benjamins Translation Library
SP - 101
EP - 124
BT - Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations
A2 - Lavid-Lopez, Julia
A2 - Maiz-Arevalo, Carmen
A2 - Zamorano-Mansilla, Juan Rafael
PB - John Benjamins
ER -