Home > Research > Publications & Outputs > Semantic textual similarity based on deep learn...

Links

Text available via DOI:

View graph of relations

Semantic textual similarity based on deep learning: Can it improve matching and retrieval for Translation Memory tools?

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNChapter

Published

Standard

Semantic textual similarity based on deep learning: Can it improve matching and retrieval for Translation Memory tools? / Ranasinghe, Tharindu; Mitkov, Ruslan; Orasan, Constantin et al.
Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations. ed. / Julia Lavid-Lopez; Carmen Maiz-Arevalo; Juan Rafael Zamorano-Mansilla. John Benjamins, 2021. p. 101-124 (Benjamins Translation Library; Vol. 158).

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNChapter

Harvard

Ranasinghe, T, Mitkov, R, Orasan, C & Quintana, RC 2021, Semantic textual similarity based on deep learning: Can it improve matching and retrieval for Translation Memory tools? in J Lavid-Lopez, C Maiz-Arevalo & JR Zamorano-Mansilla (eds), Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations. Benjamins Translation Library, vol. 158, John Benjamins, pp. 101-124. https://doi.org/10.1075/btl.158.04ran

APA

Ranasinghe, T., Mitkov, R., Orasan, C., & Quintana, R. C. (2021). Semantic textual similarity based on deep learning: Can it improve matching and retrieval for Translation Memory tools? In J. Lavid-Lopez, C. Maiz-Arevalo, & J. R. Zamorano-Mansilla (Eds.), Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations (pp. 101-124). (Benjamins Translation Library; Vol. 158). John Benjamins. https://doi.org/10.1075/btl.158.04ran

Vancouver

Ranasinghe T, Mitkov R, Orasan C, Quintana RC. Semantic textual similarity based on deep learning: Can it improve matching and retrieval for Translation Memory tools? In Lavid-Lopez J, Maiz-Arevalo C, Zamorano-Mansilla JR, editors, Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations. John Benjamins. 2021. p. 101-124. (Benjamins Translation Library). doi: 10.1075/btl.158.04ran

Author

Ranasinghe, Tharindu ; Mitkov, Ruslan ; Orasan, Constantin et al. / Semantic textual similarity based on deep learning: Can it improve matching and retrieval for Translation Memory tools?. Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations. editor / Julia Lavid-Lopez ; Carmen Maiz-Arevalo ; Juan Rafael Zamorano-Mansilla. John Benjamins, 2021. pp. 101-124 (Benjamins Translation Library).

Bibtex

@inbook{2b90e3044efa41118317fb668c348ca9,
title = "Semantic textual similarity based on deep learning: Can it improve matching and retrieval for Translation Memory tools?",
abstract = "This study proposes an original methodology to underpin the operation of new generation Translation Memory (TM) systems where the translations to be retrieved from the TM database are matched not on the basis of Levenshtein (edit) distance but by employing innovative Natural Language Processing (NLP) and Deep Learning (DL) techniques. Three DL sentence encoders were experimented with to retrieve TM matches in English-Spanish sentence pairs from the DGT TM dataset. Each sentence encoder was compared with Okapi which uses edit distance to retrieve the best match. 1 The automatic evaluation shows the benefit of the DL technology for TM matching and holds promise for the implementation of the TM tool itself, which is our next project.",
keywords = "Deep learning, Machine translation, Okapi, Semantic similarity, Textual similarity, Translation memory",
author = "Tharindu Ranasinghe and Ruslan Mitkov and Constantin Orasan and Quintana, {Roc{\'i}o Caro}",
year = "2021",
month = dec,
day = "8",
doi = "10.1075/btl.158.04ran",
language = "English",
series = "Benjamins Translation Library",
publisher = "John Benjamins",
pages = "101--124",
editor = "Julia Lavid-Lopez and Carmen Maiz-Arevalo and Zamorano-Mansilla, {Juan Rafael}",
booktitle = "Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations",

}

RIS

TY - CHAP

T1 - Semantic textual similarity based on deep learning: Can it improve matching and retrieval for Translation Memory tools?

AU - Ranasinghe, Tharindu

AU - Mitkov, Ruslan

AU - Orasan, Constantin

AU - Quintana, Rocío Caro

PY - 2021/12/8

Y1 - 2021/12/8

N2 - This study proposes an original methodology to underpin the operation of new generation Translation Memory (TM) systems where the translations to be retrieved from the TM database are matched not on the basis of Levenshtein (edit) distance but by employing innovative Natural Language Processing (NLP) and Deep Learning (DL) techniques. Three DL sentence encoders were experimented with to retrieve TM matches in English-Spanish sentence pairs from the DGT TM dataset. Each sentence encoder was compared with Okapi which uses edit distance to retrieve the best match. 1 The automatic evaluation shows the benefit of the DL technology for TM matching and holds promise for the implementation of the TM tool itself, which is our next project.

AB - This study proposes an original methodology to underpin the operation of new generation Translation Memory (TM) systems where the translations to be retrieved from the TM database are matched not on the basis of Levenshtein (edit) distance but by employing innovative Natural Language Processing (NLP) and Deep Learning (DL) techniques. Three DL sentence encoders were experimented with to retrieve TM matches in English-Spanish sentence pairs from the DGT TM dataset. Each sentence encoder was compared with Okapi which uses edit distance to retrieve the best match. 1 The automatic evaluation shows the benefit of the DL technology for TM matching and holds promise for the implementation of the TM tool itself, which is our next project.

KW - Deep learning

KW - Machine translation

KW - Okapi

KW - Semantic similarity

KW - Textual similarity

KW - Translation memory

U2 - 10.1075/btl.158.04ran

DO - 10.1075/btl.158.04ran

M3 - Chapter

T3 - Benjamins Translation Library

SP - 101

EP - 124

BT - Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations

A2 - Lavid-Lopez, Julia

A2 - Maiz-Arevalo, Carmen

A2 - Zamorano-Mansilla, Juan Rafael

PB - John Benjamins

ER -