Evaluating Large Language Models in Relationship Extraction from Unstructured Data: Empirical Study from Holocaust Testimonies

Computing and Communications

Associated organisational units

Text available via DOI:

https://doi.org/10.26615/978-954-452-092-2_013
Final published version

View graph of relations

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Published

Standard

Evaluating Large Language Models in Relationship Extraction from Unstructured Data: Empirical Study from Holocaust Testimonies. / Nanomi Arachchige, Isuri; Ha, Le An; Mitkov, Ruslan et al.
International Conference Recent Advances in Natural Language Processing, RANLP 2023. ed. / Galia Angelova; Maria Kunilovskaya; Ruslan Mitkov. Association for Computational Linguistics, 2023. p. 117-123 (International Conference Recent Advances in Natural Language Processing, RANLP).

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Harvard

Nanomi Arachchige, I, Ha, LA, Mitkov, R & Nahar, V 2023, Evaluating Large Language Models in Relationship Extraction from Unstructured Data: Empirical Study from Holocaust Testimonies. in G Angelova, M Kunilovskaya & R Mitkov (eds), International Conference Recent Advances in Natural Language Processing, RANLP 2023. International Conference Recent Advances in Natural Language Processing, RANLP, Association for Computational Linguistics, pp. 117-123. https://doi.org/10.26615/978-954-452-092-2_013

APA

Nanomi Arachchige, I., Ha, L. A., Mitkov, R., & Nahar, V. (2023). Evaluating Large Language Models in Relationship Extraction from Unstructured Data: Empirical Study from Holocaust Testimonies. In G. Angelova, M. Kunilovskaya, & R. Mitkov (Eds.), International Conference Recent Advances in Natural Language Processing, RANLP 2023 (pp. 117-123). (International Conference Recent Advances in Natural Language Processing, RANLP). Association for Computational Linguistics. https://doi.org/10.26615/978-954-452-092-2_013

Vancouver

Nanomi Arachchige I, Ha LA, Mitkov R, Nahar V. Evaluating Large Language Models in Relationship Extraction from Unstructured Data: Empirical Study from Holocaust Testimonies. In Angelova G, Kunilovskaya M, Mitkov R, editors, International Conference Recent Advances in Natural Language Processing, RANLP 2023. Association for Computational Linguistics. 2023. p. 117-123. (International Conference Recent Advances in Natural Language Processing, RANLP). doi: 10.26615/978-954-452-092-2_013

Author

Nanomi Arachchige, Isuri ; Ha, Le An ; Mitkov, Ruslan et al. / Evaluating Large Language Models in Relationship Extraction from Unstructured Data: Empirical Study from Holocaust Testimonies. International Conference Recent Advances in Natural Language Processing, RANLP 2023. editor / Galia Angelova ; Maria Kunilovskaya ; Ruslan Mitkov. Association for Computational Linguistics, 2023. pp. 117-123 (International Conference Recent Advances in Natural Language Processing, RANLP).

Bibtex

@inproceedings{9c79bcad276c494ea80517108223c560,

title = "Evaluating Large Language Models in Relationship Extraction from Unstructured Data: Empirical Study from Holocaust Testimonies",

abstract = "Relationship extraction from unstructured data remains one of the most challenging tasks in the field of Natural Language Processing (NLP). The complexity of relationship extraction arises from the need to comprehendthe underlying semantics, syntactic structures, and contextual dependencies within the text. Unstructured data poses challenges with diverse linguistic patterns, implicit relationships, contextual nuances, complicating accuraterelationship identification and extraction.The emergence of Large Language Models (LLMs), such as GPT (Generative Pre-trained Transformer), has indeed marked a significant advancement in the field of NLP. In this work, we assess and evaluate the effectiveness of LLMs in relationship extraction in the Holocaust testimonies within the context of the Historical realm. By delving into this domain specific context, we aim to gain deeper insights into the performance and capabilities of LLMs in accurately capturing and extracting relationships within the Holocaust domain bydeveloping a novel knowledge graph to visualise the relationships of the Holocaust. To the best of our knowledge, there is no existing study which discusses relationship extraction in Holocaust testimonies. The majority ofcurrent approaches for Information Extraction (IE) in historic documents are either manual or Optical Character Recognition (OCR) based.Moreover, in this study, we found that the Subject-Object-Verb extraction using GPT3- based relations produced more meaningful results compared to the Semantic Role labeling based triple extraction.",

author = "{Nanomi Arachchige}, Isuri and Ha, {Le An} and Ruslan Mitkov and Vinitar Nahar",

year = "2023",

month = sep,

day = "9",

doi = "10.26615/978-954-452-092-2_013",

language = "English",

series = "International Conference Recent Advances in Natural Language Processing, RANLP",

publisher = "Association for Computational Linguistics",

pages = "117--123",

editor = "Galia Angelova and Maria Kunilovskaya and Ruslan Mitkov",

booktitle = "International Conference Recent Advances in Natural Language Processing, RANLP 2023",

}

RIS

TY - GEN

T1 - Evaluating Large Language Models in Relationship Extraction from Unstructured Data: Empirical Study from Holocaust Testimonies

AU - Nanomi Arachchige, Isuri

AU - Ha, Le An

AU - Mitkov, Ruslan

AU - Nahar, Vinitar

PY - 2023/9/9

Y1 - 2023/9/9

N2 - Relationship extraction from unstructured data remains one of the most challenging tasks in the field of Natural Language Processing (NLP). The complexity of relationship extraction arises from the need to comprehendthe underlying semantics, syntactic structures, and contextual dependencies within the text. Unstructured data poses challenges with diverse linguistic patterns, implicit relationships, contextual nuances, complicating accuraterelationship identification and extraction.The emergence of Large Language Models (LLMs), such as GPT (Generative Pre-trained Transformer), has indeed marked a significant advancement in the field of NLP. In this work, we assess and evaluate the effectiveness of LLMs in relationship extraction in the Holocaust testimonies within the context of the Historical realm. By delving into this domain specific context, we aim to gain deeper insights into the performance and capabilities of LLMs in accurately capturing and extracting relationships within the Holocaust domain bydeveloping a novel knowledge graph to visualise the relationships of the Holocaust. To the best of our knowledge, there is no existing study which discusses relationship extraction in Holocaust testimonies. The majority ofcurrent approaches for Information Extraction (IE) in historic documents are either manual or Optical Character Recognition (OCR) based.Moreover, in this study, we found that the Subject-Object-Verb extraction using GPT3- based relations produced more meaningful results compared to the Semantic Role labeling based triple extraction.

AB - Relationship extraction from unstructured data remains one of the most challenging tasks in the field of Natural Language Processing (NLP). The complexity of relationship extraction arises from the need to comprehendthe underlying semantics, syntactic structures, and contextual dependencies within the text. Unstructured data poses challenges with diverse linguistic patterns, implicit relationships, contextual nuances, complicating accuraterelationship identification and extraction.The emergence of Large Language Models (LLMs), such as GPT (Generative Pre-trained Transformer), has indeed marked a significant advancement in the field of NLP. In this work, we assess and evaluate the effectiveness of LLMs in relationship extraction in the Holocaust testimonies within the context of the Historical realm. By delving into this domain specific context, we aim to gain deeper insights into the performance and capabilities of LLMs in accurately capturing and extracting relationships within the Holocaust domain bydeveloping a novel knowledge graph to visualise the relationships of the Holocaust. To the best of our knowledge, there is no existing study which discusses relationship extraction in Holocaust testimonies. The majority ofcurrent approaches for Information Extraction (IE) in historic documents are either manual or Optical Character Recognition (OCR) based.Moreover, in this study, we found that the Subject-Object-Verb extraction using GPT3- based relations produced more meaningful results compared to the Semantic Role labeling based triple extraction.

U2 - 10.26615/978-954-452-092-2_013

DO - 10.26615/978-954-452-092-2_013

M3 - Conference contribution/Paper

T3 - International Conference Recent Advances in Natural Language Processing, RANLP

SP - 117

EP - 123

BT - International Conference Recent Advances in Natural Language Processing, RANLP 2023

A2 - Angelova, Galia

A2 - Kunilovskaya, Maria

A2 - Mitkov, Ruslan

PB - Association for Computational Linguistics

ER -

Research

Associated organisational units

Links

Text available via DOI: