Home > Research > Publications & Outputs > Creation of an Evaluation Corpus and Baseline E...

Links

View graph of relations

Creation of an Evaluation Corpus and Baseline Evaluation Scores for Welsh Text Summarisation

Research output: Contribution to conference - Without ISBN/ISSN Conference paperpeer-review

Published
Publication date15/06/2022
Number of pages8
Pages14-21
<mark>Original language</mark>English
EventThe 4th Celtic Language Technology Workshop (CLTW 2022) - Palais du Pharo, Marseille, France
Duration: 20/06/202220/06/2022
Conference number: 4
http://techiaith.bangor.ac.uk/ticeltaidd/

Workshop

WorkshopThe 4th Celtic Language Technology Workshop (CLTW 2022)
Abbreviated titleCLTW 2022
Country/TerritoryFrance
CityMarseille
Period20/06/2220/06/22
Internet address

Abstract

As part of the effort to increase the availability of Welsh digital technology, this paper introduces the first human vs metrics Welsh summarisation evaluation results and dataset, which we provide freely for research purposes to help advance the work on Welsh summarisation. The system summaries were created using an extractive graph-based Welsh summariser. The system summaries were evaluated by both human and a range of ROUGE metric variants (e.g. ROUGE 1, 2, L and SU4). The summaries and evaluation results will serve as benchmarks for the development of summarisers and evaluation metrics in other minority language contexts.