Creation of an Evaluation Corpus and Baseline Evaluation Scores for Welsh Text Summarisation

Computing and Communications

Research output: Contribution to conference - Without ISBN/ISSN › Conference paper › peer-review

Published

Standard

Creation of an Evaluation Corpus and Baseline Evaluation Scores for Welsh Text Summarisation. / El-Haj, Mahmoud ; Ezeani, Ignatius; Morris, Jonathan et al.
2022. 14-21 Paper presented at The 4th Celtic Language Technology Workshop (CLTW 2022), Marseille, France.

Research output: Contribution to conference - Without ISBN/ISSN › Conference paper › peer-review

Harvard

El-Haj, M , Ezeani, I, Morris, J & Knight, D 2022, 'Creation of an Evaluation Corpus and Baseline Evaluation Scores for Welsh Text Summarisation', Paper presented at The 4th Celtic Language Technology Workshop (CLTW 2022), Marseille, France, 20/06/22 - 20/06/22 pp. 14-21. <http://www.lrec-conf.org/proceedings/lrec2022/workshops/CLTW4/pdf/2022.cltw4-1.3.pdf>

APA

El-Haj, M., Ezeani, I., Morris, J., & Knight, D. (2022). Creation of an Evaluation Corpus and Baseline Evaluation Scores for Welsh Text Summarisation. 14-21. Paper presented at The 4th Celtic Language Technology Workshop (CLTW 2022), Marseille, France. http://www.lrec-conf.org/proceedings/lrec2022/workshops/CLTW4/pdf/2022.cltw4-1.3.pdf

Vancouver

El-Haj M , Ezeani I, Morris J, Knight D. Creation of an Evaluation Corpus and Baseline Evaluation Scores for Welsh Text Summarisation. 2022. Paper presented at The 4th Celtic Language Technology Workshop (CLTW 2022), Marseille, France.

Author

El-Haj, Mahmoud ; Ezeani, Ignatius ; Morris, Jonathan et al. / Creation of an Evaluation Corpus and Baseline Evaluation Scores for Welsh Text Summarisation. Paper presented at The 4th Celtic Language Technology Workshop (CLTW 2022), Marseille, France.8 p.

Bibtex

@conference{92aec905bea54a298e771162e022d480,

title = "Creation of an Evaluation Corpus and Baseline Evaluation Scores for Welsh Text Summarisation",

abstract = "As part of the effort to increase the availability of Welsh digital technology, this paper introduces the first human vs metrics Welsh summarisation evaluation results and dataset, which we provide freely for research purposes to help advance the work on Welsh summarisation. The system summaries were created using an extractive graph-based Welsh summariser. The system summaries were evaluated by both human and a range of ROUGE metric variants (e.g. ROUGE 1, 2, L and SU4). The summaries and evaluation results will serve as benchmarks for the development of summarisers and evaluation metrics in other minority language contexts.",

author = "Mahmoud El-Haj and Ignatius Ezeani and Jonathan Morris and Dawn Knight",

year = "2022",

month = jun,

day = "15",

language = "English",

pages = "14--21",

note = "The 4th Celtic Language Technology Workshop (CLTW 2022), CLTW 2022 ; Conference date: 20-06-2022 Through 20-06-2022",

url = "http://techiaith.bangor.ac.uk/ticeltaidd/",

}

RIS

TY - CONF

T1 - Creation of an Evaluation Corpus and Baseline Evaluation Scores for Welsh Text Summarisation

AU - El-Haj, Mahmoud

AU - Ezeani, Ignatius

AU - Morris, Jonathan

AU - Knight, Dawn

N1 - Conference code: 4

PY - 2022/6/15

Y1 - 2022/6/15

N2 - As part of the effort to increase the availability of Welsh digital technology, this paper introduces the first human vs metrics Welsh summarisation evaluation results and dataset, which we provide freely for research purposes to help advance the work on Welsh summarisation. The system summaries were created using an extractive graph-based Welsh summariser. The system summaries were evaluated by both human and a range of ROUGE metric variants (e.g. ROUGE 1, 2, L and SU4). The summaries and evaluation results will serve as benchmarks for the development of summarisers and evaluation metrics in other minority language contexts.

AB - As part of the effort to increase the availability of Welsh digital technology, this paper introduces the first human vs metrics Welsh summarisation evaluation results and dataset, which we provide freely for research purposes to help advance the work on Welsh summarisation. The system summaries were created using an extractive graph-based Welsh summariser. The system summaries were evaluated by both human and a range of ROUGE metric variants (e.g. ROUGE 1, 2, L and SU4). The summaries and evaluation results will serve as benchmarks for the development of summarisers and evaluation metrics in other minority language contexts.

M3 - Conference paper

SP - 14

EP - 21

T2 - The 4th Celtic Language Technology Workshop (CLTW 2022)

Y2 - 20 June 2022 through 20 June 2022

ER -

Research

Links