IgboBERT Models - Research Portal | Lancaster University

Associated organisational units

Keywords

Igbo, named entity recognition, BERT models, under-resourced, dataset

IgboBERT Models: Building and Training Transformer Models for the Igbo Language

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Published

Publication date	20/06/2022
Host publication	LREC 2022 Conference Proceedings
Editors	Nicoletta Calzolari
Place of Publication	Paris
Publisher	European Language Resources Association (ELRA)
Pages	5114–5122
Number of pages	9
ISBN (electronic)	9781095546726
<mark>Original language</mark>	English
Event	13th Language Resources and Evaluation Conference - Marseille, France Duration: 20/06/2022 → 25/06/2022 https://lrec2022.lrec-conf.org/en/

Conference

Conference	13th Language Resources and Evaluation Conference
Abbreviated title	LREC 2022
Country/Territory	France
City	Marseille
Period	20/06/22 → 25/06/22
Internet address	https://lrec2022.lrec-conf.org/en/

Conference

Conference	13th Language Resources and Evaluation Conference
Abbreviated title	LREC 2022
Country/Territory	France
City	Marseille
Period	20/06/22 → 25/06/22
Internet address	https://lrec2022.lrec-conf.org/en/

Abstract

This work presents a standard Igbo named entity recognition (IgboNER) dataset as well as the results from training and fine-tuning state-of-the-art transformer IgboNER models. We discuss the process of our dataset creation - data collection and annotation and quality checking. We also present experimental processes involved in building an IgboBERT language model from scratch as well as fine-tuning it along with other non-Igbo pre-trained models for the downstream IgboNER task. Our results show that, although the IgboNER task benefited hugely from fine-tuning large transformer model, fine-tuning a transformer model built from scratch with comparatively little Igbo text data seems to yield quite decent results for the IgboNER task. This work will contribute immensely to IgboNLP in particular as well as the wider African and low-resource NLP efforts.

Research

Associated organisational units

Links

Keywords

IgboBERT Models: Building and Training Transformer Models for the Igbo Language

Conference

Conference

Abstract

Quick Links

Connect With Us

Faculties & Depts

Contact Us