MasakhaNER: Named Entity Recognition for African Languages

Research output: Contribution to Journal/Magazine > Journal article > peer-review

Published

Standard

MasakhaNER: Named Entity Recognition for African Languages. / Masakhane.
In: Transactions of the Association for Computational Linguistics, Vol. 9, 01.10.2021, p. 1116-1131.

Harvard

Masakhane 2021, 'MasakhaNER: Named Entity Recognition for African Languages', Transactions of the Association for Computational Linguistics, vol. 9, pp. 1116-1131. https://doi.org/10.1162/tacl_a_00416

APA

Masakhane (2021). MasakhaNER: Named Entity Recognition for African Languages. Transactions of the Association for Computational Linguistics, 9, 1116-1131. https://doi.org/10.1162/tacl_a_00416

Vancouver

Masakhane. MasakhaNER: Named Entity Recognition for African Languages. Transactions of the Association for Computational Linguistics. 2021 Oct 1;9:1116-1131. doi: 10.1162/tacl_a_00416

Author

Masakhane. / MasakhaNER: Named Entity Recognition for African Languages. In: Transactions of the Association for Computational Linguistics. 2021 ; Vol. 9. pp. 1116-1131.

Bibtex

@article{954d4346dbd34caaa4b7dde087188871,
title = "MasakhaNER: Named Entity Recognition for African Languages",
abstract = "We take a step towards addressing the under-representation of the African continent in NLP research by bringing together different stakeholders to create the first large, publicly available, high-quality dataset for named entity recognition (NER) in ten African languages. We detail the characteristics of these languages to help researchers and practitioners better understand the challenges they pose for NER tasks. We analyze our datasets and conduct an extensive empirical evaluation of state-of-the-art methods across both supervised and transfer learning settings. Finally, we release the data, code, and models to inspire future research on African NLP.",
author = "Masakhane and Adelani, {David Ifeoluwa} and Jade Abbott and Graham Neubig and Daniel D{\textquoteright}souza and Julia Kreutzer and Constantine Lignos and Chester Palen-Michel and Happy Buzaaba and Shruti Rijhwani and Sebastian Ruder and Stephen Mayhew and Azime, {Israel Abebe} and Muhammad, {Shamsuddeen H.} and Emezue, {Chris Chinenye} and Joyce Nakatumba-Nabende and Perez Ogayo and Aremu Anuoluwapo and Catherine Gitau and Derguene Mbaye and Jesujoba Alabi and Yimam, {Seid Muhie} and Gwadabe, {Tajuddeen Rabiu} and Ignatius Ezeani and Niyongabo, {Rubungo Andre} and Jonathan Mukiibi and Verrah Otiende and Iroro Orife and Davis David and Samba Ngom and Tosin Adewumi and Paul Rayson and Mofetoluwa Adeyemi and Gerald Muriuki and Emmanuel Anebi and Chiamaka Chukwuneke and Nkiruka Odu and Wairagala, {Eric Peter} and Samuel Oyerinde and Clemencia Siro and Bateesa, {Tobius Saul} and Temilola Oloyede and Yvonne Wambui and Victor Akinode and Deborah Nabagereka and Maurice Katusiime and Ayodele Awokoya and Mouhamadane MBOUP and Dibora Gebreyohannes and Henok Tilaye and Kelechi Nwaike",
year = "2021",
month = oct,
day = "1",
doi = "10.1162/tacl_a_00416",
language = "English",
volume = "9",
pages = "1116--1131",
journal = "Transactions of the Association for Computational Linguistics",
issn = "2307-387X",
publisher = "MIT Press Journals",
}

RIS

TY - JOUR

T1 - MasakhaNER: Named Entity Recognition for African Languages

AU - Masakhane

AU - Adelani, David Ifeoluwa

AU - Abbott, Jade

AU - Neubig, Graham

AU - D’souza, Daniel

AU - Kreutzer, Julia

AU - Lignos, Constantine

AU - Palen-Michel, Chester

AU - Buzaaba, Happy

AU - Rijhwani, Shruti

AU - Ruder, Sebastian

AU - Mayhew, Stephen

AU - Azime, Israel Abebe

AU - Muhammad, Shamsuddeen H.

AU - Emezue, Chris Chinenye

AU - Nakatumba-Nabende, Joyce

AU - Ogayo, Perez

AU - Anuoluwapo, Aremu

AU - Gitau, Catherine

AU - Mbaye, Derguene

AU - Alabi, Jesujoba

AU - Yimam, Seid Muhie

AU - Gwadabe, Tajuddeen Rabiu

AU - Ezeani, Ignatius

AU - Niyongabo, Rubungo Andre

AU - Mukiibi, Jonathan

AU - Otiende, Verrah

AU - Orife, Iroro

AU - David, Davis

AU - Ngom, Samba

AU - Adewumi, Tosin

AU - Rayson, Paul

AU - Adeyemi, Mofetoluwa

AU - Muriuki, Gerald

AU - Anebi, Emmanuel

AU - Chukwuneke, Chiamaka

AU - Odu, Nkiruka

AU - Wairagala, Eric Peter

AU - Oyerinde, Samuel

AU - Siro, Clemencia

AU - Bateesa, Tobius Saul

AU - Oloyede, Temilola

AU - Wambui, Yvonne

AU - Akinode, Victor

AU - Nabagereka, Deborah

AU - Katusiime, Maurice

AU - Awokoya, Ayodele

AU - MBOUP, Mouhamadane

AU - Gebreyohannes, Dibora

AU - Tilaye, Henok

AU - Nwaike, Kelechi

PY - 2021/10/1

Y1 - 2021/10/1

N2 - We take a step towards addressing the under-representation of the African continent in NLP research by bringing together different stakeholders to create the first large, publicly available, high-quality dataset for named entity recognition (NER) in ten African languages. We detail the characteristics of these languages to help researchers and practitioners better understand the challenges they pose for NER tasks. We analyze our datasets and conduct an extensive empirical evaluation of state-of-the-art methods across both supervised and transfer learning settings. Finally, we release the data, code, and models to inspire future research on African NLP.

AB - We take a step towards addressing the under-representation of the African continent in NLP research by bringing together different stakeholders to create the first large, publicly available, high-quality dataset for named entity recognition (NER) in ten African languages. We detail the characteristics of these languages to help researchers and practitioners better understand the challenges they pose for NER tasks. We analyze our datasets and conduct an extensive empirical evaluation of state-of-the-art methods across both supervised and transfer learning settings. Finally, we release the data, code, and models to inspire future research on African NLP.

U2 - 10.1162/tacl_a_00416

DO - 10.1162/tacl_a_00416

M3 - Journal article

VL - 9

SP - 1116

EP - 1131

JO - Transactions of the Association for Computational Linguistics

JF - Transactions of the Association for Computational Linguistics

SN - 2307-387X

ER -