Home > Research > Publications & Outputs > Transferred Embeddings for Igbo Similarity, Ana...
View graph of relations

Transferred Embeddings for Igbo Similarity, Analogy, and Diacritic Restoration Tasks.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published

Standard

Transferred Embeddings for Igbo Similarity, Analogy, and Diacritic Restoration Tasks. / Ezeani, Ignatius; Onyenwe, Ikechukwu E.; Hepple, Mark.
COLING 2018 - 3rd Workshop on Semantic Deep Learning, SemDeep 2018 - Proceedings. 2018. p. 30-38 (COLING 2018 - 3rd Workshop on Semantic Deep Learning, SemDeep 2018 - Proceedings).

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Harvard

Ezeani, I, Onyenwe, IE & Hepple, M 2018, Transferred Embeddings for Igbo Similarity, Analogy, and Diacritic Restoration Tasks. in COLING 2018 - 3rd Workshop on Semantic Deep Learning, SemDeep 2018 - Proceedings. COLING 2018 - 3rd Workshop on Semantic Deep Learning, SemDeep 2018 - Proceedings, pp. 30-38. <https://aclanthology.org/W18-4004/>

APA

Ezeani, I., Onyenwe, I. E., & Hepple, M. (2018). Transferred Embeddings for Igbo Similarity, Analogy, and Diacritic Restoration Tasks. In COLING 2018 - 3rd Workshop on Semantic Deep Learning, SemDeep 2018 - Proceedings (pp. 30-38). (COLING 2018 - 3rd Workshop on Semantic Deep Learning, SemDeep 2018 - Proceedings). https://aclanthology.org/W18-4004/

Vancouver

Ezeani I, Onyenwe IE, Hepple M. Transferred Embeddings for Igbo Similarity, Analogy, and Diacritic Restoration Tasks. In COLING 2018 - 3rd Workshop on Semantic Deep Learning, SemDeep 2018 - Proceedings. 2018. p. 30-38. (COLING 2018 - 3rd Workshop on Semantic Deep Learning, SemDeep 2018 - Proceedings).

Author

Ezeani, Ignatius ; Onyenwe, Ikechukwu E. ; Hepple, Mark. / Transferred Embeddings for Igbo Similarity, Analogy, and Diacritic Restoration Tasks. COLING 2018 - 3rd Workshop on Semantic Deep Learning, SemDeep 2018 - Proceedings. 2018. pp. 30-38 (COLING 2018 - 3rd Workshop on Semantic Deep Learning, SemDeep 2018 - Proceedings).

Bibtex

@inproceedings{e4e8fb308aa241bf92c984f03630980a,
title = "Transferred Embeddings for Igbo Similarity, Analogy, and Diacritic Restoration Tasks.",
abstract = "Existing NLP models are mostly trained with data from well-resourced languages. Most minority languages face the challenge of lack of resources - data and technologies - for NLP research. Building these resources from scratch for each minority language will be very expensive, time-consuming and amount largely to unnecessarily re-inventing the wheel. In this paper, we applied transfer learning techniques to create Igbo word embeddings from a variety of existing English trained embeddings. Transfer learning methods were also used to build standard datasets for Igbo word similarity and analogy tasks for intrinsic evaluation of embeddings. These projected embeddings were also applied to diacritic restoration task. Our results indicate that the projected models not only outperform the trained ones on the semantic-based tasks of analogy, word-similarity, and odd-word identifying, but they also achieve enhanced performance on the diacritic restoration with learned diacritic embeddings.",
author = "Ignatius Ezeani and Onyenwe, {Ikechukwu E.} and Mark Hepple",
note = "DBLP License: DBLP's bibliographic metadata records provided through http://dblp.org/ are distributed under a Creative Commons CC0 1.0 Universal Public Domain Dedication. Although the bibliographic metadata records are provided consistent with CC0 1.0 Dedication, the content described by the metadata records is not. Content may be subject to copyright, rights of privacy, rights of publicity and other restrictions.",
year = "2018",
month = aug,
language = "English",
series = "COLING 2018 - 3rd Workshop on Semantic Deep Learning, SemDeep 2018 - Proceedings",
pages = "30--38",
booktitle = "COLING 2018 - 3rd Workshop on Semantic Deep Learning, SemDeep 2018 - Proceedings",

}

RIS

TY - GEN

T1 - Transferred Embeddings for Igbo Similarity, Analogy, and Diacritic Restoration Tasks.

AU - Ezeani, Ignatius

AU - Onyenwe, Ikechukwu E.

AU - Hepple, Mark

N1 - DBLP License: DBLP's bibliographic metadata records provided through http://dblp.org/ are distributed under a Creative Commons CC0 1.0 Universal Public Domain Dedication. Although the bibliographic metadata records are provided consistent with CC0 1.0 Dedication, the content described by the metadata records is not. Content may be subject to copyright, rights of privacy, rights of publicity and other restrictions.

PY - 2018/8

Y1 - 2018/8

N2 - Existing NLP models are mostly trained with data from well-resourced languages. Most minority languages face the challenge of lack of resources - data and technologies - for NLP research. Building these resources from scratch for each minority language will be very expensive, time-consuming and amount largely to unnecessarily re-inventing the wheel. In this paper, we applied transfer learning techniques to create Igbo word embeddings from a variety of existing English trained embeddings. Transfer learning methods were also used to build standard datasets for Igbo word similarity and analogy tasks for intrinsic evaluation of embeddings. These projected embeddings were also applied to diacritic restoration task. Our results indicate that the projected models not only outperform the trained ones on the semantic-based tasks of analogy, word-similarity, and odd-word identifying, but they also achieve enhanced performance on the diacritic restoration with learned diacritic embeddings.

AB - Existing NLP models are mostly trained with data from well-resourced languages. Most minority languages face the challenge of lack of resources - data and technologies - for NLP research. Building these resources from scratch for each minority language will be very expensive, time-consuming and amount largely to unnecessarily re-inventing the wheel. In this paper, we applied transfer learning techniques to create Igbo word embeddings from a variety of existing English trained embeddings. Transfer learning methods were also used to build standard datasets for Igbo word similarity and analogy tasks for intrinsic evaluation of embeddings. These projected embeddings were also applied to diacritic restoration task. Our results indicate that the projected models not only outperform the trained ones on the semantic-based tasks of analogy, word-similarity, and odd-word identifying, but they also achieve enhanced performance on the diacritic restoration with learned diacritic embeddings.

M3 - Conference contribution/Paper

T3 - COLING 2018 - 3rd Workshop on Semantic Deep Learning, SemDeep 2018 - Proceedings

SP - 30

EP - 38

BT - COLING 2018 - 3rd Workshop on Semantic Deep Learning, SemDeep 2018 - Proceedings

ER -