Home > Research > Publications & Outputs > Transformer Models for Offensive Language Ident...

Electronic data

  • T1-28

    Final published version, 248 KB, PDF document

View graph of relations

Transformer Models for Offensive Language Identification in Marathi

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published
Close
Publication date17/12/2021
Host publicationForum for Information Retrieval Evaluation (working notes)
PublisherCEUR Workshop Proceedings
Pages273-282
Number of pages10
<mark>Original language</mark>English
Event13th Annual Meeting of the Forum for Information Retrieval Evaluation -
Duration: 13/12/202117/12/2021

Conference

Conference13th Annual Meeting of the Forum for Information Retrieval Evaluation
Abbreviated titleFIRE
Period13/12/2117/12/21

Publication series

NameCEUR Workshop Proceedings
PublisherCEUR-WS
Volume3159
ISSN (Print)1613-0073

Conference

Conference13th Annual Meeting of the Forum for Information Retrieval Evaluation
Abbreviated titleFIRE
Period13/12/2117/12/21

Abstract

This paper describes the WLV-RIT entry to the Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages (HASOC) shared task of 2021. The HASOC 2021 organizers provided participants with annotated datasets containing social media posts of English, Hindi and Marathi. We participated in Marathi Subtask 1A: identifying hateful, offensive and profane content. In our methodology, we take advantage of available data from high resource languages by applying cross-lingual transformer-based models and transfer learning to make predictions to Marathi data. Our system achieved a macro F1 score of 0.91 for the test set and it ranked 1 st place out of 25 systems.