

fBERT: A Neural Transformer for Identifying Offensive Content

Research output: Conference contribution/paper in book/report/proceedings with ISBN/ISSN (peer-reviewed)

Published
Publication date: 7/11/2021
Host publication: Findings of the Association for Computational Linguistics: EMNLP 2021
Place of publication: Stroudsburg, PA
Publisher: Association for Computational Linguistics
Pages: 1792-1798
Number of pages: 7
ISBN (electronic): 9781955917100
Original language: English
Event: The 2021 Conference on Empirical Methods in Natural Language Processing, Barceló Bávaro Convention Centre, Punta Cana, Dominican Republic
Duration: 7/11/2021 – 11/11/2021
Internet address: https://2021.emnlp.org/

Conference

Conference: The 2021 Conference on Empirical Methods in Natural Language Processing
Abbreviated title: EMNLP 2021
Country/Territory: Dominican Republic
City: Punta Cana
Period: 7/11/21 – 11/11/21
Internet address: https://2021.emnlp.org/

Abstract

Transformer-based models such as BERT, XLNet, and XLM-R have achieved state-of-the-art performance on a variety of NLP tasks, including the identification of offensive language and hate speech, an important problem on social media. In this paper, we present fBERT, a BERT model retrained on SOLID, the largest available English offensive language identification corpus, with over 1.4 million offensive instances. We evaluate fBERT's performance at identifying offensive content on multiple English datasets, and we test several thresholds for selecting instances from SOLID. The fBERT model will be made freely available to the community.
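The abstract mentions testing several thresholds for selecting instances from SOLID. A minimal Python sketch of what such threshold-based filtering could look like; the field names and confidence scores below are illustrative assumptions, not the actual SOLID schema or the authors' implementation:

```python
# Sketch of threshold-based instance selection: SOLID is semi-supervised, so
# each instance carries a confidence score, and only instances whose score
# clears a chosen threshold are kept for retraining.
# NOTE: "avg_conf" and the sample rows are hypothetical placeholders.

def select_instances(instances, threshold):
    """Keep instances whose confidence score meets or exceeds the threshold."""
    return [inst for inst in instances if inst["avg_conf"] >= threshold]

# Toy data standing in for SOLID rows (text, average confidence score)
solid_sample = [
    {"text": "tweet A", "avg_conf": 0.95},
    {"text": "tweet B", "avg_conf": 0.60},
    {"text": "tweet C", "avg_conf": 0.85},
]

# Try several thresholds, as the paper does, and compare the resulting
# corpus sizes before retraining.
for t in (0.5, 0.8, 0.9):
    kept = select_instances(solid_sample, t)
    print(f"threshold={t}: {len(kept)} instances kept")
```

Raising the threshold trades corpus size for label reliability, which is the tension the paper's threshold experiments explore.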