Working paper icon

Working paper

Hatemoji: A test suite and adversarially-generated dataset for benchmarking and detecting emoji-based hate

Abstract:

Detecting online hate is a complex task, and low-performing detection models have harmful consequences when used for sensitive applications such as content moderation. Emoji-based hate is a key emerging challenge for online hate detection. We present HatemojiCheck, a test suite of 3,930 short-form statements that allows us to evaluate how detection models perform on hateful language expressed with emoji. Using the test suite, we expose weaknesses in existing hate detection models. To address ...

Expand abstract

Actions


Access Document


Files:

Authors


More by this author
Institution:
University of Oxford
Division:
SSD
Subgroup:
Oxford Internet Institute
Oxford college:
Hertford College
Role:
Author
ORCID:
0000-0002-6894-4951
Publication date:
2021-08-12
Pubs id:
1190679
Local pid:
pubs:1190679
Language:
English
Keywords:

Terms of use


Metrics


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP