Relevant Datasets - weiyinc11/HateSpeechModerationTwitch GitHub Wiki

Malicious URLs

https://github.com/aymeam/Datasets-for-Hate-Speech-Detection?tab=readme-ov-file
https://uli.tattle.co.in/
This paper
- used this dataset (SBIC) from 2020.
  - 150k structured annotations of social media posts, covering over 34k implications about a thousand demographic groups. Focuses on implications and bias. Hand-labeled with free-text justifications. Limited coverage of in-group messages.
- It also used this dataset (HateXplain)
  - 20k posts from Twitter and Gab, annotated by Amazon Mechanical Turk (MTurk) workers. Annotations include the groups referenced in each post.
This paper combines 3 datasets:
- This dataset
- Dynamically generated hate speech dataset
- SBIC above
This paper expands HateXplain