Abstract
Hate speech poses a persistent threat to society, causing profound harm to both individuals and communities. Detecting such content is essential for promoting safer and more inclusive environments. While previous research has primarily focused on text-based or image-based hate speech detection, video-based hate detection remains relatively underexplored. A key barrier is the limited availability of high-quality video datasets. Existing hateful video datasets are typically limited in scale, diversity, and annotation depth, often labeling hateful content without further distinguishing between explicit and implicit forms. In this work, we present DeHate, which, to the best of our knowledge, is the largest hateful video dataset to date. DeHate comprises 6689 videos collected from two platforms and spanning six social groups. Each video is annotated with fine-grained labels that differentiate explicit, implicit, and non-hateful content, along with segment-level localization of hate, identification of contributing modalities, and specification of the targeted groups. Through detailed analysis of annotated videos across platforms, we reveal distinct patterns in how hateful content is conveyed, offering a comprehensive comparison between explicit and implicit hate in terms of their prevalence and characteristics. Furthermore, we benchmark state-of-the-art models, including both uni-modal and multi-modal architectures, and identify persistent challenges in detecting subtle and context-dependent forms of hate. Our findings highlight the importance of holistic and fine-grained hateful video datasets for advancing research in hate speech detection. Disclaimer: This paper contains sensitive content that may be disturbing to some readers.
| Original language | English |
|---|---|
| Title of host publication | MM '25 |
| Subtitle of host publication | Proceedings of the 33rd ACM International Conference on Multimedia |
| Publisher | Association for Computing Machinery |
| Pages | 13177-13183 |
| Number of pages | 7 |
| ISBN (Electronic) | 9798400720352 |
| DOIs | |
| Publication status | Published - 27 Oct 2025 |
| Event | 33rd ACM International Conference on Multimedia, MM 2025 - Dublin, Ireland Duration: 27 Oct 2025 → 31 Oct 2025 |
Conference
| Conference | 33rd ACM International Conference on Multimedia, MM 2025 |
|---|---|
| Country/Territory | Ireland |
| City | Dublin |
| Period | 27/10/25 → 31/10/25 |
Bibliographical note
Publisher Copyright:© 2025 Copyright held by the owner/author(s).
Keywords
- dataset
- hate speech
- hateful content detection
- multimodal
ASJC Scopus subject areas
- Human-Computer Interaction
- Software
- Artificial Intelligence
- Computer Graphics and Computer-Aided Design
Fingerprint
Dive into the research topics of 'DeHate: A Holistic Hateful Video Dataset for Explicit and Implicit Hate Detection'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver