Interpretable Sexism Detection with Explainable Transformers
- With the widespread growth of social media platforms, instances of racism, cyberbullying, and the use of offensive language have surged. Consequently, women face challenges stemming from the presence of sexist content, which not only impedes their self-improvement but also exacerbates feelings of anxiety. Recognizing online sexism as a harmful phenomenon, the need for an automated tool to detect it has become paramount. This paper proposes an automated framework for extracting insights and identifying sexist language with high accuracy, utilizing machine learning (ML), deep learning (DL), and transformer-based models. Then, we incorporate the explainable AI (XAI) technique to enhance interpretability and make it more understandable to humans. To assess the performance of our method, we conducted experiments using a publicly available dataset focused on sexism. The experimental results underscore the effectiveness of our approach in detecting online sexism, surpassing the performance of several state-of-the-art methods.
| Document Type: | Conference Object |
|---|---|
| Language: | English |
| Author: | Shamima Rayhana, Md Shajalal, Md Atabuzzaman, Gunnar Stevens |
| Parent Title (English): | Biecek, Nowaczyk et al. (Eds.): Joint Proceedings of the xAI 2025 Late-breaking Work, Demos and Doctoral Consortium co-located with the 3rd World Conference on eXplainable Artificial Intelligence (xAI 2025), Istanbul, Turkey, July 9-11, 2025 |
| Number of pages: | 8 |
| First Page: | 153 |
| Last Page: | 160 |
| URN: | urn:nbn:de:hbz:1044-opus-91937 |
| URL: | https://ceur-ws.org/Vol-4017/#paper_20 |
| Publisher: | RWTH Aachen |
| Place of publication: | Aachen, Germany |
| Publishing Institution: | Hochschule Bonn-Rhein-Sieg |
| Date of first publication: | 2025/08/27 |
| Copyright: | © 2025 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). |
| Funding: | This research has been funded by the AntiScam Project (Defense against communication fraud), funded by BMBF Germany, Grant reference 16KIS2214 |
| Tag: | Explainability; LIME; RoBERTa; Sexism Detection; Transformers; XLM-R |
| Departments, institutes and facilities: | Fachbereich Wirtschaftswissenschaften |
| Institut für Verbraucherinformatik (IVI) | |
| Projects: | AntiScam - Verbundprojekt: Abwehr von Conversational Scams zum Schutz der digitalen Identität von Verbraucher:innen (DE/BMFTR/16KIS2214) |
| Dewey Decimal Classification (DDC): | 0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 005 Computerprogrammierung, Programme, Daten |
| Entry in this database: | 2025/09/05 |
| Licence (German): | Creative Commons - CC BY - Namensnennung 4.0 International |



