Hate Speech detection is a critical area of research aimed at identifying harmful and offensive content.
However, the task is hindered by challenges related to perspectivism and bias, which compromise the efficacy and fairness of detection systems, especially in a real-case scenario.
Perspectivism frequently emerges due to the subjective interpretation of what constitutes hate speech, which is influenced by cultural, linguistic, and contextual factors.
On the other hand, Bias is frequently associated with a skewed distribution of specific elements in the datasets that will be used for training hate speech detection models.
The thesis, focusing both on unimodal (text) and multimodal (memes) user-generated content, initially deals with perspectivism to model disagreement at different granularities, i.e., at constituent and instance levels. Subsequently, bias identification and mitigation are tackled by novel metrics and debiasing strategies.
By addressing these challenges, this work aims to improve the reliability and fairness of hate speech detection systems, paving the way for more equitable content moderation strategies, able to capture multiple perspectives and interpretations.
© 2001-2026 Fundación Dialnet · Todos los derechos reservados