Ajou University repository

Explainable Hate Speech Detection through Masked Rationale Prediction
  • 김지윤
Citations

SCOPUS

0

Citation Export

Advisor
손경아
Affiliation
아주대학교 일반대학원
Department
일반대학원 인공지능학과
Publication Year
2022-08
Publisher
The Graduate School, Ajou University
Keyword
Explainable NLPHate speech detectionRationale
Description
학위논문(석사)--아주대학교 일반대학원 :인공지능학과,2022. 8
Alternative Abstract
Hate speech detection is important in that the spread of hate speech strengthens critical social discrimination against its target social group not only online but also in the real world. We propose Masked Rationale Prediction (MRP) to improve the performance of hate speech detection considering two important aspects—the model bias and explainability. Understanding the context of hate speech is important for hate speech detection. Hate speech cannot be identified based solely on the presence of specific words considered hateful. However, existing models are easily biased on the specific expressions and make wrong detection results. Even though they correctly predict, the model rationale is often not explained in a convincing manner. Thus, to implement a hate speech detection model, bias and explainability should be considered. MRP is a task to predict the masked human rationales—snippets of a sentence that are grounds for human judgment—by referring to surrounding tokens combined with their unmasked rationales. the human rationales are randomly masked and inputted into the model by being combined with each of the tokens. We pre-finetune a pre-trained model on MRP as an intermediate task and then finetune on hate speech detection. As the model learns its reasoning ability based on rationales by MRP, it performs hate speech detection robustly in terms of bias and explainability. The proposed method generally achieves state-of-the-art performance in various metrics, demonstrating its effectiveness for hate speech detection.
Language
eng
URI
https://dspace.ajou.ac.kr/handle/2018.oak/21125
Fulltext

Type
Thesis
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Total Views & Downloads

File Download

  • There are no files associated with this item.