Other research product . 2020

Multiclass classification of hate speech using machine learning

Rasmussen, Emma; Hinnerskov, Joakim Hey; Sejsbo, Ask Harup; Kinch, Gustav Weber
Open Access
Published: 01 Jan 2020
Country: Denmark

This paper describes the development of an LSTM multiclass classifier, built with Keras as the framework and CRISP-DM as the project process, for classifying natural language into varying degrees of toxicity. The model takes an existing toxic comment classification challenge as its starting point, and a first iteration is engineered towards the requirements of that challenge. In this first iteration, several measures are taken to avoid common pitfalls of neural networks. The model is then held up against principles of freedom of speech, including the Harm Principle and the Offence Principle by John Stuart Mill and Joel Feinberg, respectively. After evaluating the model's performance in light of these principles, a second iteration is constructed with some design changes. For reasons related, among other things, to the dataset, this second iteration is less successful. The paper concludes that it is possible to build a good multiclass classification tool for shallow NLP problems, but that the tool becomes less effective in later iterations as we try to apply it to more concrete purposes.


Natural Language Processing, LSTM, Hate speech, Machine learning, Recurrent neural networks, Toxic comment classification

Related to Research communities
Digital Humanities and Cultural Heritage