112 results for “topic:hatespeech”
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.
Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
[DEPRECATED] A browser extension to block likers, retweeters, list members and Twitter ads and share your block lists with others. - say NO to hate speech!
Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.
Deep Learning models to detect hate speech in tweets
A minimum-dependency ECMAScript client library and CLI tool for Parler – a "free speech" social network that accepts real money to buy "influence" points to boost organic non-advertising content
Python code to detect hate speech and classify twitter texts using NLP techniques and Machine Learning
This repository contains Korean Hate Speech dataset for paper, "K-MHaS: A Multi-label Hate Speech Detection Dataset in Korean Online News Comment", accepted by COLING2022.
This repository contains papers and resources pertaining to Hate speech research.
Resources and tools for the Tutorial - "Hate speech detection, mitigation and beyond" presented at ICWSM 2021
This is a python project that is used to identify hate speech in tweets. The dataset used to train the model is available on Kaggle and consists of labelled tweets where 1 indicates hate speech tweets and 0 indicates non-hate speech tweets.
Repository for the paper "Thou shalt not hate: Countering Online Hate Speech" accepted at ICWSM 2019.
Cyber Hate detection And tracking on Social mEdia
Turkish and English Dataset from "Large-Scale Hate Speech Detection with Cross-Domain Transfer"
NLP model that uses Machine Learning to detect offensive tweets, and classify it's target.
Can fear be used for polarisation and spreading negativity? Our paper accepted in The Web conference 2021 tries to explore this question in light of public Whatsapp groups.
Testing and training detection models for emoji-based hate speech.
Repository for the CLiPS HAte speech DEtection System [HADES].
Contains code for a voting classifier that is part of an ensemble learning model for tweet classification (which includes an LSTM, a bayesian model and a proximity model) and a system for weighted voting
Trained Neural Networks (LSTM, HybridCNN/LSTM, PyramidCNN, Transformers, etc.) & comparison for the task of Hate Speech Detection on the OLID Dataset (Tweets).
A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.
KAREN: Unifying Hatespeech Detection and Benchmarking
This is a repository for AfriHate Project
This repository contains the resources used for presentation/discussion in weekly iRE Lab meetings.
[USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns
SemEval 2022 Task 5: Multimedia Automatic Misogyny Identification - baseline models and dataset
Code for replicating results of team 'hateminers' at EVALITA-2018 for AMI task
CounterGeDi is a pipeline that aims at controlling the counter speech generated to make it emotional, polite and detoxified. Paper accepted at IJCAI 2022.
A nlp framework to find hate speech comments out of a comments corpus.
Multilingual Offensive Lexicon consists of the first contextual lexicon for abusive language detection, which is composed of 1,000 explicit and implicit terms and expressions with any pejorative connotation annotated with contextual information