HATE-ITA: Hate Speech Detection in Italian Social Media Text

Debora Nozza • Federico Bianchi • Giuseppe Attanasio

Model description

HATE-ITA is a binary hate speech classification model for Italian social media text.

See the paper for additional details:

Debora Nozza, Federico Bianchi, and Giuseppe Attanasio. 2022. HATE-ITA: New Baselines for Hate Speech Detection in Italian. In Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH), pages 252–260, Seattle, Washington (Hybrid). Association for Computational Linguistics. Link

License

Code comes from HuggingFace and thus our License is an MIT license.

For models restrictions may apply on the data (which are derived from existing datasets) or Twitter (main data source). We refer users to the original licenses accompanying each dataset and Twitter regulations.

Installing

Important: If you want to use CUDA you need to install the correct version of the CUDA systems that matches your distribution, see PyTorch.

Features

from hate_ita.classifier import HateSpeechClassifier
hc = HateSpeechClassifier()

hc.predict(["ti odio", "come si fa a rompere la lavatrice porca puttana"])

>> ["hate", "not-hate"]

Models

We release three models (see the paper for reference).

from hate_ita.classifier import HateSpeechClassifier
hc = HateSpeechClassifier("twitter")

hc = HateSpeechClassifier("base")

hc = HateSpeechClassifier("large")

Reference

If you use this tool please cite the following paper:

@inproceedings{nozza-etal-2022-hate,
    title = "{HATE}-{ITA}: Hate Speech Detection in {I}talian Social Media Text",
    author = "Nozza, Debora  and
      Bianchi, Federico  and
      Attanasio, Giuseppe",
    booktitle = "Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH)",
    month = jul,
    year = "2022",
    address = "Seattle, Washington (Hybrid)",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.woah-1.24",
    doi = "10.18653/v1/2022.woah-1.24",
    pages = "252--260"
}

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

License

GNU GPLv3

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.idea		.idea
hate_ita		hate_ita
tests		tests
.gitignore		.gitignore
AUTHORS.rst		AUTHORS.rst
CONTRIBUTING.rst		CONTRIBUTING.rst
HISTORY.rst		HISTORY.rst
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.rst		README.rst
hateita.png		hateita.png
requirements.txt		requirements.txt
requirements_dev.txt		requirements_dev.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HATE-ITA: Hate Speech Detection in Italian Social Media Text

Model description

License

Installing

Features

Models

Reference

Credits

License

About

Releases

Packages

Contributors 2

Languages

License

MilaNLProc/hate-ita

Folders and files

Latest commit

History

Repository files navigation

HATE-ITA: Hate Speech Detection in Italian Social Media Text

Model description

License

Installing

Features

Models

Reference

Credits

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages