This repository contains code and data for the following paper:
@inproceedings{choshen-etal-2019-language,
title = "The Language of Legal and Illegal Activity on the {D}arknet",
author = "Choshen, Leshem and
Eldad, Dan and
Hershcovich, Daniel and
Sulem, Elior and
Abend, Omri",
booktitle = "Proceedings of the 57th Conference of the Association for Computational Linguistics",
month = jul,
year = "2019",
address = "Florence, Italy",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/P19-1419",
pages = "4271--4279"
}
csvs
: Onion labels (e.g., legal/illegal) per website

cyber
: code to read and classify documents

ebay
: documents from eBay (product descriptions)

ebay_clean
: documents from eBay (product descriptions), after cleaning

experiments
: AllenNLP configuration files

onion
: documents from Onion (website text), classified by label

onion_clean
: documents from Onion, classified by label, after cleaning

paper
: source code for the paper