Norm Violation Detection

This is the repo for norm violation detection on Reddit.

Data (Skip this if you're just doing inference)

Decompress the data file

tar -xzf data.tar.gz

Checkpoint (Skip this if you're training the model from scratch)

The BERT-LSTM model checkpoint is accessible from this Google Drive link. Download the model to ckps/clf/bert-base-uncased/1/seed=2022.

The T5 model checkpoint is accessible from this Google Drive link. Download the model to ckps/prompt/t5-base/1/seed=2022.

Using the API

Instantiate the API:

Using BERT

python api-inference.py --task=clf --model_name=bert-base-uncased

Using T5

python api-inference.py --task=prompt --mdoel_name=t5-base

If GPUs are available, specify the GPU(s) to use:

python api-inference.py --task=prompt --model_name=t5-base --gpu=0

Then prepare the query data as in api-test-data.json.

Calling the API:

curl -X POST -H 'Content-Type: application/json' -d '@api-test-data.json' http://localhost:5000/api

The following is an example of the API. In the input file api-test-data.json there are three conversations, along with there subreddits and specified rules. The model outputs the confidence scores that the last comments from the conversations violate their corresponding rules.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
ckps		ckps
results		results
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
api-example.png		api-example.png
api-inference-test.py		api-inference-test.py
api-inference.py		api-inference.py
api-test-data.json		api-test-data.json
data.tar.gz		data.tar.gz
dataset.py		dataset.py
dataset_inference.py		dataset_inference.py
eval.py		eval.py
evaluator.py		evaluator.py
evaluator_prompt.py		evaluator_prompt.py
models.py		models.py
requirements.txt		requirements.txt
run_eval.py		run_eval.py
run_train.py		run_train.py
train.py		train.py
trainer.py		trainer.py
trainer_prompt.py		trainer_prompt.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Norm Violation Detection

Data (Skip this if you're just doing inference)

Checkpoint (Skip this if you're training the model from scratch)

Using the API

About

Releases

Packages

Languages

isi-nlp/norm_vio_detection

Folders and files

Latest commit

History

Repository files navigation

Norm Violation Detection

Data (Skip this if you're just doing inference)

Checkpoint (Skip this if you're training the model from scratch)

Using the API

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages