Skip to content

mmmaurer/perspective_argument_retrieval

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Repository for GESIS-DSM at The Perspective Argument Retrieval Shared Task 2024:

This repository contains the code for the perspective argument retrieval shared task contribution from the GESIS data science methods team. Find the associated system description paper here.

The system employs a three-step pipeline to retrieve relevant arguments given a query text and a queried socio-cultural attribute.

Data for reproducing this work

To reproduce our work, download the respective dataset cycle from this repository. Per dataset cycle, it should have the following structure:

  • dataset_cycle_n/
    • baseline_queries/
      • train.jsonl
      • val.jsonl
      • test.jsonl
    • perspective_queries/
      • train.jsonl
      • val.jsonl
      • test.jsonl
    • corpus.jsonl

Given this structure, the path dataset_cycle_n/ can then be used as data path in our implementation to load the corpus and queries from.

Repository structure & reproduction

Find implemented functionality of SBERT-based and tf-idf-based rankers in src/models/rankers.py.
Find a script for generating arguments using quantized Mistral-type models in src/models/prompt_llm.py.

generates_arguments/ contains generated arguments per query for the perspective scenarios for each of the shared tasks' three test sets.

src/notebooks/ contains the following notebooks to reproduce our approach, results and analysis:

data/corpus_de_lm_1_nodummy_nomv.csv: Data for reproducing the OLS regression analysis.

Reference

@inproceedings{maurer-etal-2024-gesis,
    title = "{GESIS}-{DSM} at {P}erpective{A}rg2024: A Matter of Style? Socio-Cultural Differences in Argumentation",
    author = "Maurer, Maximilian  and
      Romberg, Julia  and
      Reuver, Myrthe  and
      Weldekiros, Negash  and
      Lapesa, Gabriella",
    booktitle = "Proceedings of the 11th Workshop on Argument Mining (ArgMining 2024)",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.argmining-1.18",
    pages = "169--181"
}

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published