This repository contains the competition data, winning solutions and code, and presentations.
Visit the competition portal to learn how the competition process works and view the leaderboard.
This year, we decided to focus on large language models (LLMs). For more information on LLMs, please visit our LLM primer.
We chose to focus on LLMs because they have demonstrated impressive abilities in natural language processing, understanding, and generation (NLP, NLU, and NLG). Trained on massive text datasets, LLMs can generate human-like text and excel at diverse linguistic tasks. Harnessing that potential for I-O Psychology, however, requires rigorous design and evaluation. This year's competition therefore focused on developing best practices for applying LLMs to I-O tasks: competitors were required to develop LLM workflows, using techniques such as prompt engineering, few-shot learning, and fine-tuning, on standardized datasets relevant to I-O Psychology. Our goal was to benchmark how well current LLMs can assist with I-O workflows on public benchmark datasets, with participants reporting reproducible prompts, results, and analyses to advance best practices for thoughtfully eliciting the strengths of LLMs in professional applications.
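To make the expected workflow concrete, the sketch below shows one way to assemble a few-shot prompt for a binary text-classification task and send it to a hosted LLM. It is a minimal illustration only: the openai Python client, the model name, and the labeled examples are assumptions made for the sake of the example and are not part of the competition materials.

```python
# Minimal few-shot prompting sketch. The openai client, model name, and
# example texts/labels below are illustrative assumptions, not competition code.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# A handful of labeled examples supplied as in-context "shots".
FEW_SHOT_EXAMPLES = [
    ("I understand how stressful that deadline must have been for you.", "yes"),
    ("Just get the report done; I don't care how you feel about it.", "no"),
]

def build_messages(text: str) -> list[dict]:
    """Assemble a chat prompt: task instruction, labeled shots, then the new case."""
    messages = [{
        "role": "system",
        "content": (
            "You rate workplace responses. Reply with exactly 'yes' if the "
            "response demonstrates the target behavior, or 'no' if it does not."
        ),
    }]
    for example_text, label in FEW_SHOT_EXAMPLES:
        messages.append({"role": "user", "content": example_text})
        messages.append({"role": "assistant", "content": label})
    messages.append({"role": "user", "content": text})
    return messages

def classify(text: str) -> str:
    """Send the few-shot prompt to the model and return its 'yes'/'no' answer."""
    completion = client.chat.completions.create(
        model="gpt-4o-mini",   # placeholder model name
        messages=build_messages(text),
        temperature=0,         # keep outputs as repeatable as possible
    )
    return completion.choices[0].message.content.strip().lower()

if __name__ == "__main__":
    print(classify("That sounds really difficult; let's work out a plan together."))
```

Fine-tuning would follow the same pattern of standardized inputs and outputs, but with the labeled examples supplied as training data rather than placed in the prompt.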
Benchmark Datasets
- Predicting Empathy: Job candidates were asked to provide empathetic responses to a difficult workplace situation. Your task is to classify whether or not empathy was demonstrated in each simulated response (a simple local evaluation sketch for this task follows the list).
- Generating Interview Responses: Job candidates responded to 5 common interview questions. You will be given the text of 4 question and response pairs. Your task is to generate a likely text response for the 5th question based on the previous responses.
- Rating Item Clarity: Respondents rated the clarity of personality test items using a 7-point scale from 1 = extremely unclear to 7 = extremely clear. Your task is to predict the average clarity rating for each item based on the responses.
- Identifying Fairness Perceptions: Respondents compared two organizational policies and voted on which was fairer. Your task is to identify which policy received the majority vote as the fairer option.
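For checking predictions locally before submission, a small evaluation sketch for the empathy task is shown below. The CSV path, column names, and the use of plain accuracy are assumptions made for illustration; the official scoring procedure is defined by the competition, not by this sketch.

```python
# Local evaluation sketch for the empathy task. The CSV path, column names,
# and accuracy metric are illustrative assumptions, not the official scoring.
import csv

def load_label_pairs(path: str) -> list[tuple[str, str]]:
    """Read (gold, predicted) label pairs from a CSV with hypothetical
    'empathy_label' and 'predicted_label' columns."""
    pairs = []
    with open(path, newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            pairs.append((row["empathy_label"], row["predicted_label"]))
    return pairs

def accuracy(pairs: list[tuple[str, str]]) -> float:
    """Fraction of responses where the predicted label matches the gold label."""
    if not pairs:
        return 0.0
    return sum(gold == pred for gold, pred in pairs) / len(pairs)

if __name__ == "__main__":
    pairs = load_label_pairs("empathy_predictions.csv")
    print(f"Empathy classification accuracy: {accuracy(pairs):.3f}")
```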
Please visit the competition slide deck for an overview of this year's competition and winners.
Winners
- Zihao Jia
- Mina Son
- Philseok Lee
Final score = .666
- Mustafa Akben
- Aaron Satko
Final score = .652
- Jennifer Gibson
- Shane Halder
- Blake Hoffman
- Hannah Johnson
- Joseph Nicolas Luchman
- Nick McCann
- Selena Tran
Final score = .643
- Guglielmo Menchetti (Wonderlic)
- Lea Cleary (Wonderlic)
- Annie Brinza (Wonderlic)
Final score = .630