Add MCQ support to Yourbench evaluation #734

alozowski · 2025-05-16T07:51:08Z

This PR enables evaluation of MCQ-style QA datasets generated by Yourbench, expanding beyond open-ended QA.

Key Changes

New prompt template for MCQs using XML tags.
Custom accuracy logic via custom_metric_compute
Answer parsing handled via extract_content_from_xml_tags helper
Integrated into LightevalTaskConfig as yourbench_mcq
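
For readers unfamiliar with the approach, the parsing and scoring steps listed above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the function names `extract_tag_content` and `mcq_accuracy` and the `answer` tag are hypothetical stand-ins, and the real `extract_content_from_xml_tags` helper and `custom_metric_compute` hook may have different signatures. Note that a later commit in this PR removes the XML prompt format, so the sketch only reflects the initial description.

```python
import re
from typing import Optional


def extract_tag_content(text: str, tag: str) -> Optional[str]:
    """Return the stripped content of the first <tag>...</tag> pair, or None.

    Hypothetical stand-in for the helper named in the PR description.
    """
    pattern = rf"<{re.escape(tag)}>(.*?)</{re.escape(tag)}>"
    match = re.search(pattern, text, re.DOTALL)
    return match.group(1).strip() if match else None


def mcq_accuracy(predictions: list[str], golds: list[str], tag: str = "answer") -> float:
    """Fraction of predictions whose tagged choice matches the gold letter.

    The tag name "answer" is an assumption made for this example only.
    """
    if len(predictions) != len(golds):
        raise ValueError("predictions and golds must have the same length")
    if not golds:
        return 0.0
    correct = 0
    for prediction, gold in zip(predictions, golds):
        choice = extract_tag_content(prediction, tag)
        # A prediction without the expected tag counts as wrong.
        if choice is not None and choice.upper() == gold.strip().upper():
            correct += 1
    return correct / len(golds)
```

In this sketch an untagged response is scored as incorrect rather than raising an error, which keeps one malformed generation from aborting a whole evaluation run.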

HuggingFaceDocBuilderDev · 2025-05-16T07:53:10Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

clefourrier

Overall lgtm

* Add MCQ support to Yourbench evaluation --------- Co-authored-by: Hynek Kydlíček <[email protected]>

Add MCQ support to Yourbench evaluation

9deb402

alozowski requested review from NathanHB and clefourrier May 16, 2025 07:51

Apply Ruff to custom_yourbench_task_mcq.py

df19a81

clefourrier reviewed May 16, 2025

alozowski added 2 commits May 16, 2025 15:11

Update custom_yourbench_task_mcq.py to use ExactMatches

e90c74d

Remove XML prompt format and unused metadata fields

9c0fff6

NathanHB added the task-update label May 19, 2025

Merge remote-tracking branch 'origin/main' into mcq-support-yourbench

c7f4ae9

NathanHB reviewed May 19, 2025

examples/custom_tasks_templates/custom_yourbench_task_mcq.py (resolved)

examples/custom_tasks_templates/custom_yourbench_task_mcq.py (outdated, resolved)

hynky1999 and others added 3 commits May 19, 2025 17:57

fix extractive match to accept (A) (#746)

74df0b1

Update custom_yourbench_task_mcq to use multilingual_extractive_match_metric and instruction

fc82f57

Fix a typo in test_extractive_match.py

3e33242

clefourrier approved these changes May 20, 2025

tests/metrics/test_extractive_match.py (outdated, resolved)

Remove dot in test_extractive_match.py

af31fad

alozowski merged commit 317cb50 into main May 20, 2025
5 checks passed

hynky1999 added a commit that referenced this pull request May 22, 2025

Add MCQ support to Yourbench evaluation (#734)

4c3d414

* Add MCQ support to Yourbench evaluation --------- Co-authored-by: Hynek Kydlíček <[email protected]>

NathanHB pushed a commit that referenced this pull request Sep 19, 2025

Add MCQ support to Yourbench evaluation (#734)

5369eb6

* Add MCQ support to Yourbench evaluation --------- Co-authored-by: Hynek Kydlíček <[email protected]>
