Skip to content

Conversation

clefourrier
Copy link
Member

@clefourrier clefourrier commented Sep 1, 2025

WIP: still need to

  • cherry pick ifeval files and rebase on main
  • test results against papers

@clefourrier clefourrier marked this pull request as draft September 1, 2025 08:25
@HuggingFaceDocBuilderDev
Copy link
Collaborator

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@NathanHB NathanHB marked this pull request as ready for review September 15, 2025 12:35
@NathanHB NathanHB merged commit 3a71f68 into main Sep 16, 2025
5 checks passed
@clefourrier clefourrier mentioned this pull request Sep 18, 2025
NathanHB pushed a commit that referenced this pull request Sep 19, 2025
* init, wip

* unrelated but these tasks were buggy

* better suite management: we don't load all optional deps all the time

* upgrade

* singleton + transformer sampling fix in config

* incredible how much code was just pulled from ifeval

* fix test 1

* fix test 2

* fix tests part 1 - also removes fewshot truncation in the task name because it's no longer used anywhere in the code logically

* fix registry mockup

* fixed last tests
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants