Skip to content

Conversation

NathanHB
Copy link
Member

@NathanHB NathanHB commented Sep 15, 2025

adds tasks list to readme, shortens intro paragrpah and add example for python API

@NathanHB NathanHB changed the title adds tasks list to readme, shortens intro paragrpah and add example f… Update readme Sep 15, 2025
@HuggingFaceDocBuilderDev
Copy link
Collaborator

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Copy link
Member

@clefourrier clefourrier left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Super neat, some nits and notably a small reorg suggestion in the task intro to put more relevant evals first

- **Reasoning**: MUSR, DROP (discrete reasoning)
- **Long Context**: RULER
- **Dialogue**: MT-Bench
- **Holistic Evaluation**: HELM, BIG-Bench
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

BigBench is also in knowledge

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would change holistic evaluation to "Across the board capabilities testing" or something like this

@NathanHB NathanHB self-assigned this Sep 15, 2025
@NathanHB NathanHB merged commit c409f3f into main Sep 15, 2025
5 of 6 checks passed
NathanHB added a commit that referenced this pull request Sep 19, 2025
adds tasks list to readme, shortens intro paragrpah and add example for python API
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants