Skip to content

Conversation

lewtun
Copy link
Member

@lewtun lewtun commented Apr 8, 2025

Recent papers like SimpleRL-Zoo and VAPO have adopted n=32 as the default estimate for AIME24.

This PR bumps our default to the same value so we align with what others report. See #661 for more details on the variance across n values.

Recent papers like [SimpleRL-Zoo](https://arxiv.org/pdf/2503.18892) and [VAPO](https://arxiv.org/pdf/2504.05118) have adopted `n=32` as the default estimate for AIME24. 

This PR bumps our default to the same value so we align with what others report.
@HuggingFaceDocBuilderDev
Copy link
Collaborator

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@lewtun lewtun merged commit bb14995 into main Apr 8, 2025
5 checks passed
@lewtun lewtun deleted the lewtun-patch-2 branch April 8, 2025 14:37
hynky1999 pushed a commit that referenced this pull request May 22, 2025
Recent papers like [SimpleRL-Zoo](https://arxiv.org/pdf/2503.18892) and [VAPO](https://arxiv.org/pdf/2504.05118) have adopted `n=32` as the default estimate for AIME24. 

This PR bumps our default to the same value so we align with what others report.
NathanHB pushed a commit that referenced this pull request Sep 19, 2025
Recent papers like [SimpleRL-Zoo](https://arxiv.org/pdf/2503.18892) and [VAPO](https://arxiv.org/pdf/2504.05118) have adopted `n=32` as the default estimate for AIME24. 

This PR bumps our default to the same value so we align with what others report.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants