added gsm_plus by ysjprojects · Pull Request #2103 · EleutherAI/lm-evaluation-harness

ysjprojects · 2024-07-15T09:44:29Z

GSM-Plus Math benchmark
Paper: https://arxiv.org/abs/2402.19255

Strengths:

More updated and more capable version of gsm8k

lintangsutawika · 2024-07-15T09:49:07Z

Thanks!

Are you able to run a sanity check, maybe with models like LLaMA-2-7B and see if the eval results are similar to the paper?

ysjprojects · 2024-07-16T06:44:09Z

Thanks!

Are you able to run a sanity check, maybe with models like LLaMA-2-7B and see if the eval results are similar to the paper?

running llama-2-7b, same as evals on paper

ysjprojects · 2024-08-05T16:02:24Z

UPDATE:

Reverted to original GSM-Plus dataset for attribution
Added GSM-Plus_mini subtask

* added gsm_plus * formatted dataset to have train-test-splits * README.md for gsm-plus * Update README.md * GSM-Plus: added gsm_plus_mini * GSM-Plus: attribution to original dataset * Update README.md * Update README.md * Update README.md --------- Co-authored-by: Lintang Sutawika <lintang@eleuther.ai>

added gsm_plus

1873025

ysjprojects requested review from haileyschoelkopf and lintangsutawika as code owners July 15, 2024 09:44

ysjprojects added 2 commits July 16, 2024 13:08

formatted dataset to have train-test-splits

aa16188

README.md for gsm-plus

4538db6

ysjprojects added 3 commits July 16, 2024 14:45

Update README.md

ec0a4df

GSM-Plus: added gsm_plus_mini

d7245f8

GSM-Plus: attribution to original dataset

eb62dad

lintangsutawika added 2 commits August 5, 2024 12:28

Update README.md

37fa245

Update README.md

b55d128

lintangsutawika approved these changes Aug 5, 2024

View reviewed changes

Update README.md

9cfe4e8

lintangsutawika merged commit d8506db into EleutherAI:main Aug 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added gsm_plus#2103

added gsm_plus#2103
lintangsutawika merged 9 commits intoEleutherAI:mainfrom
ysjprojects:gsm-plus

ysjprojects commented Jul 15, 2024

Uh oh!

lintangsutawika commented Jul 15, 2024

Uh oh!

ysjprojects commented Jul 16, 2024

Uh oh!

ysjprojects commented Aug 5, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ysjprojects commented Jul 15, 2024

Uh oh!

lintangsutawika commented Jul 15, 2024

Uh oh!

ysjprojects commented Jul 16, 2024

Uh oh!

ysjprojects commented Aug 5, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants