Adds a dummy/random model #220

guipenedo · 2024-07-08T14:20:48Z

Inspired by https://github.com/EleutherAI/lm-evaluation-harness/blob/main/lm_eval/models/dummy.py, this PR adds a random/dummy model that can be used to establish random baselines and to test/debug lighteval.
Can be run with

python run_evals_accelerate.py --model_args dummy ...

DummyModelConfig has a single option: seed. To pass a seed: --model_args dummy,seed=123. The seed is used to randomly generate logprobs

Two notes:

had to change default values for truncated_tokens_count and padded_tokens_count otherwise evaluation_tracker.details_logger.aggregate crashes (https://github.com/huggingface/lighteval/blob/main/src/lighteval/logging/info_loggers.py#L420 and https://github.com/huggingface/lighteval/blob/main/src/lighteval/logging/info_loggers.py#L424)
had to add an actual tokenizer (used gpt2) as even after overwriting tok_encode and tok_decode some other methods would still call the tokenizer directly

NathanHB

Great addition ! The code is very clean nothing to say. Could you add some doc in the README however ? :)

guipenedo added 13 commits July 8, 2024 16:04

added dummy model

c9aa91b

fix no tokenizer issues

40f065b

fix types issue

151dcc0

fix issue with truncated_tokens_count

94dd43a

fix issue with truncated_tokens_count huggingface#2

2a1e427

add seed option

24d7265

fix seed declaration

e9553ce

add seed to modal_sha

311d68b

run ruff

baafc49

ruff again

6feef85

add noqa

efbf6f0

fix style again...

ef946c0

fixed noqa (?)

3d5aeb3

NathanHB reviewed Jul 8, 2024

View reviewed changes

add docs

b0e8048

guipenedo requested a review from NathanHB July 8, 2024 16:04

clefourrier approved these changes Jul 9, 2024

View reviewed changes

clefourrier merged commit 70f7fc6 into huggingface:main Jul 9, 2024

hynky1999 pushed a commit that referenced this pull request May 22, 2025

Adds a dummy/random model for baseline init (#220)

b749ca5

NathanHB pushed a commit that referenced this pull request Sep 19, 2025

Adds a dummy/random model for baseline init (#220)

d41e957

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Adds a dummy/random model #220

Adds a dummy/random model #220

Uh oh!

guipenedo commented Jul 8, 2024

Uh oh!

NathanHB left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Adds a dummy/random model #220

Adds a dummy/random model #220

Uh oh!

Conversation

guipenedo commented Jul 8, 2024

Uh oh!

NathanHB left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants