feature(normalize gen_kwargs; add `truncation_side` to vllm)! by baberabb · Pull Request #3509 · EleutherAI/lm-evaluation-harness

baberabb · 2026-01-21T13:11:26Z

Added a utility to normalize gen_kwargs. Normalizes do_sample and temperature to be consistent across models.
Added truncation_side=right|middle|left arg to vllm to use other than left truncation. Will see about adding this to huggingface as well, but slightly non-trivial as HF requires sampling params per batch while vllm can run different per sample.

closes #3505 where there was an inconsistency in sampling params between HF and vllm when both do_sample: false AND a non-zero temperature is specified in a task config. Now normalized to:

Config	Result
Nothing specified	Greedy (temp=0.0)
`temperature: 0.8` (no do_sample)	Sampling (temp=0.8)
`do_sample: false`	Greedy (temp=0.0)
`do_sample: false, temperature: 0.8`	Greedy (temp forced to 0.0)
`do_sample: true, temperature: 0.8`	Sampling (temp=0.8)

# Conflicts: # lm_eval/models/vllm_causallms.py

…on logic

vllm: set temp=0 when do_sample=False

28e0827

baberabb marked this pull request as draft January 21, 2026 13:38

baberabb added 7 commits January 21, 2026 19:05

fix

e651ae1

add gen_kwarg utility

01b738e

vllm standardize gen_kwargs

26914e0

add truncation utility

afb4195

fix. add tests

b3053e8

fix. add tests

04025bf

fix!

4514256

baberabb force-pushed the vllm_do_sample branch from 2961d03 to 4514256 Compare January 23, 2026 15:31

baberabb added 7 commits January 23, 2026 20:42

nit

6c9bdd8

always require do_sample

acb32ff

use gen_kwarg utility in HF

46d4ad8

add defaults

bcb90a4

nits

520edbf

Merge branch 'main' into vllm_do_sample

9acaf27

# Conflicts: # lm_eval/models/vllm_causallms.py

fix: update generation kwargs handling and improve max token extracti…

71ba3b9

…on logic

baberabb marked this pull request as ready for review January 26, 2026 20:03

baberabb added 3 commits January 27, 2026 01:07

Merge branch 'main' into vllm_do_sample

16c25f0

pacify pre-commit

4f3120b

fix

a990fa7

baberabb changed the title ~~fix(vllm)!: set temp=0 when do_sample=False~~ feature(normalize gen_kwargs; add truncation types to vllm)! Jan 26, 2026

baberabb changed the title ~~feature(normalize gen_kwargs; add truncation types to vllm)!~~ feature(normalize gen_kwargs; add truncation_side to vllm)! Jan 26, 2026

baberabb added 5 commits January 27, 2026 21:10

default batch auto

116b752

Merge branch 'main' into vllm_do_sample

19e843a

handle huggingface max_length

8ffc12e

add tests

6d54016

update docs

fa1fb4d

baberabb merged commit 30d4e2e into main Jan 27, 2026
6 checks passed

baberabb deleted the vllm_do_sample branch January 27, 2026 17:55

hmellor mentioned this pull request Feb 10, 2026

Fix call to modify_gen_kwargs in vllm_vlms.py #3573

Merged

baberabb mentioned this pull request Feb 11, 2026

refactor(vllm): inline gen_kwargs normalization to modify_gen_kwargs; fix cached gen_kwargs #3582

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature(normalize gen_kwargs; add `truncation_side` to vllm)!#3509

feature(normalize gen_kwargs; add `truncation_side` to vllm)!#3509
baberabb merged 23 commits intomainfrom
vllm_do_sample

baberabb commented Jan 21, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

baberabb commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

baberabb commented Jan 21, 2026 •

edited

Loading