Skip to content

Pull requests: EleutherAI/lm-evaluation-harness

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Yaml crowspairs tasks
#2488 opened Nov 14, 2024 by NAM00 Loading…
Biology ds
#2486 opened Nov 13, 2024 by deema-A Loading…
MILU dataset from AI4Bharat for Indic LLM eval
#2482 opened Nov 12, 2024 by abhinand5 Loading…
release kbl-v0.1
#2476 opened Nov 10, 2024 by whwang299 Loading…
Update citation
#2474 opened Nov 8, 2024 by Sypherd Loading…
Use global filter alias
#2473 opened Nov 8, 2024 by Sypherd Loading…
IBM watsonx_llm fixes & refactor
#2464 opened Nov 7, 2024 by Medokins Loading…
Score tasks
#2452 opened Nov 4, 2024 by rimashahbazyan Loading…
allow fewshots for multimodal tasks
#2450 opened Nov 1, 2024 by artemorloff Loading…
Add Aggregation for Kobest Benchmark
#2446 opened Oct 31, 2024 by tryumanshow Loading…
fix tmlu tmlu_taiwan_specific_tasks tag
#2420 opened Oct 22, 2024 by nike00811 Loading…
Add YandexGPT API
#2419 opened Oct 21, 2024 by almasgarriev Loading…
Fix Type Hints for vLLM CausalLM model
#2408 opened Oct 18, 2024 by qthequartermasterman Loading…
Update citation links to Zenodo and DOI to 0.4.5
#2391 opened Oct 9, 2024 by LSinev Loading…
add Russian mmlu
#2378 opened Oct 3, 2024 by tatiana-iazykova Loading…
Add the BlueBench benchmark
#2369 opened Oct 1, 2024 by shachardon Loading…
Remove unnecessary space prefix
#2368 opened Oct 1, 2024 by eldarkurtic Loading…
MMLU Pro Plus
#2366 opened Sep 30, 2024 by asgsaeid Loading…
fix cost_estimate script
#2359 opened Sep 26, 2024 by baberabb Draft
Add metabench task to LM Evaluation Harness
#2357 opened Sep 26, 2024 by kozzy97 Loading…
Support pipeline parallel with OpenVINO models
#2349 opened Sep 25, 2024 by sstrehlk Loading…
Mathvista
#2321 opened Sep 18, 2024 by baberabb Draft
mmlu translated professionally by OpenAI
#2312 opened Sep 17, 2024 by giuliolovisotto Loading…
Scrolls branch
#2309 opened Sep 16, 2024 by blitzionic Loading…
ProTip! What’s not been updated in a month: updated:<2024-10-14.