Skip to content

Commit 28b32bf

Browse files
committed
Automated leaderboard update
1 parent f8dec20 commit 28b32bf

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

Diff for: docs/data_AlpacaEval_2/weighted_alpaca_eval_gpt4_turbo_leaderboard.csv

+1
Original file line numberDiff line numberDiff line change
@@ -33,6 +33,7 @@ CausalLM-14B,11.146160870124424,1391,https://huggingface.co/CausalLM/14B,https:/
3333
OpenChat V3.1 13B,11.08223048948354,1484,https://github.com/imoneoi/openchat,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/openchat-v3.1-13b/model_outputs.json,community
3434
Zephyr 7B Beta,10.99288575525315,1444,https://huggingface.co/HuggingFaceH4/zephyr-7b-beta,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/zephyr-7b-beta/model_outputs.json,community
3535
CUT 13B,10.779089202472866,1637,https://github.com/wwxu21/CUT,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/cut-13b/model_outputs.json,community
36+
OpenHermes-2.5-Mistral (7B),10.340415705751552,1107,https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/OpenHermes-2.5-Mistral-7B/model_outputs.json,verified
3637
Humpback LLaMa2 70B,10.121771502758886,1107,https://arxiv.org/abs/2308.06259,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/humpback-llama2-70b/model_outputs.json,community
3738
Tulu 2+DPO 13B,10.11978838839624,1614,https://huggingface.co/allenai/tulu-2-dpo-13b,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/tulu-2-dpo-13b/model_outputs.json,community
3839
GPT 3.5 Turbo 0301,9.622453295105588,827,,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/gpt-3.5-turbo-0301/model_outputs.json,verified

0 commit comments

Comments
 (0)