Add MiniChat-1.5-3B to AlpacaEval and Fix MiniChat-3B #176

GeneZC · 2023-11-26T03:31:04Z

I have not idea why these two outputs are surprisingly different. So we here update the results, and the results seem to be more reasonable now.

We are very sorry for the misunderstanding, and we kindly suggest a highlight on the difference between above two outputs in the README.

BTW, we have added MiniChat-1.5-3B to AlpacaEval, which is incorporated with NEFT and DPO to yield much better performance.

YannDubs

Thanks @GeneZC, I'll clarify which outputs to use to avoid this issue in the future!
congrats for the new results :)

GeneZC added 7 commits November 26, 2023 00:10

Add files via upload

8456041

Update alpaca_eval_gpt4_leaderboard.csv

13fe720

Create configs.yaml

3c82cd0

Update configs.yaml

60b3cd3

Create model_outputs.json

7ef3788

Update results of minichat-1.5-3b

a88525a

Update alpaca_eval_gpt4_leaderboard.csv

a21fc76

rtaori requested a review from YannDubs November 26, 2023 07:14

YannDubs approved these changes Nov 26, 2023

View reviewed changes

YannDubs merged commit b226e30 into tatsu-lab:main Nov 26, 2023
2 checks passed

Provide feedback