What should I do if I want to improve performance on HellaSwag? #2154

mathCrazyy · 2024-12-12T07:31:02Z

I am considering a few options: finding a suitable dataset (for example OpenO1), knowledge distillation (KD) from the 14B model to the 3B model, or LoRA. So far the result is bad:

The KD result only reaches 96.8% of the score of the original Qwen2.5 3B model.
What should I do? Thanks.

joecummings · 2024-12-12T11:48:58Z

Did you fine-tune the 14B model on your desired dataset first? That's an important pre-step to knowledge distillation.
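For context on why the teacher matters: in KD the student is trained to match the teacher's output distribution, so a teacher that was never adapted to the target data has little task-specific signal to pass on. Below is a minimal sketch of one common distillation loss, the forward KL divergence between teacher and student token distributions. It is plain Python for illustration only; the function names and the `temperature` parameter are assumptions made for this example and are not taken from any particular library or recipe.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    log_z = m + math.log(sum(math.exp(x - m) for x in scaled))
    return [x - log_z for x in scaled]


def forward_kl(teacher_logits, student_logits, temperature=1.0):
    """KL(teacher || student) for a single token position.

    Hypothetical helper: the student is penalized wherever the teacher
    puts probability mass that the student does not.
    """
    log_p = log_softmax(teacher_logits, temperature)  # teacher
    log_q = log_softmax(student_logits, temperature)  # student
    return sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))


# The loss is zero only when the student reproduces the teacher's distribution,
# so whatever the teacher predicts on the task data is what the student learns.
teacher = [2.0, 0.5, -1.0]
student = [0.1, 0.2, 0.3]
loss = forward_kl(teacher, student)
```

In practice this term is averaged over token positions and is often combined with the usual cross-entropy loss on the labels; the weighting between the two is a tunable choice.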

mathCrazyy · 2024-12-13T02:38:34Z

Sorry I didn't, I mistakenly thought it was not important.

joecummings · 2024-12-13T12:56:04Z

> Sorry I didn't, I mistakenly thought it was not important.

All good - give that a go and LMK how it works after re-evaluating

joecummings assigned joecummings and lindawangg Dec 12, 2024

joecummings added the discussion label Dec 12, 2024
