Thank you for your wonderful work!

Have you ever experimented with Llama-2-7B as the base model for C-RLFT, and how did it perform? OpenChat-3.5-0106 is based on Mistral and its performance is really high, but when I tried Llama-2-7B the results were not satisfactory.

Two more questions: can a chat model be used as the base model for C-RLFT? I assume some code changes would be needed, e.g., for the chat template (see the sketch below for what I have in mind).

And what about Llama-3-8B-Instruct: is there an easy way to train it, and do you have any performance data?

Thanks in advance.
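Here is a minimal sketch of what I mean by reusing a chat model's own template, written with Hugging Face transformers. The model name, the message format, and the per-example loss weight are just my assumptions for illustration, not OpenChat's actual training code:

```python
# Minimal sketch (not OpenChat's actual pipeline): format a C-RLFT-style
# example with the chat model's built-in template instead of the
# GPT4 Correct template that the OpenChat models ship with.
# The model name and the loss weight below are illustrative assumptions.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")

example = {
    "messages": [
        {"role": "user", "content": "Explain what C-RLFT does."},
        {"role": "assistant", "content": "It fine-tunes on mixed-quality data ..."},
    ],
    # C-RLFT conditions on coarse data-quality labels; value is illustrative.
    "weight": 1.0,
}

# Tokenize the conversation with the chat model's own template.
input_ids = tokenizer.apply_chat_template(
    example["messages"],
    add_generation_prompt=False,
    return_tensors="pt",
)
print(input_ids.shape, "loss weight:", example["weight"])
```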