Skip to content

Issues: hiyouga/LLaMA-Factory

🚨FAQs | 常见问题🚨
#4614 opened Jun 28, 2024 by hiyouga
Open
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

resume_from_checkpoint oom killed pending This problem is yet to be addressed
#6486 opened Dec 30, 2024 by sunrise224
1 task done
如何指定只冻结 LLM 进行多模态模型的训练? pending This problem is yet to be addressed
#6484 opened Dec 30, 2024 by Ben81828
1 task done
Newbie Qs: RLHF fine-tuning & dataset pending This problem is yet to be addressed
#6470 opened Dec 28, 2024 by vtharmalingam
webui多卡VLLM推理错误 pending This problem is yet to be addressed
#6468 opened Dec 28, 2024 by CingyQ
1 task done
Please upgrade transformers and deepspeed version in requirements.txt pending This problem is yet to be addressed
#6460 opened Dec 27, 2024 by randydl
ollama无法加载本地模型 pending This problem is yet to be addressed
#6459 opened Dec 27, 2024 by lx687
1 task done
[rank0]: RuntimeError: tensor does not have a device pending This problem is yet to be addressed
#6454 opened Dec 26, 2024 by Juvenilecris
1 task done
华为昇腾NPU支持QLora训练吗? npu This problem is related to NPU devices pending This problem is yet to be addressed
#6452 opened Dec 26, 2024 by sunxiaoyu12
1 task done
DeepSpeed支持yaml配置文件 pending This problem is yet to be addressed
#6445 opened Dec 25, 2024 by randydl
lora微调Mamba-Codestral-7B-v0.1出现了问题 pending This problem is yet to be addressed
#6434 opened Dec 24, 2024 by tongzeliang
1 task done
寒武纪:咱们是否能支持寒武纪? pending This problem is yet to be addressed
#6429 opened Dec 24, 2024 by y149604146
1 task done
Ascend NPU 910B3采用deepspeed引擎训练,Q1:未调用NPU,Q2:NPU健康状态是否影响训练。 npu This problem is related to NPU devices pending This problem is yet to be addressed
#6428 opened Dec 24, 2024 by Lexlum
1 task done
奖励模型能否不是一个model,而是一个自己定义的函数 pending This problem is yet to be addressed
#6423 opened Dec 23, 2024 by cdhx
1 task done
ppo训练相关问题 pending This problem is yet to be addressed
#6419 opened Dec 22, 2024 by ccp123456789
Tokenizer does not derive the newer config pending This problem is yet to be addressed
#6415 opened Dec 21, 2024 by xiaosu-zhu
1 task done
Questions about resuming training form ckpt pending This problem is yet to be addressed
#6414 opened Dec 21, 2024 by Jiawei-Guo
1 task done
Why Speed per iteration slower when dataset is large pending This problem is yet to be addressed
#6410 opened Dec 20, 2024 by coding2debug
1 task done
How to reproduce the paper results? pending This problem is yet to be addressed
#6387 opened Dec 19, 2024 by StiphyJay
1 task done
LLaMA-Factory对话预期之外存在问题 pending This problem is yet to be addressed
#6386 opened Dec 19, 2024 by 3237522375
1 task done
如何把我训练的奖励模型放到ppo的工作管线里 pending This problem is yet to be addressed
#6385 opened Dec 19, 2024 by chcoo
1 task done
LLava Series (7B, 14B) freeze_vision_tower=false bug pending This problem is yet to be addressed
#6376 opened Dec 18, 2024 by xirui-li
1 task done
ProTip! no:milestone will show everything without a milestone.