Skip to content

Pull requests: hiyouga/LLaMA-Factory

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Add a loss_mask to control which outputs from the history are involved in the model's loss calculation. pending This problem is yet to be addressed
#6396 opened Dec 19, 2024 by summerwuxia Loading…
2 tasks done
Add PEFT add_weighted_adapter() Function for Merging Multiple Adapters pending This problem is yet to be addressed
#6310 opened Dec 11, 2024 by Dlemonha Loading…
add custom dataset config file as input
#6129 opened Nov 25, 2024 by ex-yanminmin001 Loading…
2 tasks done
Improve error handling for missing image files in _convert_images
#6128 opened Nov 24, 2024 by noahc1510 Loading…
2 tasks done
Set 'torch_device' as 'cpu' when loading pretrained adapter
#5993 opened Nov 11, 2024 by LZHgrla Loading…
2 tasks done
support granite3 models pending This problem is yet to be addressed
#5922 opened Nov 4, 2024 by Tuyohai Loading…
2 tasks done
inital changes into enable openai finetuning
#5606 opened Oct 4, 2024 by danikhan632 Loading…
feat: Long Text Fine-Tuning Support in-progress The related features are in the progress pending This problem is yet to be addressed
#5532 opened Sep 24, 2024 by glide-the Loading…
[Update] loader.py , evaluate will run separate evaluations on each eval_dataset pending This problem is yet to be addressed
#5522 opened Sep 24, 2024 by SrWYG Loading…
Add deepseek-v2.5 template pending This problem is yet to be addressed
#5507 opened Sep 21, 2024 by piamo Loading…
[Draft] Add AutoRound support
#5486 opened Sep 19, 2024 by wenhuach21 Draft
1 of 2 tasks
Flatting Packing / maybe fix #5443 and #5426 pending This problem is yet to be addressed
#5458 opened Sep 17, 2024 by AlongWY Loading…
2 tasks done
Correctly pass gen_kwarg to eval during model runs pending This problem is yet to be addressed
#5451 opened Sep 16, 2024 by aliencaocao Loading…
1 of 2 tasks
[WIP] add florence2 pending This problem is yet to be addressed
#5424 opened Sep 12, 2024 by Sanster Loading…
2 of 3 tasks
Support for glm-4v-9b with mllm_plugin. pending This problem is yet to be addressed
#5343 opened Sep 3, 2024 by marko1616 Loading…
3 tasks done
add dpop training pending This problem is yet to be addressed
#5339 opened Sep 3, 2024 by threestone965 Loading…
2 tasks done
Support push model to ModelScope community pending This problem is yet to be addressed
#5326 opened Sep 2, 2024 by tastelikefeet Loading…
1 of 2 tasks
Load huggingface data with revision pending This problem is yet to be addressed
#5233 opened Aug 21, 2024 by noiji Loading…
2 tasks done
overwrite training_step for CustomDPOTrainer to clear cuda cache every train step pending This problem is yet to be addressed
#5019 opened Jul 30, 2024 by zzc0430 Loading…
2 tasks done
docs: add Japanese README
#4957 opened Jul 24, 2024 by eltociear Loading…
1 task done
Update src\llamafactory\train\sft\metric.py pending This problem is yet to be addressed
#4877 opened Jul 18, 2024 by 01WarpDrive Loading…
1 of 2 tasks
merge easycontext
#4733 opened Jul 9, 2024 by qianhao0713 Loading…
support ollama modelfile export pending This problem is yet to be addressed
#4686 opened Jul 5, 2024 by codemayq Loading…
2 tasks done
Feature/support qwenvl glm4-v phi3-v(conflict resolving) pending This problem is yet to be addressed
#4377 opened Jun 19, 2024 by marko1616 Draft
2 tasks done
Add dataset % sample num equally distribute pending This problem is yet to be addressed
#3976 opened May 30, 2024 by Katehuuh Loading…
1 task done
ProTip! Filter pull requests by the default branch with base:main.