Pull requests: hiyouga/LLaMA-Factory
Pull requests list
Add a loss_mask to control which outputs from the history are involved in the model's loss calculation.
pending
This problem is yet to be addressed
#6396
opened Dec 19, 2024 by
summerwuxia
2 tasks done
Add PEFT add_weighted_adapter() Function for Merging Multiple Adapters
pending
This problem is yet to be addressed
#6310
opened Dec 11, 2024 by
Dlemonha
add custom dataset config file as input
#6129
opened Nov 25, 2024 by
ex-yanminmin001
2 tasks done
Improve error handling for missing image files in _convert_images
#6128
opened Nov 24, 2024 by
noahc1510
2 tasks done
Set 'torch_device' as 'cpu' when loading pretrained adapter
#5993
opened Nov 11, 2024 by
LZHgrla
2 tasks done
support granite3 models
pending
This problem is yet to be addressed
#5922
opened Nov 4, 2024 by
Tuyohai
2 tasks done
feat: Long Text Fine-Tuning Support
in-progress
The related features are in the progress
pending
This problem is yet to be addressed
#5532
opened Sep 24, 2024 by
glide-the
[Update] loader.py , evaluate will run separate evaluations on each eval_dataset
pending
This problem is yet to be addressed
#5522
opened Sep 24, 2024 by
SrWYG
Add deepseek-v2.5 template
pending
This problem is yet to be addressed
#5507
opened Sep 21, 2024 by
piamo
Flatting Packing / maybe fix #5443 and #5426
pending
This problem is yet to be addressed
#5458
opened Sep 17, 2024 by
AlongWY
2 tasks done
Correctly pass gen_kwarg to eval during model runs
pending
This problem is yet to be addressed
#5451
opened Sep 16, 2024 by
aliencaocao
1 of 2 tasks
[WIP] add florence2
pending
This problem is yet to be addressed
#5424
opened Sep 12, 2024 by
Sanster
2 of 3 tasks
Support for glm-4v-9b with mllm_plugin.
pending
This problem is yet to be addressed
#5343
opened Sep 3, 2024 by
marko1616
3 tasks done
add dpop training
pending
This problem is yet to be addressed
#5339
opened Sep 3, 2024 by
threestone965
2 tasks done
Support push model to ModelScope community
pending
This problem is yet to be addressed
#5326
opened Sep 2, 2024 by
tastelikefeet
1 of 2 tasks
Load huggingface data with revision
pending
This problem is yet to be addressed
#5233
opened Aug 21, 2024 by
noiji
2 tasks done
overwrite training_step for CustomDPOTrainer to clear cuda cache every train step
pending
This problem is yet to be addressed
#5019
opened Jul 30, 2024 by
zzc0430
2 tasks done
Update src\llamafactory\train\sft\metric.py
pending
This problem is yet to be addressed
#4877
opened Jul 18, 2024 by
01WarpDrive
1 of 2 tasks
support ollama modelfile export
pending
This problem is yet to be addressed
#4686
opened Jul 5, 2024 by
codemayq
2 tasks done
Add dataset % sample num equally distribute
pending
This problem is yet to be addressed
#3976
opened May 30, 2024 by
Katehuuh
1 task done