Skip to content

Fix Bugs#101

Merged
danielhanchen merged 729 commits into
mainfrom
nightly
Mar 26, 2025
Merged

Fix Bugs#101
danielhanchen merged 729 commits into
mainfrom
nightly

Conversation

@danielhanchen
Copy link
Copy Markdown
Member

Solves:

  1. Gemma vision requires grad failed
  2. Pixtral & Mistral3 SDPA does NOT work
  3. Qwen Vision inference gibberish
  4. finetune_vision_layers **kwargs
  5. UNSLOTH_DISABLE_AUTO_UPDATES
  6. Fix Mistral V3 Data Collator not working

rolandtannous and others added 29 commits March 24, 2025 15:39
* gemma require gradients fix

* Update peft_utils.py

---------

Co-authored-by: Daniel Han <danielhanchen@gmail.com>
…pointing-warning

Fix gradient checkpointing warning filter implementation
Fix bugs in generate_batches.py.Original output = [] will result in duplication of results.
* Support Iterable Datasets

* Update dataset_utils.py

* Update dataset_utils.py

* Update dataset_utils.py

* Update dataset_utils.py

* Preserve batch size from iterable dataset

* Preserve batch size from iterable dataset

* Support train_on_response_only with IterableDataset

* Support train_on_response_only with IterableDataset

* Support train_on_response_only with IterableDataset

* Support train_on_response_only with IterableDataset
@danielhanchen danielhanchen merged commit 4a66f8b into main Mar 26, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

10 participants