Merged
orginal -> original
* Fix DPO, ORPO (#1177)
* Fix TRL
* Update mistral.py
* Patch processing_class
* Update tokenizer_utils.py
* Update tokenizer_utils.py
* Update tokenizer_utils.py
* Update tokenizer_utils.py
* Update tokenizer_utils.py
* Update tokenizer_utils.py
* Installation guide (#1165)
* chore: update chat_templates.py (#1166) orginal -> original
* Disable Flex Attention
* Update tokenizer_utils.py
* Update _utils.py
* n_items
* Update cross_entropy_loss.py
* Fix DPO, ORPO
* Update _utils.py

---------

Co-authored-by: timothelaborie <97834767+timothelaborie@users.noreply.github.com>
Co-authored-by: Ikko Eltociear Ashimine <eltociear@gmail.com>

* Add warning for missing Unpack and KwargsForCausalLM in older Transformers versions

---------

Co-authored-by: Daniel Han <danielhanchen@gmail.com>
Co-authored-by: timothelaborie <97834767+timothelaborie@users.noreply.github.com>
Co-authored-by: Ikko Eltociear Ashimine <eltociear@gmail.com>
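The last commit in the list above adds a warning for older Transformers versions that lack `Unpack` and `KwargsForCausalLM`. A minimal sketch of such a guard, assuming a try/except import probe; the `KwargsForCausalLM` import path varies across Transformers releases and is an assumption here, not Unsloth's actual code:

```python
import warnings

try:
    from typing import Unpack  # available in Python 3.11+
except ImportError:
    try:
        from typing_extensions import Unpack
    except ImportError:
        Unpack = None

try:
    # Hypothetical import path: where KwargsForCausalLM lives differs
    # across Transformers versions, and older releases lack it entirely.
    from transformers.models.llama.modeling_llama import KwargsForCausalLM
except ImportError:
    KwargsForCausalLM = None

if Unpack is None or KwargsForCausalLM is None:
    warnings.warn(
        "Unpack / KwargsForCausalLM not found; you are likely on an older "
        "transformers version, so typed **kwargs support is limited."
    )
```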
* Enhance rotary embedding handling in LlamaAttention and LongRopeRotaryEmbedding
* Typo
* Improve rotary embedding handling in LlamaAttention to prevent errors with short KV cache
* Update llama.py
* Update llama.py

---------

Co-authored-by: Daniel Han <danielhanchen@gmail.com>
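The commits above harden rotary embedding handling against mismatches between the KV cache length and the precomputed cos/sin tables. The log doesn't spell out the exact failure mode, so the sketch below only shows the general defensive pattern of sizing the tables to the actual KV length before indexing, loosely modeled on the Hugging Face Llama code; the class and method names are illustrative, not Unsloth's:

```python
import torch

class RotaryCache:
    def __init__(self, dim: int, base: float = 10000.0, max_len: int = 2048):
        self.dim, self.base = dim, base
        self.max_seq_len_cached = 0
        self._build(max_len)

    def _build(self, seq_len: int):
        # Precompute cos/sin tables for seq_len positions.
        inv_freq = 1.0 / (self.base ** (torch.arange(0, self.dim, 2).float() / self.dim))
        t = torch.arange(seq_len).float()
        freqs = torch.outer(t, inv_freq)
        emb = torch.cat((freqs, freqs), dim=-1)
        self.cos, self.sin = emb.cos(), emb.sin()
        self.max_seq_len_cached = seq_len

    def get(self, kv_seq_len: int):
        # Defensive resize: if the requested length does not fit the cached
        # tables, rebuild them instead of indexing out of range.
        if kv_seq_len > self.max_seq_len_cached:
            self._build(kv_seq_len)
        return self.cos[:kv_seq_len], self.sin[:kv_seq_len]
```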
* Enhance install_python_non_blocking to handle protobuf installation and process management
* Revert "Enhance install_python_non_blocking to handle protobuf installation and process management"

  This reverts commit f09974b.
* Set PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION to 'python' to address issue #1266
* Revert "Set PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION to 'python' to address issue #1266"

  This reverts commit 9fc1307.
* Set PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION to 'python' to address issue #1266
* Update __init__.py

---------

Co-authored-by: Daniel Han <danielhanchen@gmail.com>
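The fix that finally stuck pins protobuf to its pure-Python backend via the `PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION` environment variable, which protobuf reads at import time. The variable therefore has to be set before protobuf (or anything that pulls it in) is first imported, which is why it belongs in `__init__.py`. A minimal sketch:

```python
import os

# Must run before the first `import google.protobuf` anywhere in the
# process, otherwise the C++ backend has already been selected.
os.environ["PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION"] = "python"

import google.protobuf  # now uses the pure-Python implementation
```

The pure-Python backend is slower but sidesteps the binary-compatibility crashes the C++ extension can hit.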
* Update README.md with os.environ in example

  Added os.environ to the example (as sketched below) to avoid device conflicts. For a user in a Jupyter notebook at least, this allows selecting a GPU in a multi-GPU setup. Currently the Unsloth init checks all GPUs and takes the first in order, which can be an issue when some GPUs are in use but still show up in the list. Setting this os config manually avoids that. A small change, but a time saver for those who copy the tutorials directly.
* Update README.md

---------

Co-authored-by: Daniel Han <danielhanchen@gmail.com>
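The commit text doesn't quote the exact variable, but the standard way to pin a process to one GPU is `CUDA_VISIBLE_DEVICES` (an assumption here), set before Unsloth or PyTorch is imported so device discovery only sees the chosen card:

```python
import os

# "0" is an example index: pick the GPU you actually want. This must run
# before importing unsloth/torch, or the full device list is already seen.
os.environ["CUDA_VISIBLE_DEVICES"] = "0"

from unsloth import FastLanguageModel
```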
* Refactor `get_chat_template` to support a system message instead (usage sketched below). This is supposed to fix the Ollama tokenizer chat template.
* Remove type hinting
* Update chat_templates.py

---------

Co-authored-by: Daniel Han <danielhanchen@gmail.com>
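For context, a hedged usage sketch of the refactor: `get_chat_template` comes from `unsloth.chat_templates`, and the new system-message support presumably surfaces as a keyword argument; the `system_message` name is an assumption, not confirmed by the log:

```python
from unsloth import FastLanguageModel
from unsloth.chat_templates import get_chat_template

model, tokenizer = FastLanguageModel.from_pretrained("unsloth/llama-3-8b-bnb-4bit")

tokenizer = get_chat_template(
    tokenizer,
    chat_template = "chatml",
    # Assumed keyword for the new system-message support:
    system_message = "You are a helpful assistant.",
)
```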
* Add patch for SFTTrainer to maintain backward compatibility with TRL changes
* Update trainer.py
* Update trainer.py
* Refactor trainer patch to maintain backward compatibility with TRL changes
* Update trainer.py
* Refactor trainer.py to exclude non-convertible trainers from backward compatibility patch

---------

Co-authored-by: Daniel Han <danielhanchen@gmail.com>
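Newer TRL releases renamed `SFTTrainer`'s `tokenizer` argument to `processing_class`, which appears to be the kind of change this patch papers over. A sketch of such a shim, as an illustration of the technique rather than Unsloth's actual patch:

```python
import inspect
from trl import SFTTrainer

_original_init = SFTTrainer.__init__

def _patched_init(self, *args, **kwargs):
    # If this TRL version dropped the old `tokenizer` keyword, translate
    # it on the fly so older call sites keep working.
    params = inspect.signature(_original_init).parameters
    if "tokenizer" in kwargs and "tokenizer" not in params:
        kwargs["processing_class"] = kwargs.pop("tokenizer")
    _original_init(self, *args, **kwargs)

SFTTrainer.__init__ = _patched_init
```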