
Conversation

@garrett361 (Owner) commented Jun 27, 2025:

This PR adds the ability to train with padding-free collation, which removes all padding from training batches when per_device_train_batch_size > 1. The HF model must have proper support for padding-free training; otherwise, the model outputs will be silently wrong. Llama and Bamba (as of huggingface/transformers#35861) are examples of models with padding-free support.
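For context, padding-free collation concatenates every example in a batch into a single unpadded row and uses position_ids that restart at 0 for each example to mark sequence boundaries. Below is a minimal sketch of the idea; the function name and details are illustrative only, not the PR's actual TensorDataCollatorWithFlattening:

```python
import torch

def flattening_collate(features):
    """Minimal sketch of padding-free collation (illustrative only).

    Every example in the batch is concatenated into a single row of
    shape (1, total_len) with no pad tokens; position_ids restart at 0
    for each example so a model with proper padding-free support can
    recover the sequence boundaries (typically via flash-attention's
    variable-length kernels).
    """
    input_ids, position_ids, labels = [], [], []
    for feature in features:
        ids = list(feature["input_ids"])
        input_ids += ids
        position_ids += list(range(len(ids)))
        # Mask the first label of each example so the loss never
        # crosses a sequence boundary.
        labels += [-100] + ids[1:]
    return {
        "input_ids": torch.tensor([input_ids]),
        "position_ids": torch.tensor([position_ids]),
        "labels": torch.tensor([labels]),
    }
```

A model without this support silently treats the flattened row as one long sequence, which is the failure mode noted above.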

To use padding-free collation, just pass --padding-free True.

| # | PR | Title |
|---|----|-------|
| 1 | #15 | padding-free |
| 2 | #16 | clean_checkpoints_at_end |
| 3 | #17 | final_lr_ratio |
| 4 | #18 | add_seed_and_date_to_run_name |
| 5 | #19 | additional_model_arguments |
| 6 | #20 | sync_each_batch=True grad acc |
| 7 | #21 | no grad acc averaging for sum losses |
| 8 | #22 | extra reporting |
| 9 | #23 | local_main_process_first when building dataset |

```python
from accelerate.logging import get_logger
from accelerate.utils import InitProcessGroupKwargs, set_seed
from huggingface_hub import HfApi
from padding_free_collator import TensorDataCollatorWithFlattening
```
Collaborator commented:

should be open_instruct.padding_free

@garrett361 (Owner, author) replied:

that makes it more robust w/r/t the cwd the script is launched from, I guess?

@garrett361 (Owner, author) replied:

done
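Presumably the committed fix is the package-qualified import below; the exact module path is assumed from the review comment rather than confirmed:

```python
# Assumed form of the fix: package-qualified so the import resolves
# regardless of the working directory the script is launched from.
from open_instruct.padding_free_collator import TensorDataCollatorWithFlattening
```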

@dangxuanhong (Collaborator) left a comment:

Thanks and LGTM

@garrett361 merged commit 872404b into main on Jun 27, 2025 (2 checks passed).
@fabianlim deleted the padding-free-squashing-1 branch June 27, 2025 20:35, then restored it June 27, 2025 20:40.