[simple_fsdp] Turn on bucketing by default #2103
base: gh/IvanKobzarev/6/base
Conversation
tianyu-l left a comment:
what's the behavior before vs. after? I thought the code you are modifying already does bucketing.
Before this change, bucketing was not enabled: the collective bucketing config was not applied when `schedule_overlap_bucketing` is called manually.
```diff
     gm: torch.fx.GraphModule, example_inputs: Any
 ) -> torch.fx.GraphModule:
-    schedule_overlap_bucketing(gm)
+    schedule_overlap_bucketing(gm, collective_bucketing=True)
```
The `collective_bucketing` and `insert_overlap_deps` configs are turned on in #1965. Could you confirm which is the correct way to enable this pass?
And probably remove the unused configs
Sorry, it's a bit confusing because we had some internal usage that didn't want the pass to depend on inductor configs. Today, those inductor configs are only used in the inductor post grad application.
Potentially we could have a `schedule_overlap_bucketing` and a `schedule_overlap_bucketing_from_configs`, where the latter reads the inductor configs. I'm not sure; open to ideas here.
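A minimal sketch of that proposed split, assuming a simple keyword-argument core pass; the signature shown here and the config key names are illustrative assumptions, not the real PyTorch API:

```python
def schedule_overlap_bucketing(gm, *, collective_bucketing=False,
                               insert_overlap_deps=False):
    # Core pass: behavior is driven only by explicit arguments,
    # never by global inductor configs (so internal callers that
    # don't want an inductor dependency can use it directly).
    applied = []
    if collective_bucketing:
        applied.append("collective_bucketing")
    if insert_overlap_deps:
        applied.append("insert_overlap_deps")
    return applied


def schedule_overlap_bucketing_from_configs(gm, configs):
    # Thin wrapper: reads a (hypothetical) config mapping and
    # forwards the values as explicit arguments to the core pass.
    return schedule_overlap_bucketing(
        gm,
        collective_bucketing=configs.get("collective_bucketing", False),
        insert_overlap_deps=configs.get("insert_overlap_deps", False),
    )
```

The point of the split is that only the `_from_configs` entry point knows about inductor configs; manual callers keep full, explicit control.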
Oh I see. Then probably we can use this PR's config to enable the aten-level `aot_eager_autobucketing_reordering_pass`, and the inductor config to enable inductor post grad passes in `inductor_autobucketing_reordering_pass`. 🤔
Sorry, I didn't fully get it. Does it mean we can remove some code for the aot_eager / inductor option in this PR? Do we have to use multiple toggles for one thing? E.g., I see the following for aot_eager:

`dist_opts.collective_bucketing = True`

But I didn't see any special inductor configs for bucketing.
Yes, I mean @IvanKobzarev needs to update the code so that `dist_opts` are only passed to the inductor scheduling pass entry point before he merges the PR.
Could we add some comments on what each step is doing, for better readability?
I will add a function in PyTorch that schedules this from inductor configs. I think that will be clearest.
We can now just call `schedule_overlap_bucketing_from_inductor_configs` and use the configs.
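With that entry point, the compiler hook in this PR could shrink to a thin delegation. A sketch, with `schedule_overlap_bucketing_from_inductor_configs` stubbed out here for illustration (in PyTorch it would read the inductor config knobs itself):

```python
def schedule_overlap_bucketing_from_inductor_configs(gm):
    # Stub standing in for the PyTorch function discussed above:
    # the real one would read inductor configs (e.g. collective
    # bucketing toggles) and run the scheduling pass on gm.
    return gm


def aot_eager_autobucketing_reordering_pass(gm, example_inputs):
    # The hook no longer passes flags like collective_bucketing=True
    # explicitly; the config-reading entry point owns all option
    # handling, so the hook stays a one-liner.
    return schedule_overlap_bucketing_from_inductor_configs(gm)
```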
Stack from ghstack (oldest at bottom):