[asynctp] Optimize agmm lastdim via addmm_ #190
Conversation
Force-pushed from 89626c4 to 0ce9c0c
stack-info: PR: #190, branch: IvanKobzarev/stack/7
Force-pushed from 0ce9c0c to 357dd7e
fmassa left a comment:
I don't have all the context on this file yet, but changes LGTM in general.
    outputs[idx] += output_partials[idx]
    out = outputs[idx]
    if first:
        torch.ops.aten.mm.out(shard, B_shards[idx][rank], **kwargs, out=out)
Should we prefer using the torch.mm version instead of the torch.ops.aten.mm version? I'm not sure there is effectively a difference, but maybe for consistency?
Yeah, I think there should not be much difference, we can use torch.mm.
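For reference, a minimal sketch of the two spellings being compared; the tensor names and shapes below are placeholders for illustration, not the actual shard buffers from this file:

```python
import torch

# Placeholder tensors; the real `shard` / `B_shards[idx][rank]` come from the
# decomposed all-gather + matmul, not from this sketch.
shard = torch.randn(4, 8)
b_shard = torch.randn(8, 16)
out = torch.empty(4, 16)

# Both spellings dispatch to the same aten::mm.out kernel and write into `out`:
torch.ops.aten.mm.out(shard, b_shard, out=out)  # current code in the diff
torch.mm(shard, b_shard, out=out)               # suggested equivalent
```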
eellison left a comment:
test?
Oh, yeah, I want to add an e2e test, but on torchtitan/autoparallel with asynctp/bucketing/overlap configs, once the configs land (pytorch/torchtitan#1838).

Yea - thought this was in the pytorch repo at first / more standalone.. less easy here.
Stacked PRs:
[asynctp] Optimize agmm lastdim via addmm_
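As I read the diff above, the optimization in the title replaces the "mm into a per-shard partial, then `outputs[idx] += output_partials[idx]`" pattern with an in-place `addmm_` accumulation for every shard after the first. A rough, self-contained sketch under that assumption (all names and shapes here are illustrative, not the actual kernel code):

```python
import torch

def agmm_lastdim_sketch(A_shards, B_shards):
    """Accumulate sum_i A_shards[i] @ B_shards[i] without per-shard temporaries.

    A_shards: list of [m, k_i] tensors (gathered shards of A along its last dim)
    B_shards: list of [k_i, n] tensors (matching row-shards of B)
    """
    out = torch.empty(
        A_shards[0].shape[0],
        B_shards[0].shape[1],
        dtype=A_shards[0].dtype,
        device=A_shards[0].device,
    )
    for i, (a, b) in enumerate(zip(A_shards, B_shards)):
        if i == 0:
            # First shard: write the product directly into the output buffer.
            torch.mm(a, b, out=out)
        else:
            # Later shards: fused in-place out += a @ b, avoiding a separate
            # partial-product allocation followed by an elementwise add.
            out.addmm_(a, b)
    return out
```

The point of `addmm_` here is that the add and the matmul happen in one in-place kernel call, so no per-shard partial-output buffer has to be allocated and then summed.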