@tshmilnvidia tshmilnvidia commented Jun 25, 2025

NIXL support for GDS

Commits 1 and 2 ("BlockManager copy constructor fix" and "Pass mode & directory") are fixes that I believe need to land in TRT-LLM first; they are not related to NIXL.

Commit 3 ("GDS Debug") can be ignored for review.

Commits 4 and 5 ("Nixl Loopback Agent" and "copyBlock with NixlLoopbackAgent") are the relevant commits for GDS with NIXL.

@tshmilnvidia tshmilnvidia marked this pull request as draft June 25, 2025 14:46
@tshmilnvidia tshmilnvidia force-pushed the nixl_agent branch 3 times, most recently from 6dc47e6 to 74b88cf Compare June 29, 2025 08:41
@tshmilnvidia tshmilnvidia force-pushed the nixl_agent branch 4 times, most recently from 7210604 to 96a28f6 Compare July 6, 2025 11:55
@tshmilnvidia tshmilnvidia force-pushed the nixl_agent branch 5 times, most recently from 57f2533 to b7dbd4b Compare July 16, 2025 10:11
@glevnv glevnv force-pushed the nixl_agent branch 8 times, most recently from 271fd4c to 4237fc5 Compare August 19, 2025 09:34
leslie-fang25 and others added 13 commits September 4, 2025 07:31
* Why?

Initial profiling showed there were multiple D2H / H2D copies being
scheduled in the Mistral Small 3.1 model.

* What?

This commit removes those unnecessary copies by returning `image_sizes`
as a simple list instead of a tensor.

Signed-off-by: William Zhang <[email protected]>
@tshmilnvidia tshmilnvidia force-pushed the nixl_agent branch 2 times, most recently from 351b864 to 507570f Compare September 4, 2025 10:34
@tshmilnvidia tshmilnvidia force-pushed the nixl_agent branch 2 times, most recently from becd2a4 to 6b92b32 Compare September 8, 2025 08:33
tshmilnvidia and others added 6 commits September 8, 2025 01:44
Implement class LoopbackAgent.

Signed-off-by: Tomer Shmilovich <[email protected]>