Commit 9367328
Signed-off-by: Yan Chunwei <[email protected]>
Signed-off-by: Ludwig Schneider <[email protected]>
pre-commit changes
Signed-off-by: Ludwig Schneider <[email protected]>
clang formatting
Signed-off-by: Ludwig Schneider <[email protected]>
safe guarding NCCL 2.27 build
Signed-off-by: Ludwig Schneider <[email protected]>
fixing precommit formatting
Signed-off-by: Ludwig Schneider <[email protected]>
most of code rabbit comments
Signed-off-by: Ludwig Schneider <[email protected]>
adding missing semi-colon
Signed-off-by: Ludwig Schneider <[email protected]>
removing unused comment lines
Signed-off-by: Ludwig Schneider <[email protected]>
Clarifying the test on how to compre residual chunked and unchunked.
Signed-off-by: Ludwig Schneider <[email protected]>
fixing pre-commit
Signed-off-by: Ludwig Schneider <[email protected]>
fixing pre-commit
Signed-off-by: Ludwig Schneider <[email protected]>
fixing missing variable, rebase complete and tested
Signed-off-by: Ludwig Schneider <[email protected]>
using a grid stride loop with less blocks launched for large message sizes
Signed-off-by: Ludwig Schneider <[email protected]>
using functioning grid stride loop for NCCL_DEVICE. It helps with better performance at larger message sizes
Signed-off-by: Ludwig Schneider <[email protected]>
initial oneshot implementation
Signed-off-by: Ludwig Schneider <[email protected]>
minor tweaks to include one shot
fixes
Signed-off-by: Ludwig Schneider <[email protected]>
enabling grid stride loop, but no perf benefit.
Signed-off-by: Ludwig Schneider <[email protected]>
addressing review feedback
Signed-off-by: Ludwig Schneider <[email protected]>
fix formatting
Signed-off-by: Ludwig Schneider <[email protected]>
1 parent 56bf9d0 commit 9367328
File tree
16 files changed
+1305
-925
lines changed- cpp/tensorrt_llm
- kernels
- nccl_device
- userbuffers
- plugins/ncclPlugin
- thop
- tensorrt_llm
- _torch/pyexecutor
- llmapi
- tests/unittest/_torch/multi_gpu
16 files changed
+1305
-925
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1442 | 1442 | | |
1443 | 1443 | | |
1444 | 1444 | | |
1445 | | - | |
| 1445 | + | |
1446 | 1446 | | |
1447 | 1447 | | |
1448 | 1448 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
2 | | - | |
| 1 | + | |
| 2 | + | |
3 | 3 | | |
4 | 4 | | |
5 | 5 | | |
6 | 6 | | |
7 | 7 | | |
8 | | - | |
9 | | - | |
10 | | - | |
| 8 | + | |
11 | 9 | | |
12 | 10 | | |
13 | | - | |
14 | | - | |
15 | | - | |
16 | | - | |
17 | | - | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
18 | 15 | | |
19 | 16 | | |
20 | | - | |
21 | | - | |
22 | | - | |
23 | | - | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
24 | 20 | | |
25 | 21 | | |
26 | | - | |
27 | | - | |
28 | | - | |
| 22 | + | |
29 | 23 | | |
30 | 24 | | |
31 | | - | |
32 | | - | |
33 | | - | |
34 | | - | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
0 commit comments