Commit 137413f

[None][doc] Exposing the latest tech blogs in README.md (#6553)
Signed-off-by: Jun Yang <[email protected]>
1 parent ba5bdbb commit 137413f

1 file changed: +7 -0 lines changed

README.md

Lines changed: 7 additions & 0 deletions
@@ -18,6 +18,13 @@ TensorRT-LLM
 <div align="left">

 ## Tech Blogs
+
+* [08/01] Scaling Expert Parallelism in TensorRT-LLM (Part 2: Performance Status and Optimization)
+[➡️ link](./docs/source/blogs/tech_blog/blog8_Scaling_Expert_Parallelism_in_TensorRT-LLM_part2.md)
+
+* [07/26] N-Gram Speculative Decoding in TensorRT‑LLM
+[➡️ link](./docs/source/blogs/tech_blog/blog_7_NGram_performance_Analysis_And_Auto_Enablement.md)
+
 * [06/19] Disaggregated Serving in TensorRT-LLM
 [➡️ link](./docs/source/blogs/tech_blog/blog5_Disaggregated_Serving_in_TensorRT-LLM.md)
