From b8b7209f642ee9727064c8afff1ff11ee873705e Mon Sep 17 00:00:00 2001
From: Jun Yang <143764042+juney-nvidia@users.noreply.github.com>
Date: Fri, 1 Aug 2025 16:51:43 +0800
Subject: [PATCH] Exposing the latest tech blogs in README.md
Signed-off-by: Jun Yang <143764042+juney-nvidia@users.noreply.github.com>
---
README.md | 7 +++++++
1 file changed, 7 insertions(+)
diff --git a/README.md b/README.md
index 15449460963..0d89d541284 100644
--- a/README.md
+++ b/README.md
@@ -18,6 +18,13 @@ TensorRT-LLM
## Tech Blogs
+
+* [08/01] Scaling Expert Parallelism in TensorRT-LLM (Part 2: Performance Status and Optimization)
+✨ [➡️ link](./docs/source/blogs/tech_blog/blog8_Scaling_Expert_Parallelism_in_TensorRT-LLM_part2.md)
+
+* [07/26] N-Gram Speculative Decoding in TensorRT‑LLM
+✨ [➡️ link](./docs/source/blogs/tech_blog/blog_7_NGram_performance_Analysis_And_Auto_Enablement.md)
+
* [06/19] Disaggregated Serving in TensorRT-LLM
✨ [➡️ link](./docs/source/blogs/tech_blog/blog5_Disaggregated_Serving_in_TensorRT-LLM.md)