Commit 0050f49

nv-guomingz authored and dominicshanshan committed

[None][doc] add blackwell information into support matrix (NVIDIA#6740)

Signed-off-by: nv-guomingz <[email protected]>
Signed-off-by: Wangshanshan <[email protected]>

1 parent: 4c5faa5

File tree: 2 files changed, +5 additions, −2 deletions

docs/source/legacy/reference/support-matrix.md (1 addition, 0 deletions)

```diff
@@ -157,6 +157,7 @@ The following table shows the supported software for TensorRT-LLM.
   - [10.11](https://docs.nvidia.com/deeplearning/tensorrt/release-notes/index.html)
 * - Precision
   -
+  - Blackwell (SM100/SM120) - FP32, FP16, BF16, FP8, FP4, INT8, INT4
   - Hopper (SM90) - FP32, FP16, BF16, FP8, INT8, INT4
   - Ada Lovelace (SM89) - FP32, FP16, BF16, FP8, INT8, INT4
   - Ampere (SM80, SM86) - FP32, FP16, BF16, INT8, INT4[^smgte89]
```
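As a rough illustration of the precision rows added above, the following minimal Python sketch (a hypothetical helper, not part of the TensorRT-LLM API) encodes the support matrix as a lookup from CUDA compute capability to the precisions the table lists:

```python
# Hypothetical helper (illustration only, not TensorRT-LLM code): maps a GPU
# compute capability (major, minor) to the precisions listed in the support
# matrix above. Blackwell gains FP4 over Hopper/Ada; Ampere lacks FP8 and FP4.
SUPPORTED_PRECISIONS = {
    (10, 0): {"FP32", "FP16", "BF16", "FP8", "FP4", "INT8", "INT4"},  # Blackwell SM100
    (12, 0): {"FP32", "FP16", "BF16", "FP8", "FP4", "INT8", "INT4"},  # Blackwell SM120
    (9, 0):  {"FP32", "FP16", "BF16", "FP8", "INT8", "INT4"},         # Hopper SM90
    (8, 9):  {"FP32", "FP16", "BF16", "FP8", "INT8", "INT4"},         # Ada Lovelace SM89
    (8, 0):  {"FP32", "FP16", "BF16", "INT8", "INT4"},                # Ampere SM80
    (8, 6):  {"FP32", "FP16", "BF16", "INT8", "INT4"},                # Ampere SM86
}

def supports(cc: tuple, precision: str) -> bool:
    """Return True if the matrix lists `precision` for compute capability `cc`."""
    return precision in SUPPORTED_PRECISIONS.get(cc, set())
```

For example, `supports((10, 0), "FP4")` is true while `supports((9, 0), "FP4")` is false, matching the new Blackwell row.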

docs/source/overview.md (4 additions, 2 deletions)

```diff
@@ -25,8 +25,10 @@ TensorRT LLM delivers breakthrough performance on the latest NVIDIA GPUs:
 
 TensorRT LLM supports the latest and most popular LLM architectures:
 
-- **Language Models**: GPT-OSS, Deepseek-R1/V3, Llama 3/4, Qwen2/3, Gemma 3, Phi 4...
-- **Multi-modal Models**: LLaVA-NeXT, Qwen2-VL, VILA, Llama 3.2 Vision...
+### FP4 Support
+[NVIDIA B200 GPUs](https://www.nvidia.com/en-us/data-center/dgx-b200/), when used with TensorRT-LLM, enable seamless loading of model weights in the new [FP4 format](https://developer.nvidia.com/blog/introducing-nvfp4-for-efficient-and-accurate-low-precision-inference/#what_is_nvfp4), allowing you to automatically leverage optimized FP4 kernels for efficient and accurate low-precision inference.
+
+### FP8 Support
 
 TensorRT LLM strives to support the most popular models on **Day 0**.
```
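The FP4 format referenced in the new overview text is the 4-bit E2M1 floating-point element format used by NVFP4, whose representable magnitudes are 0, 0.5, 1, 1.5, 2, 3, 4, and 6. As a purely educational sketch (not TensorRT-LLM code; real NVFP4 kernels additionally apply per-block scale factors before rounding), round-to-nearest quantization onto the E2M1 grid can be written as:

```python
# Educational sketch of FP4 (E2M1) round-to-nearest quantization.
# E2M1 has 1 sign bit, 2 exponent bits, 1 mantissa bit; its non-negative
# representable magnitudes are the eight values below (max magnitude 6).
E2M1_VALUES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]

def quantize_fp4(x: float) -> float:
    """Round x to the nearest representable E2M1 value, preserving sign."""
    magnitude = min(E2M1_VALUES, key=lambda v: abs(abs(x) - v))
    return magnitude if x >= 0 else -magnitude
```

For example, 0.7 rounds to 0.5 and values beyond the format's range saturate to ±6, which is why per-block scaling is essential in practice to keep weights inside the representable grid.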
