Skip to content

Commit 0972f7d

Browse files
committed
removed extra line breaks
1 parent efb766e commit 0972f7d

File tree

1 file changed

+0
-2
lines changed

1 file changed

+0
-2
lines changed

examples/models/core/llama3_3/deployment-guide-for-trt-llm-llama3.3-70b.md

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,5 @@
11
# Deployment Guide for TensorRT-LLM Llama3.3 70B FP8 and NVFP4
22

3-
##
4-
53
# Introduction
64

75
This deployment guide provides step-by-step instructions for running the Llama 3.3-70B Instruct model using TensorRT-LLM with FP8 and NVFP4 quantization, optimized for NVIDIA GPUs. It covers the complete setup required; from accessing model weights and preparing the software environment to configuring TensorRT-LLM parameters, launching the server, and validating inference output.

0 commit comments

Comments
 (0)