Update on the development branch #2254
kaiyux announced in Announcements
Hi,
The TensorRT-LLM team is pleased to announce that we pushed an update to the development branch (and the Triton backend) on Sep 24, 2024.
This update includes:
- examples/deepseek_v1/README.md
- Added .__repr__ methods for class Module, thanks to the contribution from @1ytic in Add module __repr__ methods #2191.
- The max_batch_size of the trtllm-build command is set to 2048.
- Fixed share_embedding for the models that have no lm_head in the legacy checkpoint conversion path, thanks to the contribution from @lkm2835 in Fix check_share_embedding #2232.

Thanks,
The TensorRT-LLM Engineering Team