Update on the development branch #2503
kaiyux
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
The TensorRT-LLM team is pleased to announce that we have pushed an update to the development branch (and the Triton backend) this Nov 26, 2024.
This update includes:
examples/sdxl/README.md
. Thanks for the contribution from @Zars19 in Support SDXL and its distributed inference #1514.max_num_tokens
dynamic tuning feature, it can be enabled by setting--enable_max_num_tokens_tuning
togptManagerBenchmark
.Thanks,
The TensorRT-LLM Engineering Team
Beta Was this translation helpful? Give feedback.
All reactions