Update on the development branch #2095
kaiyux
announced in
Announcements
The update link is wrong (skip to pr2053 :D)
-
Hi,
The TensorRT-LLM team is pleased to announce that we pushed an update to the development branch (and the Triton backend) on August 7, 2024.
This update includes:
examples/chatglm/README.md
context_fmha_fp32_acc is moved to runtime for decoder models.
Thanks,
The TensorRT-LLM Engineering Team