Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add llama3 405b attention shapes. (#29)
This PR adds the llama3 405b attention shapes that we see in the sharktank export (https://gist.github.com/KyleHerndon/a9c60ce93264d6ba7ec9e878c879f218). We make sure the dynamic sequence length is always a multiple of 16
- Loading branch information