CUDA: revert part of the RDNA1 optimizations (#8309)
The launch_bounds change was causing a small performance drop of 25 t/s in perplexity.