Skip to content

b3316

Compare
Choose a tag to compare
@github-actions github-actions released this 05 Jul 10:38
0a42380
CUDA: revert part of the RDNA1 optimizations (#8309)

The change on the launch_bounds was causing a small performance drop in perplexity of 25 t/s