fix thread-reduce performance regression #4279
Annotations
4 notices
Build workflow
Merging consumers for duplicate producer '[C++20 GCC13] Build(amd64)' in 'CUB CTK12.6 nvcc GCC'
Build workflow
Original consumers: [C++20 GCC13] HostLaunch(amd64, V100), [C++20 GCC13] TestGPU(amd64, V100)
Build workflow
Duplicate consumers: [C++20 GCC13] DeviceLaunch(amd64, V100), [C++20 GCC13] GraphCapture(amd64, V100)
Build workflow
Merged consumers: [C++20 GCC13] HostLaunch(amd64, V100), [C++20 GCC13] TestGPU(amd64, V100), [C++20 GCC13] DeviceLaunch(amd64, V100), [C++20 GCC13] GraphCapture(amd64, V100)