CUDA: fix padding logic for FP16/FP32 (#8884) #14330
Job | Run time |
---|---|
17m 48s | |
19m 28s | |
37m 33s | |
1h 49m 14s | |
1h 49m 24s | |
2h 33m 36s | |
37m 2s | |
42m 43s | |
16m 43s | |
16m 24s | |
9h 19m 55s |
Job | Run time |
---|---|
17m 48s | |
19m 28s | |
37m 33s | |
1h 49m 14s | |
1h 49m 24s | |
2h 33m 36s | |
37m 2s | |
42m 43s | |
16m 43s | |
16m 24s | |
9h 19m 55s |