CUDA: fix Gemma 2 numerical issues for FA (#9166) #14393
Job | Run time |
---|---|
13m 41s | |
18m 49s | |
33m 54s | |
1h 33m 39s | |
1h 30m 36s | |
1h 38m 10s | |
38m 11s | |
37m 35s | |
16m 15s | |
15m 3s | |
7h 35m 53s |
Job | Run time |
---|---|
13m 41s | |
18m 49s | |
33m 54s | |
1h 33m 39s | |
1h 30m 36s | |
1h 38m 10s | |
38m 11s | |
37m 35s | |
16m 15s | |
15m 3s | |
7h 35m 53s |