CPU/CUDA: Gemma 2 FlashAttention support #14759
Job | Run time |
---|---|
2m 27s | |
8m 53s | |
2m 59s | |
1m 59s | |
18m 39s | |
10m 16s | |
10m 10s | |
2m 49s | |
3m 58s | |
2m 40s | |
2m 26s | |
3m 9s | |
15m 56s | |
1m 35s | |
3m 22s | |
6m 40s | |
2m 9s | |
5m 10s | |
1m 45s | |
1m 57s | |
2m 31s | |
8m 27s | |
2m 36s | |
7m 42s | |
6m 5s | |
5m 7s | |
7m 27s | |
50m 32s | |
7m 55s | |
11m 22s | |
48m 16s | |
11m 33s | |
8m 18s | |
13m 48s | |
8m 15s | |
9m 47s | |
8m 30s | |
7m 24s | |
2m 3s | |
2m 24s | |
0s | |
5h 39m 1s |