Clarify default MMQ for CUDA and LLAMA_CUDA_FORCE_MMQ flag (#8115) #14133
Job | Run time |
---|---|
3m 30s | |
2m 29s | |
3m 35s | |
1m 21s | |
1m 19s | |
4m 47s | |
2m 39s | |
1s | |
3m 20s | |
18s | |
23m 19s |
Job | Run time |
---|---|
3m 30s | |
2m 29s | |
3m 35s | |
1m 21s | |
1m 19s | |
4m 47s | |
2m 39s | |
1s | |
3m 20s | |
18s | |
23m 19s |