ggml : remove K_QUANTS_PER_ITERATION macro #9034

ggerganov · 2024-08-15T06:15:41Z

Aways use a value of 2

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

jeroen-mostert · 2024-08-24T13:04:39Z

The docs (which are not updated in this PR, by the way) claim that "setting this value to 1 can improve performance for slow GPUs". Is this no longer true? (It doesn't help that no mention is made which GPUs these are supposed to be, as in, class or generation).

ggml-ci

ggerganov · 2024-08-26T06:55:18Z

Not sure, I just haven't noticed this compile option to be used so I think it is not worth keeping the extra code paths from maintenance PoV

jeroen-mostert · 2024-08-26T14:42:42Z

Eh, I suppose it's easy enough to restore if someone who does see performance gains from it complains. It would also be interesting to know who (or rather what) benefits from tweaking GGML_CUDA_DMMV_X and GGML_CUDA_MMV_Y and how much. On my RX 6800 XT varying these has almost no effect at all, but of course that's only one RDNA2 device.

github-actions bot added Nvidia GPU Issues specific to Nvidia GPUs Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language labels Aug 15, 2024

ggerganov added 2 commits August 26, 2024 09:52

ggml : remove k_quants_per_iteration macro

e48fd74

ggml-ci

docs : remove references

ccb4518

ggerganov force-pushed the gg/remove-k-quants-per-iter branch from 943f851 to ccb4518 Compare August 26, 2024 06:52

github-actions bot added the documentation Improvements or additions to documentation label Aug 26, 2024

ggerganov closed this Nov 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ggml : remove K_QUANTS_PER_ITERATION macro #9034

ggml : remove K_QUANTS_PER_ITERATION macro #9034

ggerganov commented Aug 15, 2024

jeroen-mostert commented Aug 24, 2024

ggerganov commented Aug 26, 2024

jeroen-mostert commented Aug 26, 2024

ggml : remove K_QUANTS_PER_ITERATION macro #9034

ggml : remove K_QUANTS_PER_ITERATION macro #9034

Conversation

ggerganov commented Aug 15, 2024

jeroen-mostert commented Aug 24, 2024

ggerganov commented Aug 26, 2024

jeroen-mostert commented Aug 26, 2024