cuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X #8800
I can't reproduce #8798 with this change applied. I think it would be nice if we also added a regression test to test-backend-ops, since you could get it to fail that way.
Fix provided by Slaren
I think it would make sense to also adapt the asserts in dmmv.cu, which currently only assert a multiple of GGML_CUDA_DMMV_X.
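For illustration, here is a minimal sketch of the stricter column check the PR title describes. It is not the actual code from dmmv.cu: `DMMV_X` stands in for GGML_CUDA_DMMV_X with an assumed value of 32 (the real value is a build-time option), and `dmmv_cols_supported` is a hypothetical helper name.

```cpp
#include <cstdint>

// Stand-in for GGML_CUDA_DMMV_X. The value 32 is an assumption for this
// sketch; in the real build it is a compile-time option.
constexpr int64_t DMMV_X = 32;

// Hypothetical helper: returns true when the column count satisfies the
// requirement from the PR title, i.e. it is a multiple of 2*DMMV_X
// rather than just a multiple of DMMV_X.
static bool dmmv_cols_supported(int64_t ncols) {
    return ncols % (2 * DMMV_X) == 0;
}
```

Under these assumptions, a row length of 96 passes the old check (a multiple of 32) but fails the new one (not a multiple of 64), which is the kind of shape a regression test would need to cover.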
* cuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X
* update asserts
* only use dmmv for supported types
* add test
May fix #8798