Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ggml : fix odd blocks for ARM_NEON #8556

Merged
merged 10 commits into from
Jul 19, 2024

Conversation

ggerganov
Copy link
Owner

@ggerganov ggerganov commented Jul 18, 2024

target #8549


@ggerganov ggerganov changed the title ggml : fix q4_1 ggml : fix odd blocks for ARM_NEON Jul 18, 2024
@github-actions github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Jul 18, 2024
ggml/src/ggml-quants.c Show resolved Hide resolved
ggml/src/ggml-quants.c Show resolved Hide resolved
@ggerganov ggerganov force-pushed the gg/fix-odd-blocks-arm branch from 3a515b8 to 974410a Compare July 18, 2024 12:00
@slaren slaren force-pushed the sl/fix-iqnl-odd-blocks branch from 90e8f81 to cc6a0f5 Compare July 18, 2024 20:11
ggml/src/ggml-quants.c Outdated Show resolved Hide resolved
@ggerganov
Copy link
Owner Author

Thanks, will merge after #8549 is merged

@github-actions github-actions bot added the testing Everything test related label Jul 19, 2024
@slaren
Copy link
Collaborator

slaren commented Jul 19, 2024

@ggerganov the target of this PR is #8549, so I believe this would need to be merged first. Do you intend to create a new PR targeting master afterwards?

@ggerganov
Copy link
Owner Author

ggerganov commented Jul 19, 2024

Correct, it's better to merge this PR into #8549 first - will update now

@mofosyne mofosyne added the Review Complexity : High Generally require indepth knowledge of LLMs or GPUs label Jul 19, 2024
@ggerganov ggerganov requested a review from slaren July 19, 2024 14:01
@ggerganov ggerganov merged commit 8cc26be into sl/fix-iqnl-odd-blocks Jul 19, 2024
47 of 52 checks passed
@ggerganov ggerganov deleted the gg/fix-odd-blocks-arm branch July 19, 2024 14:13
slaren added a commit that referenced this pull request Jul 19, 2024
* ggml : fix iq4_nl dot product with odd number of blocks

* ggml : fix odd blocks for ARM_NEON (#8556)

* ggml : fix iq4_nl dot product with odd number of blocks

* ggml : fix q4_1

* ggml : fix q5_0

* ggml : fix q5_1

* ggml : fix iq4_nl metal

ggml-ci

* ggml : fix q4_0

* ggml : fix q8_0

ggml-ci

* ggml : remove special Q4_0 code for first 2 blocks

* ggml : fix sumf redefinition

---------

Co-authored-by: slaren <[email protected]>

---------

Co-authored-by: Georgi Gerganov <[email protected]>
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jul 27, 2024
* ggml : fix iq4_nl dot product with odd number of blocks

* ggml : fix odd blocks for ARM_NEON (ggerganov#8556)

* ggml : fix iq4_nl dot product with odd number of blocks

* ggml : fix q4_1

* ggml : fix q5_0

* ggml : fix q5_1

* ggml : fix iq4_nl metal

ggml-ci

* ggml : fix q4_0

* ggml : fix q8_0

ggml-ci

* ggml : remove special Q4_0 code for first 2 blocks

* ggml : fix sumf redefinition

---------

Co-authored-by: slaren <[email protected]>

---------

Co-authored-by: Georgi Gerganov <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ggml changes relating to the ggml tensor library for machine learning Review Complexity : High Generally require indepth knowledge of LLMs or GPUs testing Everything test related
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants