Releases · ggerganov/llama.cpp

04 May 17:36

8425001

b2789

gguf-split: add --no-tensor-first-split (#7072)

04 May 14:18

github-actions

b2787

fcd84a0

Fix Linux /sys cpu path to guess number of cores (#7064)

04 May 06:46

github-actions

b2785

92139b9

tests : add test-tokenizer-0.sh + fix some tokenizers (#7036)

* tests : add test-tokenizer-0.sh

* unicode : add all unicode number ranges

* starcoder : fix pre-tokenizer

* tests : add test that fails with DeepSeek tokenizers

* falcon : fix regex

* unicode : regenerate unicode tables

* refact : add tokenizer model

* lint : fix

* tests : disable failing tests

ggml-ci

* refact : add tests files

ggml-ci

* convert : print -> logging

ggml-ci

* lint : fix

* unicode : digit -> number

* phi-3 : update

03 May 20:29

github-actions

b2784

a2ac89d

convert.py : add python logging instead of print() (#6511)

* convert.py: add python logging instead of print()

* convert.py: verbose flag takes priority over dump flag log suppression

* convert.py: named instance logging

* convert.py: use explicit logger id string

* convert.py: convert extra print() to named logger

* convert.py: sys.stderr.write --> logger.error

* *.py: Convert all python scripts to use logging module

* requirements.txt: remove extra line

* flake8: update flake8 ignore and exclude to match ci settings

* gh-actions: add flake8-no-print to flake8 lint step

* pre-commit: add flake8-no-print to flake8 and also update pre-commit version

* convert-hf-to-gguf.py: print() to logger conversion

* *.py: logging basicConfig refactor to use conditional expression

* *.py: removed commented out logging

* fixup! *.py: logging basicConfig refactor to use conditional expression

* constants.py: logger.error followed by exit should raise an exception instead

* *.py: Convert logger error and sys.exit() into a raise exception (for atypical error)

* gguf-convert-endian.py: refactor convert_byteorder() to use tqdm progressbar

* verify-checksum-models.py: this is the result of the program, so it should be printed to stdout

* compare-llama-bench.py: add blank line for readability during missing repo response

* reader.py: read_gguf_file() use print() over logging

* convert.py: warning goes to stderr and won't hurt the dump output

* gguf-dump.py: dump_metadata() should print to stdout

* convert-hf-to-gguf.py: print --> logger.debug or ValueError()

* verify-checksum-models.py: use print() for printing table

* *.py: refactor logging.basicConfig()

* gguf-py/gguf/*.py: use __name__ as logger name

Since they will be imported and not run directly.

* python-lint.yml: use .flake8 file instead

* constants.py: logger no longer required

* convert-hf-to-gguf.py: add additional logging

* convert-hf-to-gguf.py: print() --> logger

* *.py: fix flake8 warnings

* revert changes to convert-hf-to-gguf.py for get_name()

* convert-hf-to-gguf-update.py: use triple quoted f-string instead

* *.py: accidentally corrected the wrong line

* *.py: add compilade warning suggestions and style fixes
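The logging conventions listed in the commit above can be sketched roughly as follows. This is a minimal illustration, not code from the repository: the logger id `"convert-sketch"`, the helper names `configure_logging` and `load_vocab_size`, and the `--verbose` flag wiring are all assumptions made for the example.

```python
import argparse
import logging

# Scripts that are run directly use an explicit logger id string (hypothetical here).
# Modules that are only imported would call logging.getLogger(__name__) instead.
logger = logging.getLogger("convert-sketch")


def configure_logging(verbose: bool) -> int:
    # basicConfig with a conditional expression: DEBUG when verbose, INFO otherwise.
    # Records go to stderr by default, so stdout stays free for program results.
    level = logging.DEBUG if verbose else logging.INFO
    logging.basicConfig(level=level, force=True)  # force=True needs Python 3.8+
    return level


def load_vocab_size(params: dict) -> int:
    # An atypical error raises an exception instead of logger.error() + sys.exit().
    if "vocab_size" not in params:
        raise ValueError("missing 'vocab_size' in params")
    logger.debug("vocab_size = %d", params["vocab_size"])
    return params["vocab_size"]


def main(argv=None) -> int:
    parser = argparse.ArgumentParser()
    parser.add_argument("--verbose", action="store_true")
    args = parser.parse_args(argv)

    configure_logging(args.verbose)
    logger.info("loading parameters")  # diagnostics go through the logger
    size = load_vocab_size({"vocab_size": 32000})
    print(size)  # the actual result of the program still goes to stdout
    return 0
```

Calling `main(["--verbose"])` would emit the debug record on stderr while the result itself is still printed to stdout, which matches the split the commit describes for the dump and checksum scripts.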

03 May 14:28

github-actions

b2783

433def2

llama : rename ctx to user_data in progress_callback (#7045)

* llama : rename ctx to user_data in progress_callback

This commit renames the `ctx` parameter to `user_data` in the
`llama_progress_callback` typedef.

The motivation for this is that other callbacks use `user_data` or
`data`, and `ctx` in this case could be mistaken for a
`llama_context`.

---------

Signed-off-by: Daniel Bevenius

02 May 16:59

github-actions

b2781

6ecf318

chore: fix typo in llama.cpp (#7032)

Co-authored-by: Jared Van Bortel

01 May 22:09

github-actions

b2780

b0d943d

Update LOG_IMPL and LOG_TEE_IMPL (#7029)

ROCm clang defines `_MSC_VER`, which causes the wrong implementations of `LOG_IMPL` and `LOG_TEE_IMPL` to be compiled.

This fixes https://github.com/ggerganov/llama.cpp/issues/6972

01 May 20:05

github-actions

b2779

8d608a8

main : fix off by one error for context shift (#6921)

01 May 14:05

github-actions

b2777

1613ef8

CUDA: CUDART < 11.7 workaround for __hmax, __hmax2 (#7019)

01 May 05:49

github-actions

b2776

c4ec9c0

ci : exempt confirmed bugs from being tagged as stale (#7014)
