Fix: attempt to reduce the impact of a worst-case scenario on defragmentation #6037

Xarbirus · 2024-03-13T10:33:09Z

Perhaps this update for the llama_kv_cache_defrag_internal function will help improve the handling of very large holes in the cache.

llama.cpp

…rg#6037) * attempt to reduce the impact of a worst-case scenario * fragmentation calculation fix * Update llama.cpp --------- Co-authored-by: Georgi Gerganov <[email protected]>

Xarbirus mentioned this pull request Mar 13, 2024

Checking llama's defrag_graph size #6019

Closed

Xarbirus added 2 commits March 13, 2024 21:55

attempt to reduce the impact of a worst-case scenario

97ad402

fragmentation calculation fix

38328bb

Xarbirus force-pushed the defrag-update branch from a71b695 to 38328bb Compare March 13, 2024 20:59

Xarbirus marked this pull request as ready for review March 13, 2024 21:01

ggerganov approved these changes Mar 14, 2024

View reviewed changes

llama.cpp Outdated Show resolved Hide resolved

Update llama.cpp

b88cd9f

ggerganov merged commit 2c4fb69 into ggml-org:master Mar 14, 2024
49 of 60 checks passed

Xarbirus deleted the defrag-update branch April 17, 2024 10:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: attempt to reduce the impact of a worst-case scenario on defragmentation #6037

Fix: attempt to reduce the impact of a worst-case scenario on defragmentation #6037

Xarbirus commented Mar 13, 2024

Fix: attempt to reduce the impact of a worst-case scenario on defragmentation #6037

Fix: attempt to reduce the impact of a worst-case scenario on defragmentation #6037

Conversation

Xarbirus commented Mar 13, 2024