-
Notifications
You must be signed in to change notification settings - Fork 10.3k
Pull requests: ggerganov/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
added rudimentary support for outetts v0.3 500m and 1b models
examples
#11287
opened Jan 18, 2025 by
LostRuins
Loading…
vulkan: fix coopmat2 validation failures
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#11284
opened Jan 17, 2025 by
jeffbolznv
Loading…
fix: Use Q8_0 for all embedding quantizations for granite and granitemoe
#11283
opened Jan 17, 2025 by
gabe-l-hart
Loading…
simple-chat : fix BOS being added to each message
examples
#11278
opened Jan 17, 2025 by
ggerganov
Loading…
SYCL: SOFTMAX F16 mask support and other fixes
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#11261
opened Jan 16, 2025 by
qnixsynapse
Loading…
Adding linenoise.cpp to llama-run
build
Compilation issues
devops
improvements to build systems and github actions
examples
ggml
changes relating to the ggml tensor library for machine learning
#11252
opened Jan 15, 2025 by
ericcurtin
Loading…
SYCL: Introducing memory host pool
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#11251
opened Jan 15, 2025 by
s-Nick
Loading…
2 of 4 tasks
fix makefile and cmake logic for AARCH64
ggml
changes relating to the ggml tensor library for machine learning
#11246
opened Jan 15, 2025 by
Haus1
Loading…
AMD: parse the architecture as supplied by gcnArchName
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
script
Script related
testing
Everything test related
#11244
opened Jan 14, 2025 by
Haus1
Loading…
Allow s390x to load little endian models unmodified
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#11234
opened Jan 14, 2025 by
AlekseiNikiforovIBM
Loading…
Allow compiling cuda without mmq and flash attention
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#11190
opened Jan 11, 2025 by
milot-mirdita
Loading…
CUDA op getrows fails for long sequences
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#11189
opened Jan 11, 2025 by
milot-mirdita
Loading…
Fix ggml-cuda using a driver symbol in NO_VMM mode
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#11188
opened Jan 11, 2025 by
milot-mirdita
Loading…
ggml doesn't use sse42, specify only up to sse4.1
ggml
changes relating to the ggml tensor library for machine learning
#11187
opened Jan 11, 2025 by
milot-mirdita
Loading…
Fix GGML not compiling on macOS with GCC
ggml
changes relating to the ggml tensor library for machine learning
#11185
opened Jan 11, 2025 by
milot-mirdita
Loading…
ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
enhancement
New feature or request
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
performance
Speed related topics
python
python script changes
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
testing
Everything test related
#11183
opened Jan 10, 2025 by
compilade
Loading…
FR: server: Pre-fill textarea and auto-generate based on query parameters
examples
server
#11150
opened Jan 9, 2025 by
tim-janik
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.