Pull requests: ggml-org/llama.cpp
HIP/CUDA: set the parameter value in maintain_cuda_graph instead of replacing it.
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#12209
opened Mar 5, 2025 by
IMbackK
feat(CMakeLists): Add MSVC-specific compiler warning flags in CMake configuration
ggml
changes relating to the ggml tensor library for machine learning
#12206
opened Mar 5, 2025 by
25077667
opencl: Fix not enough space in the buffer
ggml
changes relating to the ggml tensor library for machine learning
#12197
opened Mar 5, 2025 by
linehill
metal : simplify kernel arguments using a struct (#3229)
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#12194
opened Mar 5, 2025 by
BB-fat
SYCL: Rename oneMKL to oneMath
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12192
opened Mar 5, 2025 by
Rbiessy
libfuse3 supported mounting split gguf's to a single in-memory file
examples
#12189
opened Mar 5, 2025 by
matbee-eth
•
Draft
vulkan: double buffer scale caches
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12188
opened Mar 4, 2025 by
netrunnereve
CUDA: Fix new mma detection for Turing cards with Volta PTX
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#12187
opened Mar 4, 2025 by
neilmehta24
fix: AVX2 intrinsics, const correctness, and SIMD headers
build
Compilation issues
ggml
changes relating to the ggml tensor library for machine learning
#12186
opened Mar 4, 2025 by
sandboxyer
CUDA: Improve flash decoding kernel GPU occupancy for BS=1 case
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#12183
opened Mar 4, 2025 by
gaugarg-nv
1 of 3 tasks
HIP: rocWMMA documentation and enabling in workflow builds
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
#12179
opened Mar 4, 2025 by
hjc4869
CUDA/HIP: refactor mmqv to unify the calculation of nwarps and rows per block between host and device code.
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#12177
opened Mar 4, 2025 by
IMbackK
opencl: Fix ulong kernel args were set from int variables
ggml
changes relating to the ggml tensor library for machine learning
#12174
opened Mar 4, 2025 by
linehill
1 task done
build: fix build error when build source code on Windows
examples
#12157
opened Mar 3, 2025 by
zhouwg
ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions
ggml
changes relating to the ggml tensor library for machine learning
#12154
opened Mar 2, 2025 by
remyoudompheng
Server: openai-style lookup decoding
examples
python
python script changes
server
#12127
opened Mar 1, 2025 by
eeroel
clip.cpp / gguf-py: Support for Qwen2.5 VL - WIP / REVIEW NEEDED (#11483)
examples
python
python script changes
#12119
opened Feb 28, 2025 by
vladislavdonchev
•
Draft
sycl: cleanup oneDNN related code
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
opencl: Fix profile-related errors
ggml
changes relating to the ggml tensor library for machine learning
#12095
opened Feb 27, 2025 by
simon886212
cmake : fix undefined reference errors for std::filesystem in ggml (#12092)
ggml
changes relating to the ggml tensor library for machine learning
#12094
opened Feb 27, 2025 by
hbuxiaofei
vulkan: subgroup size test
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12087
opened Feb 26, 2025 by
daniandtheweb
•
Draft
llama : expose API to retrieve devices associated with the model.
#12073
opened Feb 25, 2025 by
vlovich