Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

HIP/CUDA: set the paramerter value in maintain_cuda_graph instead of replaceing it. ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#12209 opened Mar 5, 2025 by IMbackK Loading…
feat(CMakeLists): Add MSVC-specific compiler warning flags in CMake configuration ggml changes relating to the ggml tensor library for machine learning
#12206 opened Mar 5, 2025 by 25077667 Loading…
opencl: Fix not enough space in the buffer ggml changes relating to the ggml tensor library for machine learning
#12197 opened Mar 5, 2025 by linehill Loading…
metal : simplify kernel arguments using a struct (#3229) Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#12194 opened Mar 5, 2025 by BB-fat Loading…
SYCL: Rename oneMKL to oneMath documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12192 opened Mar 5, 2025 by Rbiessy Loading…
vulkan: double buffer scale caches ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12188 opened Mar 4, 2025 by netrunnereve Loading…
CUDA: Fix new mma detection for Turing cards with Volta PTX ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#12187 opened Mar 4, 2025 by neilmehta24 Loading…
fix: AVX2 intrinsics, const correctness, and SIMD headers build Compilation issues ggml changes relating to the ggml tensor library for machine learning
#12186 opened Mar 4, 2025 by sandboxyer Loading…
CUDA: Improve flash decoding kernel GPU occupancy for BS=1 case ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#12183 opened Mar 4, 2025 by gaugarg-nv Loading…
1 of 3 tasks
HIP: rocWMMA documentation and enabling in workflow builds devops improvements to build systems and github actions documentation Improvements or additions to documentation
#12179 opened Mar 4, 2025 by hjc4869 Loading…
CUDA/HIP: refractor mmqv to unify the calculation of nwarps and rows per block between host and device code. ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#12177 opened Mar 4, 2025 by IMbackK Loading…
opencl: Fix ulong kernel args were set from int variables ggml changes relating to the ggml tensor library for machine learning
#12174 opened Mar 4, 2025 by linehill Loading…
1 task done
ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions ggml changes relating to the ggml tensor library for machine learning
#12154 opened Mar 2, 2025 by remyoudompheng Loading…
Vulkan: Add DP4A MMQ and Q8_1 quantization shader ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12135 opened Mar 1, 2025 by 0cc4m Draft
6 tasks
Server: openai-style lookup decoding examples python python script changes server
#12127 opened Mar 1, 2025 by eeroel Loading…
sycl: cleanup oneDNN related code documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12097 opened Feb 27, 2025 by sgeor255 Draft
opencl:Fix profile-related errors ggml changes relating to the ggml tensor library for machine learning
#12095 opened Feb 27, 2025 by simon886212 Loading…
cmake : fix undefined reference errors for std::filesystem in ggml (#12092) ggml changes relating to the ggml tensor library for machine learning
#12094 opened Feb 27, 2025 by hbuxiaofei Loading…
vulkan: subgroup size test ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12087 opened Feb 26, 2025 by daniandtheweb Draft
Cache based tokenization for the server input prompts demo Demonstrate some concept or idea, not intended to be merged examples server
#12067 opened Feb 25, 2025 by vnicolici Loading…
ProTip! Follow long discussions with comments:>50.