Skip to content

Pull requests: ggerganov/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

server : implement cancellable request examples python python script changes server
#11285 opened Jan 17, 2025 by ngxson Loading…
vulkan: fix coopmat2 validation failures ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#11284 opened Jan 17, 2025 by jeffbolznv Loading…
cmake : add sanitizer flags for llama.cpp build Compilation issues devops improvements to build systems and github actions examples ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#11279 opened Jan 17, 2025 by ggerganov Loading…
SYCL: SOFTMAX F16 mask support and other fixes ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#11261 opened Jan 16, 2025 by qnixsynapse Loading…
Adding linenoise.cpp to llama-run build Compilation issues devops improvements to build systems and github actions examples ggml changes relating to the ggml tensor library for machine learning
#11252 opened Jan 15, 2025 by ericcurtin Loading…
SYCL: Introducing memory host pool ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#11251 opened Jan 15, 2025 by s-Nick Loading…
2 of 4 tasks
fix makefile and cmake logic for AARCH64 ggml changes relating to the ggml tensor library for machine learning
#11246 opened Jan 15, 2025 by Haus1 Loading…
AMD: parse the architecture as supplied by gcnArchName ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes script Script related testing Everything test related
#11244 opened Jan 14, 2025 by Haus1 Loading…
Allow s390x to load little endian models unmodified ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#11234 opened Jan 14, 2025 by AlekseiNikiforovIBM Loading…
Build linux CUDA releases suitable for Colab & other platforms on 12.2 devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning
#11226 opened Jan 14, 2025 by ochafik Draft
4 tasks
sampling: add Top-nσ sampler examples testing Everything test related
#11223 opened Jan 14, 2025 by VJHack Loading…
llama : refactor llama_kv_cache, llama_context and llm_build_context android Issues specific to Android examples server
#11213 opened Jan 13, 2025 by ggerganov Loading…
2 of 8 tasks
cmake : enable -Wshadow for C++ code build Compilation issues devops improvements to build systems and github actions examples ggml changes relating to the ggml tensor library for machine learning server
#11193 opened Jan 11, 2025 by ggerganov Draft
2 tasks done
Allow compiling cuda without mmq and flash attention ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#11190 opened Jan 11, 2025 by milot-mirdita Loading…
CUDA op getrows fails for long sequences ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#11189 opened Jan 11, 2025 by milot-mirdita Loading…
Fix ggml-cuda using a driver symbol in NO_VMM mode ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#11188 opened Jan 11, 2025 by milot-mirdita Loading…
ggml doesn't use sse42, specify only up to sse4.1 ggml changes relating to the ggml tensor library for machine learning
#11187 opened Jan 11, 2025 by milot-mirdita Loading…
Fix GGML not compiling on macOS with GCC ggml changes relating to the ggml tensor library for machine learning
#11185 opened Jan 11, 2025 by milot-mirdita Loading…
ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU Apple Metal https://en.wikipedia.org/wiki/Metal_(API) enhancement New feature or request ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs performance Speed related topics python python script changes Review Complexity : High Generally require indepth knowledge of LLMs or GPUs testing Everything test related
#11183 opened Jan 10, 2025 by compilade Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.