Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

common: add LLAMA_LOG_FILE env var
#17609 opened Nov 30, 2025 by taronaeo Loading…
common : add minimalist multi-thread progress bar
#17602 opened Nov 29, 2025 by angt Loading…
clip: fix nb calculation for qwen3-vl examples
#17594 opened Nov 29, 2025 by ngxson Loading…
Feature/kimi linear support ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs python python script changes
#17592 opened Nov 29, 2025 by cacaview Loading…
Override SSM_A op for Qwen3 Next to reduce splits model Model specific
#17587 opened Nov 29, 2025 by pwilkin Loading…
Improve Qwen3-Next Speed model Model specific
#17585 opened Nov 29, 2025 by lovedheart Draft
Add support for CUMSUM and TRI for CUDA. ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#17584 opened Nov 28, 2025 by pwilkin Loading…
Add safetensors support
#17580 opened Nov 28, 2025 by ericcurtin Draft
Add PagedAttention support (experimental, CUDA only) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17579 opened Nov 28, 2025 by ericcurtin Loading…
model: LFM2-VL fixes examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#17577 opened Nov 28, 2025 by tdakhran Loading…
HIP: enable WMMA-MMQ INT kernels for RDNA 3 ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17576 opened Nov 28, 2025 by jiachengjason Draft
mtmd: support dots.ocr examples python python script changes
#17575 opened Nov 28, 2025 by ngxson Draft
[SYCL] enhance argsort for UT ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17573 opened Nov 28, 2025 by NeoZhangJianyu Loading…
Server: Change Invalid Schema from Server Error (500) to User Error (400) examples python python script changes server testing Everything test related
#17572 opened Nov 28, 2025 by chadvoegele Loading…
ggml-hexagon: fix rope failure at test-backend-ops ggml changes relating to the ggml tensor library for machine learning
#17565 opened Nov 28, 2025 by chraac Loading…
CANN: The Ger operator of OUT_PROD is not supported on the 310p device Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#17563 opened Nov 28, 2025 by TianHao324 Loading…
New llama-run examples server
#17554 opened Nov 27, 2025 by ericcurtin Loading…
cmake : add option to build and link LibreSSL
#17552 opened Nov 27, 2025 by angt Loading…
ggml-cpu: Add operator-level execution time profiling ggml changes relating to the ggml tensor library for machine learning
#17544 opened Nov 27, 2025 by kimminsu38oo Loading…
CANN: add support for partial RoPE and Vision mode Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#17543 opened Nov 27, 2025 by noemotiovon Loading…
vulkan: Fix mismatch in TOPK_MOE unit test ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17541 opened Nov 27, 2025 by rillomas Draft
llama.cpp with sentencepiece testing Everything test related
#17529 opened Nov 26, 2025 by awenzel67 Loading…
ggml-cpu: BMI2 is only available on amd64 ggml changes relating to the ggml tensor library for machine learning
#17528 opened Nov 26, 2025 by candrews Loading…
ProTip! What’s not been updated in a month: updated:<2025-10-29.