Pull requests: ggerganov/llama.cpp
server: Add "tokens per second" information in the backend
examples
server
#10548
opened Nov 27, 2024 by
lhpqaq
2 of 4 tasks
llava: return false instead of exit
examples
#10546
opened Nov 27, 2024 by
tinglou
2 of 4 tasks
llama/ggml: add LLM training support
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
Review Complexity : High
Generally require in-depth knowledge of LLMs or GPUs
testing
Everything test related
#10544
opened Nov 27, 2024 by
JohannesGaessler
Draft
cmake : fix ARM feature detection
ggml
#10543
opened Nov 27, 2024 by
ggerganov
kompute: improve backend for pass test_backend_ops
ggml
Kompute
https://github.com/KomputeProject/kompute/
#10542
opened Nov 27, 2024 by
slp
2 of 4 tasks
ggml-cpu: support IQ4_NL_4_4 by runtime repack
ggml
#10541
opened Nov 27, 2024 by
FanShupei
2 of 4 tasks
[CANN] ROPE operator optimization
ggml
#10540
opened Nov 27, 2024 by
noemotiovon
2 of 4 tasks
Update cann.md to ensure it displays correctly on all platforms.
documentation
Improvements or additions to documentation
#10538
opened Nov 27, 2024 by
HRXWEB
2 of 4 tasks
vulkan: Dynamic subgroup size support for Q6_K mat_vec
ggml
Vulkan
Issues specific to the Vulkan backend
#10536
opened Nov 27, 2024 by
netrunnereve
Draft
2 of 4 tasks
DO NOT MERGE Add olmo2 tokenizer to convert script (leaving open for discussion)
python
python script changes
#10535
opened Nov 26, 2024 by
bartowski1182
Draft
2 of 4 tasks
llama : use cmake for swift build
build
Compilation issues
devops
improvements to build systems and github actions
CANN: cann backend build failed when manually specify SOC_TYPE or gcc version that isn't verified
documentation
ggml
#10519
opened Nov 26, 2024 by
leo-pony
2 of 4 tasks
Opt class for positional argument handling
examples
#10508
opened Nov 26, 2024 by
ericcurtin
2 of 4 tasks
vulkan: get the first command buffer submitted sooner
#10499
opened Nov 25, 2024 by
jeffbolznv
2 of 4 tasks
fix: ggml: fix vulkan-shaders-gen build
ggml
Vulkan
#10448
opened Nov 22, 2024 by
sparkleholic
2 of 4 tasks
Integrating llama.cpp with Microsoft Word
#10443
opened Nov 21, 2024 by
GPTLocalhost
2 of 4 tasks
sycl : offload of get_rows set to false
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#10432
opened Nov 20, 2024 by
Alcpz
2 of 4 tasks
bug-fix: snprintf prints NULL in place of the last character
examples
server
#10419
opened Nov 20, 2024 by
kallewoof
2 of 4 tasks
sycl : permuted mul_mat through oneMKL
SYCL
#10408
opened Nov 19, 2024 by
Alcpz
2 of 4 tasks
server: Fix the status of finish_reason if the stream value is False
examples
server
#10382
opened Nov 18, 2024 by
SeongBeomLEE
2 of 4 tasks