Skip to content

Commit

Permalink
[auto] Sync version 2406031220.0.0+llamacpp-release.b3075
Browse files Browse the repository at this point in the history
== Relevant log messages from source repo:

commit 3d7ebf63123b8652fb7bbecef7ba731202309901
Author: 0cc4m <[email protected]>
Date:   Mon Jun 3 10:59:14 2024 +0200

    Vulkan Mixture of Experts (MoE) support (#7628)

    * Finish Vulkan mul_mat_id implementation

    * Add Vulkan sum_rows and div ops

    * Fix MUL_MAT_ID matrix matrix shader

    * Fix MUL_MAT_ID matrix vector shader dispatch size

    * Fix MUL_MAT_ID matrix vector shader and dispatch code

    * Update Vulkan CPU offload for MUL_MAT_ID

    * Fix crash when using split mode none and setting a main GPU

commit a10cda58d3199cd85305e0f03a8c6056714ae2e8
Author: Andy Tai <[email protected]>
Date:   Mon Jun 3 01:06:24 2024 -0700

    cmake : add pkg-config spec file for llama.cpp (#7702)

commit 6f28a333c1e3fdfdc7b4f9d0367f2b41a9b7e9d4
Author: zhangkaihuo <[email protected]>
Date:   Mon Jun 3 15:49:30 2024 +0800

    llama : MiniCPM support tied embeddings (#7664)

    * support lm_head

    * remove the code block

    ---------

    Co-authored-by: zhangkaihuo <[email protected]>

commit 549279d8049d78620a2b081e26edb654f83c3bbd
Author: Georgi Gerganov <[email protected]>
Date:   Mon Jun 3 08:34:43 2024 +0300

    llama : avoid double token-to-piece cache (#7654)

    ggml-ci

commit 9e405b6e2ecb888e860f7b92720b4809e21b3915
Author: woachk <[email protected]>
Date:   Mon Jun 3 07:32:16 2024 +0200

    kompute : implement op_getrows_f32 (#6403)

    op_getrows_f32 is required since ggerganov/llama.cpp#6122
    for the Vulkan w/ Kompute backend to be functional.

    As such, implement this op to make this backend functional again.
  • Loading branch information
github-actions committed Jun 3, 2024
1 parent 60e5755 commit 477e51c
Show file tree
Hide file tree
Showing 4 changed files with 4 additions and 4 deletions.
2 changes: 1 addition & 1 deletion Cargo.toml
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
[package]
name = "ggml-sys-bleedingedge"
version = "2406030051.0.0+llamacpp-release.b3070"
version = "2406031220.0.0+llamacpp-release.b3075"
description = "Bleeding edge low-level bindings to GGML. "
repository = "https://github.com/KerfuffleV2/ggml-sys-bleedingedge"
keywords = ["deep-learning", "machine-learning", "tensors", "ggml", "ml"]
Expand Down
2 changes: 1 addition & 1 deletion VERSION.txt
Original file line number Diff line number Diff line change
@@ -1 +1 @@
2406030051.0.0+llamacpp-release.b3070
2406031220.0.0+llamacpp-release.b3075
2 changes: 1 addition & 1 deletion ggml-tag-current.txt
Original file line number Diff line number Diff line change
@@ -1 +1 @@
b3070
b3075
2 changes: 1 addition & 1 deletion ggml-tag-previous.txt
Original file line number Diff line number Diff line change
@@ -1 +1 @@
b3063
b3070

0 comments on commit 477e51c

Please sign in to comment.