Actions: neuralmagic/vllm

clang-format

241 workflow runs

Dynamic group blocks in Marlin MoE
clang-format #117: Pull request #11 synchronize by ElizaWszola
September 25, 2024 11:56 26s marlin-moe-dynamic-group-blocks
Dynamic group blocks in Marlin MoE
clang-format #116: Pull request #11 synchronize by ElizaWszola
September 25, 2024 11:54 25s marlin-moe-dynamic-group-blocks
Add zero point support to Marlin MoE kernel
clang-format #115: Pull request #10 synchronize by ElizaWszola
September 25, 2024 06:43 19s marlin-moe-zero-points
[Frontend] Batch inference for llm.chat() API (#8648)
clang-format #114: Commit 2529d09 pushed by alexm-neuralmagic
September 24, 2024 16:58 20s main
[Kernel] Split Marlin MoE kernels into multiple files (#8661)
clang-format #113: Commit a928ded pushed by ElizaWszola
September 24, 2024 16:36 20s main
[Core][Model] Support loading weights by ID within models (#7931)
clang-format #112: Commit 3f06bae pushed by ElizaWszola
September 24, 2024 07:28 16s main
[Bugfix][CPU] fix missing input intermediate_tensors in the cpu_model…
clang-format #111: Commit 3e83c12 pushed by alexm-neuralmagic
September 23, 2024 13:16 17s main
[Bugfix] Fix CPU CMake build (#8723)
clang-format #110: Commit 57a0702 pushed by ElizaWszola
September 23, 2024 05:04 16s main
Update cpu_extension.cmake
clang-format #109: Pull request #12 synchronize by ProExpertProg
September 23, 2024 01:44 22s ProExpertProg-patch-1
Update cpu_extension.cmake
clang-format #108: Pull request #12 opened by ProExpertProg
September 23, 2024 01:28 16s ProExpertProg-patch-1
[Kernel][Bugfix] Delete some more useless code in marlin_moe_ops.cu (…
clang-format #107: Commit d66ac62 pushed by tlrmchlsmth
September 22, 2024 02:42 16s main
[Kernel][Triton][AMD] Remove tl.atomic_add from awq_gemm_kernel, 2-5x…
clang-format #104: Commit ec4aaad pushed by mgoin
September 21, 2024 21:08 18s main
[Model] Add OLMoE (#7922)
clang-format #103: Commit 3b63de9 pushed by tlrmchlsmth
September 20, 2024 16:33 20s main
Dynamic group blocks in Marlin MoE
clang-format #101: Pull request #11 synchronize by ElizaWszola
September 20, 2024 15:25 16s marlin-moe-dynamic-group-blocks
Dynamic group blocks in Marlin MoE
clang-format #99: Pull request #11 opened by ElizaWszola
September 20, 2024 12:05 18s marlin-moe-dynamic-group-blocks
[Core] Support Lora lineage and base model metadata management (#6315)
clang-format #98: Commit 260d40b pushed by ElizaWszola
September 20, 2024 07:19 15s main
Create SECURITY.md (#8642)
clang-format #97: Commit 9e99407 pushed by tlrmchlsmth
September 19, 2024 19:46 16s main
September 19, 2024 18:11 16s
Revert "[Misc][Bugfix] Disable guided decoding for mistral tokenizer"…
clang-format #95: Commit 02c9afa pushed by ElizaWszola
September 19, 2024 04:35 19s main
[BugFix] Nonzero exit code if MQLLMEngine startup fails (#8572)
clang-format #94: Commit d9cd78e pushed by tlrmchlsmth
September 18, 2024 20:20 18s main
[Misc] Don't dump contents of kvcache tensors on errors (#8527)
clang-format #93: Commit 56c3de0 pushed by tlrmchlsmth
September 17, 2024 21:43 18s main