Skip to content

Actions: neuralmagic/vllm

clang-format

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
247 workflow runs
247 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Core] Support Lora lineage and base model metadata management (#6315)
clang-format #98: Commit 260d40b pushed by ElizaWszola
September 20, 2024 07:19 15s main
September 20, 2024 07:19 15s
Create SECURITY.md (#8642)
clang-format #97: Commit 9e99407 pushed by tlrmchlsmth
September 19, 2024 19:46 16s main
September 19, 2024 19:46 16s
September 19, 2024 18:11 16s
Revert "[Misc][Bugfix] Disable guided decoding for mistral tokenizer"…
clang-format #95: Commit 02c9afa pushed by ElizaWszola
September 19, 2024 04:35 19s main
September 19, 2024 04:35 19s
[BugFix] Nonzero exit code if MQLLMEngine startup fails (#8572)
clang-format #94: Commit d9cd78e pushed by tlrmchlsmth
September 18, 2024 20:20 18s main
September 18, 2024 20:20 18s
[Misc] Don't dump contents of kvcache tensors on errors (#8527)
clang-format #93: Commit 56c3de0 pushed by tlrmchlsmth
September 17, 2024 21:43 18s main
September 17, 2024 21:43 18s
DO NOT MERGE : Layer-by-Layer Profiling
clang-format #92: Pull request #3 synchronize by LucasWilkinson
September 17, 2024 15:03 20s varun/main-with-profiler
September 17, 2024 15:03 20s
DO NOT MERGE : Layer-by-Layer Profiling
clang-format #91: Pull request #3 synchronize by varun-sundar-rabindranath
September 16, 2024 21:29 45m 46s varun/main-with-profiler
September 16, 2024 21:29 45m 46s
DO NOT MERGE : Layer-by-Layer Profiling
clang-format #90: Pull request #3 synchronize by LucasWilkinson
September 16, 2024 21:11 48m 41s varun/main-with-profiler
September 16, 2024 21:11 48m 41s
DO NOT MERGE : Layer-by-Layer Profiling
clang-format #89: Pull request #3 synchronize by LucasWilkinson
September 16, 2024 20:51 25s varun/main-with-profiler
September 16, 2024 20:51 25s
[BugFix] Fix clean shutdown issues (#8492)
clang-format #88: Commit acd5511 pushed by tlrmchlsmth
September 16, 2024 17:27 17s main
September 16, 2024 17:27 17s
DO NOT MERGE : Layer-by-Layer Profiling
clang-format #87: Pull request #3 synchronize by LucasWilkinson
September 16, 2024 16:14 19s varun/main-with-profiler
September 16, 2024 16:14 19s
[Kernel] Enable 8-bit weights in Fused Marlin MoE (#8032)
clang-format #86: Commit a091e2d pushed by alexm-neuralmagic
September 16, 2024 15:49 18s main
September 16, 2024 15:49 18s
September 16, 2024 01:57 15s
bump version to v0.6.1.post2 (#8473)
clang-format #78: Commit 9ba0817 pushed by tlrmchlsmth
September 13, 2024 19:35 17s main
September 13, 2024 19:35 17s
[Doc] Add oneDNN installation to CPU backend documentation (#8467)
clang-format #77: Commit f57092c pushed by dsikka
September 13, 2024 18:18 17s main
September 13, 2024 18:18 17s
[multi-step] add flashinfer backend (#7928)
clang-format #76: Commit a6c0f36 pushed by alexm-neuralmagic
September 12, 2024 18:40 14s main
September 12, 2024 18:40 14s
[torch.compile] hide slicing under custom op for inductor (#8384)
clang-format #74: Commit 7de49aa pushed by alexm-neuralmagic
September 12, 2024 13:05 18s main
September 12, 2024 13:05 18s