Skip to content

Actions: ruisearch42/vllm

clang-format

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
42 workflow runs
42 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Kernel]: Cutlass 2:4 Sparsity + FP8/Int8 Quant Support (#10995)
clang-format #42: Commit 60508ff pushed by ruisearch42
December 18, 2024 16:21 14s main
December 18, 2024 16:21 14s
[Distributed] Allow the placement group more time to wait for resourc…
clang-format #41: Commit 0d8451c pushed by ruisearch42
December 13, 2024 21:42 15s main
December 13, 2024 21:42 15s
[Bugfix] Fix Mamba multistep (#11071)
clang-format #40: Commit 9a93973 pushed by ruisearch42
December 11, 2024 01:29 20s main
December 11, 2024 01:29 20s
[Model] Add Internlm2 LoRA support (#5064)
clang-format #39: Commit c83919c pushed by ruisearch42
November 29, 2024 01:53 14s main
November 29, 2024 01:53 14s
[V1] Refactor model executable interface for multimodal models (#10570)
clang-format #38: Commit 2f0a0a1 pushed by ruisearch42
November 27, 2024 01:44 15s main
November 27, 2024 01:44 15s
[Doc] Update README.md with Ray Summit talk links (#10610)
clang-format #37: Commit 49628fe pushed by ruisearch42
November 25, 2024 01:10 18s main
November 25, 2024 01:10 18s
[Docs] Misc updates to TPU installation instructions (#10165)
clang-format #36: Commit 4f168f6 pushed by ruisearch42
November 16, 2024 02:02 18s main
November 16, 2024 02:02 18s
[Docs] Publish meetup slides (#10331)
clang-format #35: Commit 1dbae03 pushed by ruisearch42
November 14, 2024 16:20 22s main
November 14, 2024 16:20 22s
[BugFix][Kernel] Fix Illegal memory access in causal_conv1d in H100 (…
clang-format #34: Commit 9fb12f7 pushed by ruisearch42
October 31, 2024 21:03 15s main
October 31, 2024 21:03 15s
[Bugfix] Bandaid fix for speculative decoding tests (#9327)
clang-format #33: Commit 16b24e7 pushed by ruisearch42
October 13, 2024 23:50 19s main
October 13, 2024 23:50 19s
[Frontend] Expose revision arg in OpenAI server (#8501)
clang-format #32: Commit 837c196 pushed by ruisearch42
September 16, 2024 16:03 16s main
September 16, 2024 16:03 16s
[Documentation][Spec Decode] Add documentation about lossless guarant…
clang-format #31: Commit 2febcf2 pushed by ruisearch42
September 5, 2024 23:55 15s main
September 5, 2024 23:55 15s
chore: Update check-wheel-size.py to read MAX_SIZE_MB from env (#8103)
clang-format #30: Commit ccd7207 pushed by ruisearch42
September 4, 2024 06:32 13s main
September 4, 2024 06:32 13s
[Bugfix][VLM] Add fallback to SDPA for ViT model running on CPU backe…
clang-format #29: Commit ec26653 pushed by ruisearch42
September 3, 2024 17:21 15s main
September 3, 2024 17:21 15s
[Neuron] Adding support for context-lenght, token-gen buckets. (#7885)
clang-format #28: Commit 257afc3 pushed by ruisearch42
August 29, 2024 21:08 19s main
August 29, 2024 21:08 19s
[Core][Kernels] Use FlashInfer backend for FP8 KV Cache when availabl…
clang-format #27: Commit b98cc28 pushed by ruisearch42
August 28, 2024 17:25 19s main
August 28, 2024 17:25 19s
[Bugfix] Allow ScalarType to be compiled with pytorch 2.3 and add che…
clang-format #26: Commit c166e7e pushed by ruisearch42
August 28, 2024 04:35 20s main
August 28, 2024 04:35 20s
[Bugfix] Fix xpu build (#7644)
clang-format #25: Commit 1a36287 pushed by ruisearch42
August 19, 2024 14:52 18s main
August 19, 2024 14:52 18s
[ Bugfix ] Fix Prometheus Metrics With zeromq Frontend (#7279)
clang-format #24: Commit e3b3182 pushed by ruisearch42
August 18, 2024 23:33 17s main
August 18, 2024 23:33 17s
[core][misc] update libcudart finding (#7620)
clang-format #23: Commit d95cc0a pushed by ruisearch42
August 17, 2024 15:36 24s main
August 17, 2024 15:36 24s
[Kernel] fix types used in aqlm and ggml kernels to support dynamo (#…
clang-format #22: Commit 37fd47e pushed by ruisearch42
August 16, 2024 21:10 13s main
August 16, 2024 21:10 13s
support tqdm in notebooks (#7510)
clang-format #21: Commit ec724a7 pushed by ruisearch42
August 16, 2024 16:42 19s main
August 16, 2024 16:42 19s
[Bugfix] Fix default weight loading for scalars (#7534)
clang-format #20: Commit 21313e0 pushed by ruisearch42
August 15, 2024 23:33 16s main
August 15, 2024 23:33 16s
[Misc] Revert compressed-tensors code reuse (#7521)
clang-format #19: Commit f55a9ae pushed by ruisearch42
August 14, 2024 22:35 17s main
August 14, 2024 22:35 17s
[core] [2/N] refactor worker_base input preparation for multi-step (#…
clang-format #18: Commit c08e2b3 pushed by ruisearch42
August 11, 2024 17:00 14s main
August 11, 2024 17:00 14s