Skip to content

Actions: neuralmagic/vllm

Lint documentation

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
30 workflow runs
30 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Doc] [1/N] Initial guide for merged multi-modal processor (#11925)
Lint documentation #30: Commit 12664dd pushed by tlrmchlsmth
January 10, 2025 15:09 20s main
January 10, 2025 15:09 20s
[ci] fix gh200 tests (#11919)
Lint documentation #29: Commit d53575a pushed by varun-sundar-rabindranath
January 10, 2025 09:14 26s main
January 10, 2025 09:14 26s
[Doc] Add model development API Reference (#11884)
Lint documentation #28: Commit 65097ca pushed by mgoin
January 9, 2025 13:38 22s main
January 9, 2025 13:38 22s
[CI] Turn on basic correctness tests for V1 (#10864)
Lint documentation #27: Commit 615e4a5 pushed by varun-sundar-rabindranath
January 9, 2025 02:22 18s main
January 9, 2025 02:22 18s
[Doc][4/N] Reorganize API Reference (#11843)
Lint documentation #26: Commit 6cd40a5 pushed by tlrmchlsmth
January 8, 2025 15:18 18s main
January 8, 2025 15:18 18s
[VLM] Separate out profiling-related logic (#11746)
Lint documentation #25: Commit 996357e pushed by varun-sundar-rabindranath
January 6, 2025 13:25 24s main
January 6, 2025 13:25 24s
[V1] Simplify Shutdown (#11659)
Lint documentation #24: Commit 80c751e pushed by tlrmchlsmth
January 3, 2025 18:08 19s main
January 3, 2025 18:08 19s
According to vllm.EngineArgs, the name should be distributed_executor…
Lint documentation #23: Commit 84c35c3 pushed by tlrmchlsmth
January 2, 2025 18:26 19s main
January 2, 2025 18:26 19s
[benchmark] Remove dependency for H100 benchmark step (#11572)
Lint documentation #22: Commit ccb1aab pushed by varun-sundar-rabindranath
December 31, 2024 01:11 20s main
December 31, 2024 01:11 20s
[Misc] KV cache transfer connector registry (#11481)
Lint documentation #21: Commit faef77c pushed by tlrmchlsmth
December 29, 2024 18:59 18s main
December 29, 2024 18:59 18s
[VLM] Support caching in merged multi-modal processor (#11396)
Lint documentation #20: Commit 1014180 pushed by varun-sundar-rabindranath
December 27, 2024 17:39 19s main
December 27, 2024 17:39 19s
[Bugfix] Fix issues for Pixtral-Large-Instruct-2411 (#11393)
Lint documentation #19: Commit c2d1b07 pushed by varun-sundar-rabindranath
December 21, 2024 18:46 19s main
December 21, 2024 18:46 19s
[Misc] Allow passing logits_soft_cap for xformers backend (#11252)
Lint documentation #18: Commit f9ecbb1 pushed by varun-sundar-rabindranath
December 17, 2024 14:51 22s main
December 17, 2024 14:51 22s
[Doc] Reorder vision language examples in alphabet order (#11228)
Lint documentation #17: Commit 2ca830d pushed by tlrmchlsmth
December 16, 2024 15:34 23s main
December 16, 2024 15:34 23s
[Bugfix] Fix block size validation (#10938)
Lint documentation #16: Commit 69ba344 pushed by varun-sundar-rabindranath
December 16, 2024 01:40 32s main
December 16, 2024 01:40 32s
[Bugfix][Hardware][CPU] Enable Gemma2 with SDPA on CPU backend (#11169)
Lint documentation #15: Commit 0a56bcc pushed by ProExpertProg
December 13, 2024 19:11 18s main
December 13, 2024 19:11 18s
[V1] Fix torch profiling for offline inference (#11125)
Lint documentation #14: Commit 4816d20 pushed by tlrmchlsmth
December 12, 2024 15:57 24s main
December 12, 2024 15:57 24s
[core] Bump ray to use _overlap_gpu_communication in compiled graph t…
Lint documentation #13: Commit 72ff3a9 pushed by tlrmchlsmth
December 11, 2024 20:05 19s main
December 11, 2024 20:05 19s
[BUG] Remove token param #10921 (#11022)
Lint documentation #12: Commit 250ee65 pushed by tlrmchlsmth
December 10, 2024 18:11 25s main
December 10, 2024 18:11 25s
[V1] Input Batch Relocation (#10962)
Lint documentation #11: Commit 25b79d9 pushed by tlrmchlsmth
December 9, 2024 18:51 22s main
December 9, 2024 18:51 22s
[core][executor] simplify instance id (#10976)
Lint documentation #10: Commit 1b62745 pushed by tlrmchlsmth
December 7, 2024 18:01 21s main
December 7, 2024 18:01 21s
[ci] fix broken tests (#10956)
Lint documentation #9: Commit dcdc3fa pushed by tlrmchlsmth
December 6, 2024 19:45 19s main
December 6, 2024 19:45 19s
[CI/Build] improve python-only dev setup (#9621)
Lint documentation #8: Commit e4c34c2 pushed by ProExpertProg
December 4, 2024 22:46 19s main
December 4, 2024 22:46 19s
[torch.compile] Dynamic fp8 + rms_norm fusion
Lint documentation #7: Pull request #31 synchronize by ProExpertProg
December 4, 2024 22:45 20s luka/rms-norm-fusion-refactor
December 4, 2024 22:45 20s
[V1] VLM - Run the mm_mapper preprocessor in the frontend process (#1…
Lint documentation #6: Commit 3bc94ca pushed by tlrmchlsmth
December 3, 2024 14:50 26s main
December 3, 2024 14:50 26s