Skip to content

Actions: vllm-project/vllm

Cleanup PR Body

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
839 workflow run results
839 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[CI/Build] Print running script to enhance CI log readability
Cleanup PR Body #739: Pull request #10594 opened by jeejeelee
November 23, 2024 15:39 12s
November 23, 2024 15:39 12s
[Bugfix] Avoid import AttentionMetadata explicitly in Mllama and fix openvino import
Cleanup PR Body #738: Pull request #10593 edited by Isotr0py
November 23, 2024 12:06 1m 2s
November 23, 2024 12:06 1m 2s
[Bugfix] Avoid import AttentionMetadata explicitly in Mllama and fix openvino import
Cleanup PR Body #737: Pull request #10593 opened by Isotr0py
November 23, 2024 12:02 14s
November 23, 2024 12:02 14s
Interleaving sliding window for Ministral-8B-Instruct-2410
Cleanup PR Body #736: Pull request #10591 opened by patrickvonplaten
November 23, 2024 10:59 14s
November 23, 2024 10:59 14s
[V1] Refactor model executable interface for multimodal models
Cleanup PR Body #735: Pull request #10570 edited by ywang96
November 23, 2024 09:27 16s
November 23, 2024 09:27 16s
[V1] Refactor model executable interface for multimodal models
Cleanup PR Body #734: Pull request #10570 edited by ywang96
November 23, 2024 07:43 1m 6s
November 23, 2024 07:43 1m 6s
[CI/Build] For ppc64le, disabled tests for now and addressed space issues
Cleanup PR Body #733: Pull request #10538 edited by DarkLight1337
November 23, 2024 06:07 17s
November 23, 2024 06:07 17s
[V1] Refactor model executable interface for multimodal models
Cleanup PR Body #732: Pull request #10570 edited by ywang96
November 23, 2024 03:57 12s
November 23, 2024 03:57 12s
[Kernel] Remove hard-dependencies of Speculative decode to CUDA workers
Cleanup PR Body #731: Pull request #10587 edited by xuechendi
November 23, 2024 02:29 13s
November 23, 2024 02:29 13s
[core] gemma2 full context length support
Cleanup PR Body #730: Pull request #10584 edited by DarkLight1337
November 23, 2024 02:25 14s
November 23, 2024 02:25 14s
[Kernel]Generalize Speculative decode from Cuda
Cleanup PR Body #729: Pull request #10094 edited by xuechendi
November 23, 2024 02:20 15s
November 23, 2024 02:20 15s
[Kernel] Remove hard-dependencies of Speculative decode to CUDA workers
Cleanup PR Body #728: Pull request #10587 opened by xuechendi
November 23, 2024 02:19 15s
November 23, 2024 02:19 15s
【Kernel】Tuning fused moe for qwen2-57b in GTX 4090 (tp4pp2)
Cleanup PR Body #727: Pull request #10586 opened by BBuf
November 23, 2024 01:28 15s
November 23, 2024 01:28 15s
[Misc] Add pynccl wrappers for all_gather and reduce_scatter
Cleanup PR Body #726: Pull request #9432 edited by tlrmchlsmth
November 23, 2024 01:12 1m 6s
November 23, 2024 01:12 1m 6s
[V1] Refactor model executable interface for multimodal models
Cleanup PR Body #725: Pull request #10570 edited by ywang96
November 23, 2024 00:38 14s
November 23, 2024 00:38 14s
[bugfix] fix cpu tests
Cleanup PR Body #724: Pull request #10585 opened by youkaichao
November 23, 2024 00:28 13s
November 23, 2024 00:28 13s
[core] gemma2 full context length support
Cleanup PR Body #723: Pull request #10584 opened by youkaichao
November 22, 2024 23:57 14s
November 22, 2024 23:57 14s
[V1] Refactor model executable interface for multimodal models
Cleanup PR Body #722: Pull request #10570 edited by ywang96
November 22, 2024 23:39 1m 7s
November 22, 2024 23:39 1m 7s
Adding cascade inference to vLLM
Cleanup PR Body #721: Pull request #10011 edited by raywanb
November 22, 2024 22:20 1m 8s
November 22, 2024 22:20 1m 8s
[Bugfix][Frontend] Update Llama Chat Templates to also support Non-Tool use
Cleanup PR Body #720: Pull request #10164 edited by tjohnson31415
November 22, 2024 21:42 13s
November 22, 2024 21:42 13s
[Bugfix][Frontend] Update Llama Chat Templates to also support Non-Tool use
Cleanup PR Body #719: Pull request #10164 edited by tjohnson31415
November 22, 2024 21:31 18s
November 22, 2024 21:31 18s
[V1] Refactor model executable interface for multimodal models
Cleanup PR Body #718: Pull request #10570 edited by ywang96
November 22, 2024 19:57 13s
November 22, 2024 19:57 13s
[V1] Refactor model executable interface for multimodal models
Cleanup PR Body #717: Pull request #10570 edited by ywang96
November 22, 2024 19:21 1m 2s
November 22, 2024 19:21 1m 2s
[WIP] V1 LoRA support
Cleanup PR Body #716: Pull request #10579 edited by varun-sundar-rabindranath
November 22, 2024 18:12 14s
November 22, 2024 18:12 14s
[bugfix] fix full graph tests
Cleanup PR Body #715: Pull request #10581 opened by youkaichao
November 22, 2024 17:57 18s
November 22, 2024 17:57 18s