Skip to content

Actions: B-201/vllm

Lint documentation

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
22 workflow runs
22 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Doc] Add model development API Reference (#11884)
Lint documentation #22: Commit 65097ca pushed by B-201
January 9, 2025 10:51 23s main
January 9, 2025 10:51 23s
[V1] Add BlockTable class (#11693)
Lint documentation #21: Commit 06bfb51 pushed by B-201
January 6, 2025 06:35 22s main
January 6, 2025 06:35 22s
[VLM] Move supported limits and max tokens to merged multi-modal proc…
Lint documentation #20: Commit a115ac4 pushed by B-201
January 2, 2025 02:04 25s main
January 2, 2025 02:04 25s
[Build][Kernel] Update CUTLASS to v3.6.0 (#11607)
Lint documentation #19: Commit 970d6d0 pushed by B-201
December 30, 2024 10:01 20s main
December 30, 2024 10:01 20s
[CI] Unboock H100 Benchmark (#11419)
Lint documentation #18: Commit 048fc57 pushed by B-201
December 23, 2024 01:56 19s main
December 23, 2024 01:56 19s
[Model] Add JambaForSequenceClassification model (#10860)
Lint documentation #17: Commit 6c7f881 pushed by B-201
December 19, 2024 14:58 23s main
December 19, 2024 14:58 23s
[Bugfix] Cleanup Pixtral HF code (#11333)
Lint documentation #16: Commit a0f7d53 pushed by B-201
December 19, 2024 14:37 25s main
December 19, 2024 14:37 25s
[Bugfix] Fix block size validation (#10938)
Lint documentation #15: Commit 69ba344 pushed by B-201
December 16, 2024 02:21 23s main
December 16, 2024 02:21 23s
PaliGemma 2 support (#11142)
Lint documentation #14: Commit 7cd7409 pushed by B-201
December 13, 2024 07:44 22s main
December 13, 2024 07:44 22s
[CI/Build] Check transformers v4.47 (#10991)
Lint documentation #13: Commit 2e33fe4 pushed by B-201
December 11, 2024 06:47 19s main
December 11, 2024 06:47 19s
Add example of helm chart for vllm deployment on k8s (#9199)
Lint documentation #12: Commit fe2e10c pushed by B-201
December 10, 2024 09:20 25s main
December 10, 2024 09:20 25s
[Model] Add has_weight to RMSNorm and re-enable weights loading track…
Lint documentation #11: Commit d1f6d1c pushed by B-201
December 10, 2024 02:41 20s main
December 10, 2024 02:41 20s
[misc] clean up and unify logging (#10999)
Lint documentation #10: Commit 46004e8 pushed by B-201
December 9, 2024 01:38 22s main
December 9, 2024 01:38 22s
[Misc] Update llama 3.2 template to support system prompt with images…
Lint documentation #9: Commit 39c89e7 pushed by B-201
December 5, 2024 07:18 23s main
December 5, 2024 07:18 23s
[ci/build] Update vLLM postmerge ECR repo (#10887)
Lint documentation #8: Commit c92acb9 pushed by B-201
December 4, 2024 15:11 28s main
December 4, 2024 15:11 28s
[V1] VLM - Run the mm_mapper preprocessor in the frontend process (#1…
Lint documentation #7: Commit 3bc94ca pushed by B-201
December 3, 2024 10:47 19s main
December 3, 2024 10:47 19s
[misc] use out argument for flash attention (#10822)
Lint documentation #6: Commit a4c4daf pushed by B-201
December 2, 2024 14:01 27s main
December 2, 2024 14:01 27s
[Misc] typo find in sampling_metadata.py (#10740)
Lint documentation #5: Commit c82b432 pushed by B-201
November 29, 2024 11:17 18s main
November 29, 2024 11:17 18s
[Frontend] don't block event loop in tokenization (preprocess) in Ope…
Lint documentation #4: Commit 395b1c7 pushed by B-201
November 28, 2024 03:51 20s main
November 28, 2024 03:51 20s
[V1] Refactor model executable interface for multimodal models (#10570)
Lint documentation #3: Commit 2f0a0a1 pushed by B-201
November 27, 2024 01:36 26s main
November 27, 2024 01:36 26s
[Minor] Fix line-too-long (#10563)
Lint documentation #2: Commit 446c780 pushed by B-201
November 22, 2024 03:58 19s main
November 22, 2024 03:58 19s
[TPU] Implement prefix caching for TPUs (#10307)
Lint documentation #1: Commit 2f77b6c pushed by B-201
November 21, 2024 02:21 23s main
November 21, 2024 02:21 23s