Skip to content

Disable spec-decode + chunked-prefill for draft models with tensor parallelism > 1 #61

Disable spec-decode + chunked-prefill for draft models with tensor parallelism > 1

Disable spec-decode + chunked-prefill for draft models with tensor parallelism > 1 #61

Triggered via pull request November 8, 2024 01:56
Status Success
Total duration 16s
Artifacts

cleanup_pr_body.yml

on: pull_request
update-description
6s
update-description
Fit to window
Zoom out
Zoom in