Skip to content

Disable spec-decode + chunked-prefill for draft models with tensor parallelism > 1 #50

Disable spec-decode + chunked-prefill for draft models with tensor parallelism > 1

Disable spec-decode + chunked-prefill for draft models with tensor parallelism > 1 #50