[Bugfix] Change kv scaling factor by param json on nvidia gpu #11688

Merged
mgoin merged 2 commits into vllm-project:main from bjmsong:fp8 on Jan 2, 2025

Commits on Jan 2, 2025