[vllm] Add support for FP8 in Triton FA kernel #1087
Annotations
2 errors and 1 warning
vllm/attention/ops/triton_flash_attention.py#L757
vllm/attention/ops/triton_flash_attention.py:757:81: E501 Line too long (85 > 80)
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
This job failed