Skip to content

Commit

Permalink
(temp workaround for Triton bug)
Browse files Browse the repository at this point in the history
Triton's autotuner had a bug (fixed in
triton-lang/triton@184fb53 )
that we happen to trigger - this PR is a temp. workaround before we
patch/update Triton.
  • Loading branch information
ilia-cher committed Dec 5, 2024
1 parent 97fd542 commit 55a6889
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion vllm/attention/ops/triton_flash_attention.py
Original file line number Diff line number Diff line change
Expand Up @@ -367,12 +367,12 @@ def attn_fwd(
IS_CAUSAL: tl.constexpr,
BLOCK_M: tl.constexpr,
BLOCK_DMODEL: tl.constexpr,
USE_FP8: tl.constexpr,
BLOCK_N: tl.constexpr,
PRE_LOAD_V: tl.constexpr,
BIAS_TYPE: tl.constexpr,
ENABLE_DROPOUT: tl.constexpr,
RETURN_ENCODED_SOFTMAX: tl.constexpr,
USE_FP8: tl.constexpr,
):
start_m = tl.program_id(0)
off_h_q = tl.program_id(1)
Expand Down

0 comments on commit 55a6889

Please sign in to comment.