Commit

Fix the packBIdx when HasEccHalf is true but HasWMMA_V1 is false and scheduleIterAlg = 3
vin-huang committed Nov 25, 2024
1 parent 4e31cef commit 76ab700
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion tensilelite/Tensile/KernelWriter.py
@@ -1329,7 +1329,7 @@ def hasAnyDependency(lr: DSLoadInstruction, insts: List[Instruction]):
         packBIdx = packBIdx if tPB["bpe"] < 4 and (not kernel["UnrollMajorLDSB"] or kernel["ConvertAfterDS"]) else 0
     else:
         packAIdx = packAIdx if tPA["localReadInstruction"].blockWidth == 0.25 else 0
-        packBIdx = packAIdx if tPB["localReadInstruction"].blockWidth == 0.25 else 0
+        packBIdx = packBIdx if tPB["localReadInstruction"].blockWidth == 0.25 else 0
     numPack = (packAIdx + packBIdx)
     if kernel["ProblemType"]["Sparse"] and not kernel["DirectToVgprSparseMetadata"]:
         packMIdx = packMIdx if not kernel["UnrollMajorLDSMetadata"] else 0
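
The effect of the one-line change can be seen in a minimal standalone sketch. The helper names `select_pack_idx_before` and `select_pack_idx_after`, the plain-float block widths, and the sample counts are all hypothetical and not part of Tensile; the sketch only mirrors the two conditional expressions in the `else` branch.

```python
def select_pack_idx_before(pack_a_idx, pack_b_idx, block_width_a, block_width_b):
    """Hypothetical mirror of the else branch before commit 76ab700."""
    pack_a_idx = pack_a_idx if block_width_a == 0.25 else 0
    # Bug: B's value is taken from the already-updated A value.
    pack_b_idx = pack_a_idx if block_width_b == 0.25 else 0
    return pack_a_idx, pack_b_idx


def select_pack_idx_after(pack_a_idx, pack_b_idx, block_width_a, block_width_b):
    """Hypothetical mirror of the else branch after commit 76ab700."""
    pack_a_idx = pack_a_idx if block_width_a == 0.25 else 0
    # Fix: B keeps its own value when its block width is 0.25.
    pack_b_idx = pack_b_idx if block_width_b == 0.25 else 0
    return pack_a_idx, pack_b_idx


# Assumed sample state: A has 4 pack entries, B has 6,
# and only B's local read uses a block width of 0.25.
before = select_pack_idx_before(4, 6, 0.5, 0.25)
after = select_pack_idx_after(4, 6, 0.5, 0.25)

print("before:", before, "numPack =", sum(before))
print("after: ", after, "numPack =", sum(after))
```

With these assumed inputs, the old expression zeroes B's value because A's was just reset to 0, so `numPack` undercounts; the corrected expression preserves B's own value.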

0 comments on commit 76ab700
