Skip to content

Commit

Permalink
Update csrc/quantization/marlin/sparse/marlin_24_cuda_kernel.cu
Browse files Browse the repository at this point in the history
Co-authored-by: Michael Goin <[email protected]>
  • Loading branch information
tlrmchlsmth and mgoin authored Nov 27, 2024
1 parent 72c19a8 commit 4af5f6a
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion csrc/quantization/marlin/sparse/marlin_24_cuda_kernel.cu
Original file line number Diff line number Diff line change
Expand Up @@ -297,7 +297,7 @@ __global__ void Marlin_24(
// we scale a `half2` tile in column-major layout in the former and in
// row-major in the latter case.
s_sh_rd = 8 * ((threadIdx.x / 32) % (thread_n_blocks / 4)) +
(threadIdx.x % 32) / 4; // Note thatiIn the original Marlin kernel
(threadIdx.x % 32) / 4; // Note that in the original Marlin kernel
// this is (threadIdx.x % 32) / 4

// Precompute which thread should not read memory in which iterations; this is
Expand Down

0 comments on commit 4af5f6a

Please sign in to comment.