[Cleanup][Kernel] Remove if-else with identical branches in marlin 2:4 #10687

tlrmchlsmth · 2024-11-27T02:21:58Z

Clean up a useless if-else in marlin_24_cuda_kernel.cu:

vllm/csrc/quantization/marlin/sparse/marlin_24_cuda_kernel.cu

Lines 299 to 305 in 0a4d968

    
           if (group_blocks != -1) { 
        
             s_sh_rd = 8 * ((threadIdx.x / 32) % (thread_n_blocks / 4)) + 
        
                       (threadIdx.x % 32) / 4; 
        
           } else { 
        
             s_sh_rd = 8 * ((threadIdx.x / 32) % (thread_n_blocks / 4)) + 
        
                       (threadIdx.x % 32) / 4; 
        
           }

See the equivalent if-else in marlin_cuda_kernel.cu:

vllm/csrc/quantization/marlin/dense/marlin_cuda_kernel.cu

Lines 367 to 372 in 80ca1e6

    
           if (group_blocks != -1) 
        
             s_sh_rd = 8 * ((threadIdx.x / 32) % (thread_n_blocks / 4)) + 
        
                       (threadIdx.x % 32) / 4; 
        
           else 
        
             s_sh_rd = 8 * ((threadIdx.x / 32) % (thread_n_blocks / 4)) + 
        
                       (threadIdx.x % 32) % 4;

See discussion in #6030

From @alexm-neuralmagic in that issue:

this is a leftover from a copy-paste from the original (dense) marlin that had different branches. We have tests to verify all of these cases inside test_marlin_gemm.py that verify both dense and sparse and group_size == -1 and other group_sizes as well.

I'd love to leave a better explanation for the discrepancy between the equivalent if statement in marlin_cuda_kernel.cu, because this looks very suspicious.

(closes #6030)

github-actions · 2024-11-27T02:22:09Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

mgoin

Thanks for addressing :)

csrc/quantization/marlin/sparse/marlin_24_cuda_kernel.cu

Signed-off-by: Tyler Michael Smith <[email protected]>

…roject#10687) Signed-off-by: Tyler Michael Smith <[email protected]> Signed-off-by: Andrew Feldman <[email protected]>

…roject#10687) Signed-off-by: Tyler Michael Smith <[email protected]>

tlrmchlsmth requested review from mgoin and alexm-redhat November 27, 2024 02:22

mgoin approved these changes Nov 27, 2024

View reviewed changes

csrc/quantization/marlin/sparse/marlin_24_cuda_kernel.cu Outdated Show resolved Hide resolved

mgoin added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 27, 2024

[Cleanup][Kernel] Remove if-else with identical branches in marlin 2:4

2ed0278

Signed-off-by: Tyler Michael Smith <[email protected]>

tlrmchlsmth force-pushed the marlin_24_if_cleanup branch from 4af5f6a to 2ed0278 Compare November 27, 2024 02:34

tlrmchlsmth enabled auto-merge (squash) November 27, 2024 02:35

youkaichao disabled auto-merge November 27, 2024 06:55

youkaichao merged commit e225110 into vllm-project:main Nov 27, 2024
66 of 68 checks passed

youkaichao deleted the marlin_24_if_cleanup branch November 27, 2024 06:55

sleepwalker2017 pushed a commit to sleepwalker2017/vllm that referenced this pull request Dec 13, 2024

[Kernel] Remove if-else with identical branches in marlin 2:4 (vllm-p…

8341a04

…roject#10687) Signed-off-by: Tyler Michael Smith <[email protected]>

BKitor pushed a commit to BKitor/vllm that referenced this pull request Dec 30, 2024

[Kernel] Remove if-else with identical branches in marlin 2:4 (vllm-p…

cdc3104

…roject#10687) Signed-off-by: Tyler Michael Smith <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Cleanup][Kernel] Remove if-else with identical branches in marlin 2:4 #10687

[Cleanup][Kernel] Remove if-else with identical branches in marlin 2:4 #10687

tlrmchlsmth commented Nov 27, 2024 •

edited by github-actions bot

Loading

github-actions bot commented Nov 27, 2024

mgoin left a comment

	if (group_blocks != -1) {
	s_sh_rd = 8 * ((threadIdx.x / 32) % (thread_n_blocks / 4)) +
	(threadIdx.x % 32) / 4;
	} else {
	s_sh_rd = 8 * ((threadIdx.x / 32) % (thread_n_blocks / 4)) +
	(threadIdx.x % 32) / 4;
	}

	if (group_blocks != -1)
	s_sh_rd = 8 * ((threadIdx.x / 32) % (thread_n_blocks / 4)) +
	(threadIdx.x % 32) / 4;
	else
	s_sh_rd = 8 * ((threadIdx.x / 32) % (thread_n_blocks / 4)) +
	(threadIdx.x % 32) % 4;

[Cleanup][Kernel] Remove if-else with identical branches in marlin 2:4 #10687

[Cleanup][Kernel] Remove if-else with identical branches in marlin 2:4 #10687

Conversation

tlrmchlsmth commented Nov 27, 2024 • edited by github-actions bot Loading

github-actions bot commented Nov 27, 2024

mgoin left a comment

Choose a reason for hiding this comment

tlrmchlsmth commented Nov 27, 2024 •

edited by github-actions bot

Loading