Use indirect dispatch without "check_order" optimization #4

Open
arcman7 opened this issue Aug 25, 2024 · 4 comments

@arcman7

arcman7 commented Aug 25, 2024

Hi there again!

I was wondering, are there any consequences that you're aware of if I were to use indirect dispatching without the "check_order" optimization enabled?

In my scenario I would be running a pre-processing step prior to calling the RadixSortKernel. The keys and values buffers will be updated frequently. If I'm able to determine the dispatch sizes necessary for all pipelines used by the RadixSortKernel in my pre-processing shader, I'd be able to use the dispatchPipelinesIndirect method - is that correct?
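For reference, the general WebGPU mechanism this would rely on looks roughly like the sketch below. This is not the library's dispatchPipelinesIndirect API itself, and the pre-processing shader is a trivial stand-in that just writes a fixed dispatch size into a buffer flagged for indirect use:

```js
// Minimal sketch of GPU-driven dispatch sizing (assumptions: a trivial
// stand-in pre-processing shader; not the library's actual API).
const adapter = await navigator.gpu.requestAdapter();
const device = await adapter.requestDevice();

// One [x, y, z] dispatch-size triple, written on the GPU and later consumed
// by dispatchWorkgroupsIndirect, so no CPU readback is needed in between.
const indirectBuffer = device.createBuffer({
    size: 3 * 4, // three u32s
    usage: GPUBufferUsage.STORAGE | GPUBufferUsage.INDIRECT,
});

const preprocess = device.createComputePipeline({
    layout: 'auto',
    compute: {
        entryPoint: 'main',
        module: device.createShaderModule({ code: /* wgsl */ `
            @group(0) @binding(0) var<storage, read_write> dispatch_args: array<u32, 3>;

            @compute @workgroup_size(1)
            fn main() {
                // A real pre-processing step would derive these from the data.
                dispatch_args[0] = 64u; // x
                dispatch_args[1] = 1u;  // y
                dispatch_args[2] = 1u;  // z
            }
        ` }),
    },
});

const bindGroup = device.createBindGroup({
    layout: preprocess.getBindGroupLayout(0),
    entries: [{ binding: 0, resource: { buffer: indirectBuffer } }],
});

const encoder = device.createCommandEncoder();
const pass = encoder.beginComputePass();
pass.setPipeline(preprocess);
pass.setBindGroup(0, bindGroup);
pass.dispatchWorkgroups(1);
// A subsequent pipeline could then be launched with:
// pass.dispatchWorkgroupsIndirect(indirectBuffer, 0);
pass.end();
device.queue.submit([encoder.finish()]);
```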

@kishimisu
Owner

kishimisu commented Aug 26, 2024

Hey!

There are a few things to take into account, but it's not that hard :)
I've created a new branch that includes an additional use_indirect_dispatch boolean parameter that can be used if check_order is disabled. I can push it to main if you find this parameter useful.

However, I would suggest reading the Order Checking section in the readme. During my testing I've observed that using indirect dispatch for the compute passes resulted in slower performance, which is why I didn't include the option.
I would be curious to see if it's faster for you!
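Hypothetical usage on that branch, just to illustrate the combination of options. Apart from check_order and use_indirect_dispatch, the option names below are guesses and may not match the actual constructor:

```js
// Sketch only: enabling indirect dispatch with order checking disabled.
// Option names other than check_order and use_indirect_dispatch are assumptions.
const kernel = new RadixSortKernel({
    device,                      // GPUDevice
    keys: keysBuffer,            // GPUBuffer of keys to sort
    values: valuesBuffer,        // GPUBuffer of associated values
    count: elementCount,         // number of elements
    check_order: false,          // must stay disabled for indirect dispatch
    use_indirect_dispatch: true, // new parameter introduced on the branch
});
```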

@arcman7
Author

arcman7 commented Aug 27, 2024

> During my testing I've observed that using indirect dispatch for the compute passes resulted in slower performance, which is why I didn't include the option.
> I would be curious to see if it's faster for you!

It is odd... I've noticed that as well just by testing out different settings on your demo page. Right now my only guess is that it has something to do with the number of pipelines being created and the corresponding volume of data uploaded to GPU memory. I did notice, though, that once the number of sorted elements is large enough, the check_order optimization does start to pay off in terms of reducing the total sort time.

Taking a look at your branch now

@arcman7
Author

arcman7 commented Aug 29, 2024

No updates as of yet - still integrating your indirect branch with some specific modifications that I need.

I did have a small question though -

What's the difference between WORKGROUP_COUNT and num_workgroups just below on line 19?

@kishimisu
Owner

Sorry for the late reply! They both represent the number of workgroups (or dispatch size) for the current pass, just in different formats:
num_workgroups is a builtin vec3 containing the number of workgroups in each dimension, while WORKGROUP_COUNT is a constant containing the total number of workgroups (x * y * z).
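A minimal illustration of the relationship (the shader below is a made-up example, not the library's kernel code; only num_workgroups and WORKGROUP_COUNT come from it):

```js
// The JS side knows the dispatch size, so it can bake the flat total into the
// shader source as a constant, while the per-dimension vec3 is a runtime builtin.
const dispatch = { x: 8, y: 4, z: 1 };                        // example dispatch size
const WORKGROUP_COUNT = dispatch.x * dispatch.y * dispatch.z; // flat total

const shaderCode = /* wgsl */ `
    const WORKGROUP_COUNT: u32 = ${WORKGROUP_COUNT}u; // total workgroups (x * y * z)

    @compute @workgroup_size(64)
    fn main(@builtin(num_workgroups) num_workgroups: vec3<u32>) {
        // Flat total recovered from the builtin; equals WORKGROUP_COUNT
        // when the constant was generated from the same dispatch size.
        let total = num_workgroups.x * num_workgroups.y * num_workgroups.z;
    }
`;
```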
