Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sharding Detection with multiple IO #3540

Merged

Conversation

samnordmann
Copy link
Collaborator

@samnordmann samnordmann commented Dec 6, 2024

What

Patch resharding detection routines in multidevice/utils to handle expressions with multiple I/O

Why

As a step toward https://jirasw.nvidia.com/browse/NVFUSER-106, the motivation for this patch is to be able to keep the MatmulOp (and potentially other ops in the future) as a resharding OP, which will undergo a special HostIr lowering (bypassing ReorderShardedAxisPass)

@samnordmann samnordmann force-pushed the sharding_detection_with_multiple_IO branch from a4227ee to 970d80a Compare December 6, 2024 14:32
@samnordmann
Copy link
Collaborator Author

!test

csrc/multidevice/utils.cpp Outdated Show resolved Hide resolved
csrc/multidevice/utils.cpp Outdated Show resolved Hide resolved
@wujingyue
Copy link
Collaborator

!test

@samnordmann samnordmann merged commit 2749296 into NVIDIA:main Dec 11, 2024
40 of 41 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants