-
-
Notifications
You must be signed in to change notification settings - Fork 5.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[torch.compile] PostGradPassManager, Inductor code caching fix, fix_functionalization pass refactor + tests #10273
[torch.compile] PostGradPassManager, Inductor code caching fix, fix_functionalization pass refactor + tests #10273
Conversation
👋 Hi! Thank you for contributing to the vLLM project. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can do one of these:
🚀 |
48bd06a
to
9f9aba5
Compare
9f9aba5
to
b051eb2
Compare
af751fc
to
234ea39
Compare
bbb6300
to
5af5547
Compare
This pull request has merge conflicts that must be resolved before it can be |
5af5547
to
8561146
Compare
Huh, it worked for me, maybe I didn't check the final changes |
eecc7d3
to
d0c0c45
Compare
Add config option to disable reshape pass. Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
- rename functionalization.py - add detailed comment about caching Also: - rename TODOs - format Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
… move VllmInductorPass into own file Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: Luka Govedic <[email protected]>
Signed-off-by: Luka Govedic <[email protected]>
d0c0c45
to
ee289bc
Compare
@ProExpertProg great job! |
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]> Signed-off-by: Tyler Michael Smith <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]> Signed-off-by: Maxime Fournioux <[email protected]>
Signed-off-by: luka <[email protected]> Signed-off-by: Luka Govedic <[email protected]>
This PR extracts the post_grad passes into
PostGradPassManager
. By overriding pickling of the class, we avoid failing to pickle custom passes, which is used in the code cache.It also extracts pass config into
PassConfig
and adds an option to disable the reshapes pass. It adds utilities for registering custom inductor passes, and extracts fix_functionalization into a proper pass.