[Hardware][Intel-Gaudi] Enable LoRA support for Intel Gaudi (HPU) #10565
👋 Hi! Thank you for contributing to the vLLM project. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can do one of these:
Force-pushed from 8e40aec to 965c60a.
Can we do it like
I'd like to get a little more clarity on I see that classes like But here neither @jeejeelee Please correct me if I am missing something here.
@SanjuCSudhakaran Thanks for your feedback.
@jeejeelee Can you please confirm whether this is the Slack you mentioned? https://communityinviter.com/apps/vllm-dev/join-vllm-developers-slack It looks like this is it.
This pull request has merge conflicts that must be resolved before it can be |
Force-pushed from 965c60a to 0ab3ebf.
Force-pushed from 0ab3ebf to d51d66c.
Some minor changes. Thank you very much for your contribution and patience.
Force-pushed from d51d66c to 642c8cd.
Thank you very much for your contribution, and I especially appreciate your willingness to discuss the refactoring of the PunicaWrapper together.
@SanjuCSudhakaran could you please sync
Signed-off-by: Sanju C Sudhakaran <[email protected]>
Force-pushed from 642c8cd to 415b717.
…lm-project#10565) Signed-off-by: Sanju C Sudhakaran <[email protected]>
This PR enables support for LoRA on Intel Gaudi by adding HPU-specific kernels in PunicaWrapperHPU to handle LoRA computations more efficiently on the hardware.
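
For readers unfamiliar with what a Punica-style wrapper computes, here is a minimal plain-Python sketch of batched LoRA: each token's input is projected down to a low rank ("shrink"), projected back up ("expand"), and the result is added to the base layer output. This is only a reference for the math, not the HPU kernel from this PR. The function names, the dict-based adapter storage, and the use of -1 for "no adapter" are all assumptions made for illustration.

```python
# Illustrative reference only: plain-Python batched LoRA.
# Not the HPU kernel from this PR; all names here are hypothetical.


def matvec(vec, mat):
    """Multiply a row vector (length k) by a k x n matrix given as a list of rows."""
    return [sum(v * row[j] for v, row in zip(vec, mat)) for j in range(len(mat[0]))]


def apply_lora(x, base_out, lora_a, lora_b, token_lora_ids, scale=1.0):
    """Add each token's LoRA delta to the base layer output.

    x:              list of token inputs, each of length hidden_in
    base_out:       list of base outputs (x @ W), each of length hidden_out
    lora_a:         dict adapter_id -> hidden_in x rank matrix ("shrink")
    lora_b:         dict adapter_id -> rank x hidden_out matrix ("expand")
    token_lora_ids: adapter id per token; -1 means no adapter (assumed convention)
    """
    out = []
    for vec, base, lora_id in zip(x, base_out, token_lora_ids):
        if lora_id < 0:
            # Token has no adapter: pass the base output through unchanged.
            out.append(list(base))
            continue
        low_rank = matvec(vec, lora_a[lora_id])    # shrink: hidden_in -> rank
        delta = matvec(low_rank, lora_b[lora_id])  # expand: rank -> hidden_out
        out.append([b + scale * d for b, d in zip(base, delta)])
    return out


if __name__ == "__main__":
    x = [[1.0, 2.0], [3.0, 4.0]]
    base_out = [[0.0, 0.0], [1.0, 1.0]]
    lora_a = {0: [[1.0], [1.0]]}  # hidden_in=2, rank=1
    lora_b = {0: [[2.0, 3.0]]}    # rank=1, hidden_out=2
    print(apply_lora(x, base_out, lora_a, lora_b, token_lora_ids=[0, -1]))
```

A per-token loop like this is what a hardware-specific wrapper would typically replace with batched matrix operations suited to the device; how PunicaWrapperHPU does that is defined by the PR's code, not by this sketch.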