-
Notifications
You must be signed in to change notification settings - Fork 15
Pull requests: HabanaAI/vllm-hpu-extension
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add renormalize parameter for FusedMOE's & modify experts_max arg of mixture_of_experts()
#70
opened Jan 9, 2025 by
tangleintel
Loading…
[WIP] Add option to do group sum on TPC instead of MME
#64
opened Dec 20, 2024 by
mswiniarsk
•
Draft
ProTip!
Filter pull requests by the default branch with base:main.