
Pull requests: neuralmagic/vllm


Pull requests list

Add: Support for Sparse24Bitmask Compressed Models
#47 opened Dec 17, 2024 by rahul-tuli
1 task
merged
#46 opened Dec 15, 2024 by robertgshaw2-neuralmagic
Logprobs
#45 opened Dec 15, 2024 by robertgshaw2-neuralmagic
Proto
#44 opened Dec 12, 2024 by robertgshaw2-neuralmagic
Cutlass grouped gemm
#42 opened Dec 10, 2024 by ElizaWszola
[DRAFT] use cutlass for 24
#33 opened Nov 15, 2024 by rahul-tuli Draft
Semi structured v2
#32 opened Nov 13, 2024 by ilmarkov
Add hf_transfer to testing image
#29 opened Nov 6, 2024 by mgoin
Hqq support
#21 opened Oct 14, 2024 by ElizaWszola Draft
Update cpu_extension.cmake (label: stale)
#12 opened Sep 23, 2024 by ProExpertProg
test
#7 opened Aug 28, 2024 by robertgshaw2-neuralmagic Draft