-
Notifications
You must be signed in to change notification settings - Fork 545
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Update ModelRunner Weights From Distributed
#2248
opened Nov 29, 2024 by
zhaochenyang20
Loading…
3 tasks done
move vllm distributed module (v0.6.4.post1) to sglang
high priority
#2244
opened Nov 28, 2024 by
yizhang2077
Loading…
5 tasks
[benchmark] Add fused_moe_triton benchmark and tuning tools
#2225
opened Nov 27, 2024 by
BBuf
Loading…
Update model_loader deps and qqq quantization deps
high priority
#2220
opened Nov 27, 2024 by
HandH1998
Loading…
MoE Expert Parallel Impl
enhancement
New feature or request
high priority
#2203
opened Nov 26, 2024 by
xiaobochen123
Loading…
1 task
feat: use cascade attention kernel (single level)
#2101
opened Nov 20, 2024 by
james-p-xu
•
Draft
1 of 3 tasks
Add log input text when using openai chat api
await-response
#2058
opened Nov 17, 2024 by
ccjincong
Loading…
3 tasks done
Surpport kv cache int8/int4 for triton backend
await-response
#1644
opened Oct 12, 2024 by
yuguo-Jack
Loading…
ProTip!
Add no:assignee to see everything that’s not assigned.