Skip to content

Pull requests: sgl-project/sglang

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Update ModelRunner Weights From Distributed
#2248 opened Nov 29, 2024 by zhaochenyang20 Loading…
3 tasks done
Support LoRA in Completion API
#2243 opened Nov 28, 2024 by bjmsong Loading…
3 tasks
Add a simple torch native attention backend
#2241 opened Nov 28, 2024 by YangQun1 Loading…
3 tasks
[FEAT] Support GGUF format
#2215 opened Nov 27, 2024 by zhengy001 Loading…
2 of 3 tasks
MoE Expert Parallel Impl enhancement New feature or request high priority
#2203 opened Nov 26, 2024 by xiaobochen123 Loading…
1 task
Support top n sigma sampling
#2192 opened Nov 26, 2024 by Snowdar Loading…
1 of 3 tasks
feat: add should_use_tensor_cores
#2179 opened Nov 25, 2024 by zhyncs Draft
3 tasks
test select concurrency
#2165 opened Nov 24, 2024 by qeternity Loading…
Speculative EAGLE2 high priority
#2150 opened Nov 24, 2024 by yukavio Loading…
Byhsu/fairness router
#2149 opened Nov 24, 2024 by ByronHsu Draft
3 tasks
feat: use cascade attention kernel (single level)
#2101 opened Nov 20, 2024 by james-p-xu Draft
1 of 3 tasks
Add log input text when using openai chat api await-response
#2058 opened Nov 17, 2024 by ccjincong Loading…
3 tasks done
[TEST] flashinfer version upgrade to v0.2.0
#2054 opened Nov 17, 2024 by james-p-xu Draft
3 tasks
regex stopping condition
#2035 opened Nov 14, 2024 by jancervenka Loading…
3 tasks done
[WIP] Use FlashInfer RoPE
#2016 opened Nov 12, 2024 by james-p-xu Loading…
3 tasks done
Debug studio await-response
#1831 opened Oct 29, 2024 by zolinthecow Loading…
3 tasks done
Function calling for OpenAI backend
#573 opened Jun 29, 2024 by Yiyun-Liang Loading…
ProTip! Add no:assignee to see everything that’s not assigned.