sgl-project / sglang Public

Fork 714
Star 7.4k

Pull requests: sgl-project/sglang

42 Open 1,919 Closed

Pull requests list

Add config to switch from vllm custom allreduce to sgl_kernel custom allreduce

#2981 opened Jan 19, 2025 by yizhang2077

4 tasks

[Fix] Address remaining issues of supporting MiniCPMV

#2977 opened Jan 19, 2025 by mickqian • Draft

3 tasks done

fix deepseek v2 with cpu device

#2975 opened Jan 19, 2025 by zhyncs

4 tasks

[MOE] try to optimize cu kernel single block execution - distribute cumsum workload from thread 0 to other threads

#2970 opened Jan 19, 2025 by yiakwy-xpu-ml-framework-team

3 of 4 tasks

[Core] Optimize the delay scheduling of in batch prefix caching

#2962 opened Jan 18, 2025 by MrAta • Draft

4 tasks

Refactor engine and API server

#2959 opened Jan 18, 2025 by fzyzcjy

4 tasks

[EAGLE] Fix some boundary cases when retracting reqs and when a req's max token = 1

#2939 opened Jan 17, 2025 by josephydu

Test removing a branch logic

#2905 opened Jan 15, 2025 by rkooo567

3 tasks

Integration of TurboMind AWQ

#2900 opened Jan 15, 2025 by bjmsong

3 tasks

[Feature] Support dynamic loading and unloading of Lora adapters

#2891 opened Jan 14, 2025 by Fridge003 • Draft

1 of 3 tasks

support triton backend int8 kvcache

#2864 opened Jan 13, 2025 by sleepcoo • Draft

[DO NOT MERGE] Merged PRs for verl integration

#2849 opened Jan 13, 2025 by fzyzcjy • Draft

3 tasks

Support direct weight loading

#2845 opened Jan 12, 2025 by fzyzcjy

3 tasks done

[#2812] Make the decode status dict capacity adjustable by a CLI param

#2839 opened Jan 11, 2025 by seungduk-yanolja

2 of 3 tasks

Support distributed tensor when updating weights

#2831 opened Jan 10, 2025 by fzyzcjy

3 tasks done

Support custom device mesh for tensor parallel workers

#2827 opened Jan 10, 2025 by fzyzcjy

3 tasks done

Use CUDA_VISIBLE_DEVICES instead of gpu_id variables everywhere.

#2824 opened Jan 10, 2025 by heiner

1 task done

Improve the mixed chunk prefill by launching two kernels

#2811 opened Jan 9, 2025 by libratiger • Draft

1 of 3 tasks

[WIP] [Feature] Support Deepseek-VL2

Labels: enhancement (New feature or request)

#2798 opened Jan 8, 2025 by ccw1996 • Draft

3 tasks

Add endpoint for file support, purely to speed up processing of input_embeds.

#2797 opened Jan 8, 2025 by RinRin-32

2 of 3 tasks

Allow multi SGLang engines to coordinate

#2791 opened Jan 8, 2025 by fzyzcjy

3 tasks done

Speculative decoding with lookahead

Labels: enhancement (New feature or request), high priority

#2790 opened Jan 8, 2025 by jjjjohnson

3 tasks done

Update doc for server arguments

#2742 opened Jan 5, 2025 by simveit

1 task

WIP: Feature/function calling update

#2700 opened Jan 2, 2025 by YAMY1234

[Feature] Support regex as a stopping condition

#2699 opened Jan 2, 2025 by mickqian

3 tasks done
