Skip to content

Pull requests: mlc-ai/mlc-llm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Add prefix-cache dataset in mlc bench
#3065 opened Dec 13, 2024 by jinhongyii Loading…
MicroServing Implementation
#3064 opened Dec 13, 2024 by jinhongyii Loading…
[Model] Add support for OLMo architecture
#3046 opened Nov 24, 2024 by Lanssi Loading…
[Bench] Add support for multiple backend
#3037 opened Nov 20, 2024 by cyx-6 Draft
[Model] Add support for GPTJ architecture
#3012 opened Nov 4, 2024 by tlopex Loading…
[Model] Add use_qk_norm option for Cohere model
#2877 opened Sep 2, 2024 by tlopex Loading…
[Serving] PagedKVCache Quantization
#2663 opened Jul 16, 2024 by davidpissarra Loading…
[Bench] Add bench for GSM8K eval
#2585 opened Jun 16, 2024 by Hzfengsy Loading…
[Bench] Add bench for MMLU eval
#2584 opened Jun 16, 2024 by Hzfengsy Loading…
Add docker container support
#1271 opened Nov 15, 2023 by Sing-Li Loading…
Implement Whisper in new concise nn.Module API
#868 opened Sep 5, 2023 by LeshengJin Loading…
ProTip! Add no:assignee to see everything that’s not assigned.