Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

mlc-ai / mlc-llm Public

Notifications You must be signed in to change notification settings
Fork 1.6k
Star 19.4k

Code
Issues 210
Pull requests 12
Actions
Projects 2
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: mlc-ai/mlc-llm

Labels 13 Milestones 0

Labels 13 Milestones 0

New pull request New

12 Open 1,603 Closed

12 Open 1,603 Closed

Author

Filter by author

Loading

Label

Filter by label

Loading

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Loading

Milestones

Filter by milestone

Loading

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Loading

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Add prefix-cache dataset in mlc bench

#3065 opened Dec 13, 2024 by jinhongyii

Loading…

1

MicroServing Implementation

#3064 opened Dec 13, 2024 by jinhongyii

Loading…

4

[Model] Add support for OLMo architecture

#3046 opened Nov 24, 2024 by Lanssi

Loading…

7

[Bench] Add support for multiple backend

#3037 opened Nov 20, 2024 by cyx-6 • Draft

[Model] Add support for GPTJ architecture

#3012 opened Nov 4, 2024 by tlopex

Loading…

4

[SERVE][CPP][Android] add native executable program to benchmark models

#2987 opened Oct 18, 2024 by pfk-beta

Loading…

[Model] Add use_qk_norm option for Cohere model

#2877 opened Sep 2, 2024 by tlopex

Loading…

4

[Serving] PagedKVCache Quantization

#2663 opened Jul 16, 2024 by davidpissarra

Loading…

[Bench] Add bench for GSM8K eval

#2585 opened Jun 16, 2024 by Hzfengsy

Loading…

[Bench] Add bench for MMLU eval

#2584 opened Jun 16, 2024 by Hzfengsy

Loading…

Add docker container support

#1271 opened Nov 15, 2023 by Sing-Li

Loading…

5

Implement Whisper in new concise nn.Module API

#868 opened Sep 5, 2023 by LeshengJin

Loading…

6

ProTip! Add no:assignee to see everything that’s not assigned.

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.