Skip to content

Issues: mit-han-lab/qserve

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

How to test the accuracy?
#42 opened Oct 30, 2024 by lisuying214
Some questions about VLM quant
#40 opened Oct 23, 2024 by hanhanpp
How to add new models?
#33 opened Aug 23, 2024 by NicolasDrapier
RMSNorm implemented as LayerNorm
#32 opened Aug 21, 2024 by jason-huang03
Circular import error
#22 opened Jul 5, 2024 by LuckyLYM
support tp
#14 opened May 24, 2024 by cyLi-Tiger
activation quantization
#13 opened May 24, 2024 by hanhanpp
Question about the paper
#10 opened May 18, 2024 by jameswu2014
Source code
#3 opened May 9, 2024 by jph00
Is 8bit supported?
#2 opened May 8, 2024 by nivibilla
ProTip! Follow long discussions with comments:>50.