Replies: 16 comments
-
InterLM L-5: LMDeploy 大模型量化部署实践 基础作业打卡~ |
Beta Was this translation helpful? Give feedback.
-
https://h90ag9106t.feishu.cn/docx/X8eudSxnBowylDxETKcc75JVnyb |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
https://github.com/RyanWu31/tutorial_notebook/blob/main/README.md |
Beta Was this translation helpful? Give feedback.
-
基础作业:
|
Beta Was this translation helpful? Give feedback.
-
基础作业: |
Beta Was this translation helpful? Give feedback.
-
基础作业: |
Beta Was this translation helpful? Give feedback.
-
第五节课作业 |
Beta Was this translation helpful? Give feedback.
-
基础作业: 进阶作业(可选做): (3)在(1)的基础上开启KV Cache量化 |
Beta Was this translation helpful? Give feedback.
-
基础作业:生成300字小故事 |
Beta Was this translation helpful? Give feedback.
-
在下方讨论区提交作业(图片/链接形式均可)~
基础作业:
进阶作业(可选做):
(1)TurboMind推理+Python代码集成
(2)在(1)的基础上采用W4A16量化
(3)在(1)的基础上开启KV Cache量化
(4)在(2)的基础上开启KV Cache量化
(5)使用Huggingface推理
⭐备注:由于进阶作业较难,完成基础作业之后就可以先提交作业了,在后续的大作业项目中使用这些技术将作为重要的加分点!
Beta Was this translation helpful? Give feedback.
All reactions