Skip to content

Commit

Permalink
Merge branch 'main' of https://github.com/OpenLMLab/MOSS into main
Browse files Browse the repository at this point in the history
  • Loading branch information
Hzfinfdu committed Apr 23, 2023
2 parents e92dae6 + eb19434 commit 5a2acc9
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -200,6 +200,7 @@ pip install triton

#### 模型量化

**目前仅支持单卡部署量化模型**
在显存受限的场景下,调用量化版本的模型可以显著降低推理成本。我们使用[GPTQ](https://github.com/IST-DASLab/gptq)算法和[GPTQ-for-LLaMa](https://github.com/qwopqwop200/GPTQ-for-LLaMa)中推出的OpenAI [triton](https://github.com/openai/triton) backend实现量化推理:

~~~python
Expand Down

0 comments on commit 5a2acc9

Please sign in to comment.