Skip to content

Commit

Permalink
fix: vllm e2e dependencies and env_vars
Browse files Browse the repository at this point in the history
  • Loading branch information
YrrepNoj committed Jul 26, 2024
1 parent 8acaff6 commit 6092128
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 0 deletions.
1 change: 1 addition & 0 deletions .github/workflows/e2e-vllm.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -69,6 +69,7 @@ jobs:
run: |
python -m pip install "."
python -m pip install -U "huggingface_hub[cli,hf_transfer]"
python -m pip install confz
- name: Setup UDS Environment
uses: defenseunicorns/uds-common/.github/actions/setup@05f42bb3117b66ebef8c72ae050b34bce19385f5
Expand Down
3 changes: 3 additions & 0 deletions packages/vllm/zarf.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -25,6 +25,7 @@ components:
- "vllm-values.yaml"
images:
- ghcr.io/defenseunicorns/leapfrogai/vllm:###ZARF_PKG_TMPL_IMAGE_VERSION###
- cgr.dev/chainguard/bash:latest
dataInjections:
- source: .model/
target:
Expand All @@ -41,3 +42,5 @@ components:
env:
- LAI_REPO_ID=TheBloke/Synthia-7B-v2.0-GPTQ
- LAI_REVISION=gptq-4bit-32g-actorder_True
- LAI_QUANTIZATION=gptq
- LAI_HF_HUB_ENABLE_HF_TRANSFER=1

0 comments on commit 6092128

Please sign in to comment.