Commit

[Hardware][CPU] Cross-attention and Encoder-Decoder models support on CPU backend (vllm-project#9089)
Isotr0py authored Oct 7, 2024
1 parent 8c6de96 commit 4f95ffe
Showing 6 changed files with 834 additions and 287 deletions.
1 change: 1 addition & 0 deletions .buildkite/run-cpu-test.sh
@@ -23,6 +23,7 @@ docker exec cpu-test-avx2 bash -c "python3 examples/offline_inference.py"
# Run basic model test
docker exec cpu-test bash -c "
pip install pytest matplotlib einops transformers_stream_generator datamodel_code_generator
pytest -v -s tests/models/encoder_decoder/language
pytest -v -s tests/models/decoder_only/language \
--ignore=tests/models/test_fp8.py \
--ignore=tests/models/decoder_only/language/test_jamba.py \
