Files

.buildkite
.github
benchmarks
- cutlass_benchmarks
- disagg_benchmarks
- fused_kernels
- kernels
- overheads
- structured_schemas
- README.md
- backend_request_func.py
- benchmark_guided.py
- benchmark_latency.py
- benchmark_long_document_qa_throughput.py
- benchmark_prefix_caching.py
- benchmark_prioritization.py
- benchmark_serving.py
- benchmark_serving_guided.py
- benchmark_throughput.py
- launch_tgi_server.sh
- sonnet.txt
cmake
csrc
docs
examples
tests
tools
vllm
.clang-format
.dockerignore
.gitignore
.readthedocs.yaml
.shellcheckrc
.yapfignore
CMakeLists.txt
CODE_OF_CONDUCT.md
CONTRIBUTING.md
DCO
Dockerfile
Dockerfile.arm
Dockerfile.cpu
Dockerfile.hpu
Dockerfile.neuron
Dockerfile.openvino
Dockerfile.ppc64le
Dockerfile.rocm
Dockerfile.tpu
Dockerfile.xpu
LICENSE
MANIFEST.in
README.md
SECURITY.md
collect_env.py
find_cuda_init.py
format.sh
pyproject.toml
python_only_dev.py
requirements-build.txt
requirements-common.txt
requirements-cpu.txt
requirements-cuda.txt
requirements-dev.txt
requirements-hpu.txt
requirements-lint.txt
requirements-neuron.txt
requirements-openvino.txt
requirements-rocm.txt
requirements-test.in
requirements-test.txt
requirements-tpu.txt
requirements-xpu.txt
setup.py
use_existing_torch.py

benchmarks

ApostaC

and

KuntaiDu

[Benchmark] Add benchmark script for CPU offloading (vllm-project#11533 )

Jan 1, 2025

0c6f998 · Jan 1, 2025

History

This branch is 96 commits behind vllm-project/vllm:main.

Name		Name	Last commit message	Last commit date
parent directory ..
cutlass_benchmarks		cutlass_benchmarks
disagg_benchmarks		disagg_benchmarks
fused_kernels		fused_kernels
kernels		kernels
overheads		overheads
structured_schemas		structured_schemas
README.md		README.md
backend_request_func.py		backend_request_func.py
benchmark_guided.py		benchmark_guided.py
benchmark_latency.py		benchmark_latency.py
benchmark_long_document_qa_throughput.py		benchmark_long_document_qa_throughput.py
benchmark_prefix_caching.py		benchmark_prefix_caching.py
benchmark_prioritization.py		benchmark_prioritization.py
benchmark_serving.py		benchmark_serving.py
benchmark_serving_guided.py		benchmark_serving_guided.py
benchmark_throughput.py		benchmark_throughput.py
launch_tgi_server.sh		launch_tgi_server.sh
sonnet.txt		sonnet.txt

README.md

Benchmarking vLLM

Downloading the ShareGPT dataset

You can download the dataset by running:

wget https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered/resolve/main/ShareGPT_V3_unfiltered_cleaned_split.json

Downloading the ShareGPT4V dataset

The json file refers to several image datasets (coco, llava, etc.). The benchmark scripts will ignore a datapoint if the referred image is missing.

wget https://huggingface.co/datasets/Lin-Chen/ShareGPT4V/resolve/main/sharegpt4v_instruct_gpt4-vision_cap100k.json
mkdir coco -p
wget http://images.cocodataset.org/zips/train2017.zip -O coco/train2017.zip
unzip coco/train2017.zip -d coco/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

benchmarks

benchmarks

README.md

Benchmarking vLLM

Downloading the ShareGPT dataset

Downloading the ShareGPT4V dataset

Files

benchmarks

Directory actions

More options

Directory actions

More options

Latest commit

History

benchmarks

Folders and files

parent directory

README.md

Benchmarking vLLM

Downloading the ShareGPT dataset

Downloading the ShareGPT4V dataset