From fb1759e3a3e1f77a41a0702602b70ff82ff282fd Mon Sep 17 00:00:00 2001
From: cenzhiyao <2523403608@qq.com>
Date: Wed, 18 Dec 2024 22:24:41 +0800
Subject: [PATCH] update vllm-gh200-openai Image size: 3.23GB

---
 docs/source/serving/deploying_with_docker.rst | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/source/serving/deploying_with_docker.rst b/docs/source/serving/deploying_with_docker.rst
index 574081b112a59..7aceb49463449 100644
--- a/docs/source/serving/deploying_with_docker.rst
+++ b/docs/source/serving/deploying_with_docker.rst
@@ -51,7 +51,7 @@ of PyTorch Nightly and should be considered **experimental**. Using the flag `--
 
 .. code-block:: console
 
-    # Example of building on Nvidia GH200 server. (Memory usage: ~15GB, Build time: ~1475s / ~25 min, Image size: 7.26GB)
+    # Example of building on Nvidia GH200 server. (Memory usage: ~15GB, Build time: ~1475s / ~25 min, Image size: 3.23GB)
     $ python3 use_existing_torch.py
     $ DOCKER_BUILDKIT=1 sudo docker build . \
     --target vllm-openai \