diff --git a/docs/source/serving/deploying_with_docker.rst b/docs/source/serving/deploying_with_docker.rst index 574081b112a59..7aceb49463449 100644 --- a/docs/source/serving/deploying_with_docker.rst +++ b/docs/source/serving/deploying_with_docker.rst @@ -51,7 +51,7 @@ of PyTorch Nightly and should be considered **experimental**. Using the flag `-- .. code-block:: console - # Example of building on Nvidia GH200 server. (Memory usage: ~15GB, Build time: ~1475s / ~25 min, Image size: 7.26GB) + # Example of building on Nvidia GH200 server. (Memory usage: ~15GB, Build time: ~1475s / ~25 min, Image size: 3.23GB) $ python3 use_existing_torch.py $ DOCKER_BUILDKIT=1 sudo docker build . \ --target vllm-openai \