Fix for docs on deepspeed inference #482

josemduarte · 2024-08-26T23:08:28Z

The main issue is that the parameter name did not coincide with the code.

Question: could the deepspeed extra dependency on CUTLASS be baked into the docker image? I could give it a try but I'm wondering if someone has come across issues with that.

josemduarte · 2024-08-30T17:42:09Z

Question: could the deepspeed extra dependency on CUTLASS be baked into the docker image? I could give it a try but I'm wondering if someone has come across issues with that.

This patch would do the basic setup (that would still require compilation at first run): josemduarte@0183be7 .

ljarosch · 2024-12-07T04:42:12Z

Good catch on the wrong parameter name! CUTLASS should be installed with the install_third_party_dependencies.sh script though, which is already in the docs (see Step 3).

You're right though that this step is currently missing from the Docker image, we will update that.

ljarosch · 2024-12-07T04:43:31Z

docs/source/Inference.md

@@ -147,7 +147,7 @@ Some commonly used command line flags are here. A full list of flags can be view

 The **DeepSpeed DS4Sci_EvoformerAttention kernel** is a memory-efficient attention kernel developed as part of a collaboration between OpenFold and the DeepSpeed4Science initiative. 

-If your system supports deepseed, using deepspeed generally leads an inference speedup of 2 - 3x without significant additional memory use. You may specify this option by selecting the `--use_deepspeed_inference` argument. 
+If your system supports deepspeed, using deepspeed generally leads an inference speedup of 2 - 3x without significant additional memory use. You may specify this option by selecting the `--use_deepspeed_evoformer_attention` argument. An additional requirement for this option is the [CUTLASS repository](https://github.com/NVIDIA/cutlass). You will need to clone it and set environment variable `CUTLASS_PATH` to point to it, see [instructions](https://www.deepspeed.ai/tutorials/ds4sci_evoformerattention/).


@josemduarte Would you mind removing the line about CUTLASS given that it should be covered by the standard setup already?

Fix to docs on deepspeed for inference

c03d9a6

ljarosch assigned ljarosch and christinaflo Dec 7, 2024

ljarosch requested changes Dec 7, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for docs on deepspeed inference #482

Fix for docs on deepspeed inference #482

josemduarte commented Aug 26, 2024

josemduarte commented Aug 30, 2024

ljarosch commented Dec 7, 2024

ljarosch Dec 7, 2024

Fix for docs on deepspeed inference #482

Are you sure you want to change the base?

Fix for docs on deepspeed inference #482

Conversation

josemduarte commented Aug 26, 2024

josemduarte commented Aug 30, 2024

ljarosch commented Dec 7, 2024

ljarosch Dec 7, 2024

Choose a reason for hiding this comment