
[Bug] #123

Closed

yomi1117 opened this issue Dec 31, 2024 · 4 comments

@yomi1117
Environment

The environment is set up correctly.

Describe the bug

OOM... I know video distillation needs many GPUs. I used 8 A800s to run the distillation training of Hunyuan, but I got an OOM error. How many GPUs do we need to run the distill...

Reproduction

None

@foreverpiano
Collaborator

8 GPUs are enough. Can you share your script?

@jzhang38
Collaborator

jzhang38 commented Jan 1, 2025

I just updated the script for 8 GPUs. Can you try it here:

# If you do not have 32 GPUs and need to fit in memory, you can: 1. increase sp_size. 2. reduce num_latent_t
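
For illustration only, a single-node 8-GPU launch that applies those two knobs might look like the sketch below. The entry-point path and the exact flag spellings (--sp_size, --num_latent_t) are assumptions inferred from the comment above rather than copied from the repo, and the values are placeholders, so check the updated script for the real ones.

# Hypothetical sketch, not the repo's actual launch script.
# A larger sp_size shards the sequence across more GPUs, and a smaller
# num_latent_t shortens the latent video; both lower per-GPU memory.
torchrun --nnodes 1 --nproc_per_node 8 \
    fastvideo/distill.py \
    --sp_size 8 \
    --num_latent_t 24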

@yomi1117
Author

yomi1117 commented Jan 1, 2025

Thank you for your help. I'm curious how many GPUs and what config are needed to train FastHunyuan. Thank you!

@jzhang38
Collaborator

jzhang38 commented Jan 2, 2025

This should be the exact command to reproduce FastHunyuan:

torchrun --nnodes 4 --nproc_per_node 8 \
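
The remaining arguments were cut off in the quote above. Purely as an illustrative skeleton of such a 4-node x 8-GPU (32-GPU) launch, and not the exact FastHunyuan command, each node would typically also pass rendezvous settings like the ones below; --node_rank, the master address and port, and the script path are placeholders.

# Illustrative multi-node skeleton only. Run on every node, changing
# --node_rank from 0 to 3; MASTER_ADDR is the IP of the rank-0 node.
torchrun --nnodes 4 --nproc_per_node 8 \
    --node_rank 0 \
    --master_addr "$MASTER_ADDR" \
    --master_port 29500 \
    fastvideo/distill.py
    # ...followed by the distillation arguments from the updated script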

@rlsu9 closed this as completed Jan 4, 2025