The idea is that gpu0 would do all the compute, but once gpu0's VRAM is full, anything above that gets offloaded to gpu1. gpu1 wouldn't be used to sample any images; its only purpose would be to hold the overflow data, the same way shared memory (i.e. system RAM) already does, while gpu0 still processes it. Or would that end up even slower than having system RAM hold it, given say the ~8GB/s transfer rate of a PCIe gen 4 x4 bifurcated slot?
Hmm, interesting idea, but I'm unsure how you'd force comfy to manage that internally. Like, you can set where the model goes when it's not on the main GPU via the "offload_device" param of the model patcher, but that only allows you to specify one device. For this you'd have to make the actual backend multi-GPU aware so it doesn't try to unload the entire model onto your second GPU (which would just OOM unless the second GPU can fit the entire model). Maybe if you mess with the load function, but yeah, no clue.
I guess if you have 2x3090s you could try it by editing the end of the UnetLoaderGGUF class in nodes.py to look like this:
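Roughly something along these lines (untested sketch, not the supported way to do it; it assumes your copy of nodes.py ends `load_unet()` by cloning the model into a `GGUFModelPatcher`, and that ComfyUI's `ModelPatcher` exposes `load_device` / `offload_device` attributes — adjust if your version differs):

```python
# torch is already imported at the top of nodes.py
import torch

        # ...end of UnetLoaderGGUF.load_unet(), everything above the patcher
        # clone stays exactly as it is in your current nodes.py:
        model = GGUFModelPatcher.clone(model)

        # Hack: keep sampling on cuda:0, but point the offload target at the
        # second GPU instead of system RAM. Whenever comfy decides to offload
        # the model, the weights land on cuda:1. With 2x3090s the whole model
        # fits on the second card, which is the only reason this is worth
        # trying at all (see the OOM caveat above).
        model.load_device = torch.device("cuda:0")
        model.offload_device = torch.device("cuda:1")
        return (model,)
```

No idea whether comfy's memory management actually respects that without also touching the load function mentioned above, so treat it as an experiment rather than a real fix.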