Support simulation of 4090 cards #8

Enoch2090 · 2024-05-10T12:49:30Z

Since 4090 really has an edge on inferencing with smaller models - is it possible to add data for 4090 cards? Thanks!

AgrawalAmey · 2024-05-10T13:10:24Z

Hi @Enoch2090, thanks a lot of your interest in the project. We definitely want to extend support to as many devices as possible. Unfortunately, I don't have access to 4090 cards, but if you are willing to contribute we could share all the profiling instructions. Thanks again!

teds-lin · 2024-05-11T14:30:18Z

Hi @Enoch2090, thanks a lot of your interest in the project. We definitely want to extend support to as many devices as possible. Unfortunately, I don't have access to 4090 cards, but if you are willing to contribute we could share all the profiling instructions. Thanks again!

Based on my current understanding, one of the key points for adding new GPU card support is collecting profiling data. Will you publish the relevant code and implementation steps? Thank!

AgrawalAmey · 2024-05-12T08:09:48Z

@teds-lin yes, the profiling code is already in the repo. We will add appropriate documentation soon. Thanks!

Enoch2090 · 2024-05-12T08:16:59Z

@AgrawalAmey Thanks for sharing! Would also be great if the document can point out approximately how much data profiled (or equivalently, how many GPU hours) is needed to produce accurate results.

AgrawalAmey · 2024-05-15T21:45:58Z

@teds-lin @Enoch2090 We have added the documentation to add profiling data for new models/devices. Adding a new model should take less than 15ish mins. Adding a new device across all 6 models should take less than a couple of hours. Please have a look https://github.com/microsoft/vidur/blob/main/vidur/profiling/README.md and let us know if you have any questions.

lhpp1314 · 2024-10-15T09:49:35Z

@AgrawalAmey The profiling link https://github.com/microsoft/vidur/blob/main/vidur/profiling/README.md is not available now. Can you give a new one?

nitinkedia7 · 2024-10-15T11:52:54Z

Hi @lhpp1314, the Supported Models section in the project README has the updated link for profiling. Please use the central readme to find links to other documentation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support simulation of 4090 cards #8

Support simulation of 4090 cards #8

Enoch2090 commented May 10, 2024

AgrawalAmey commented May 10, 2024

teds-lin commented May 11, 2024

AgrawalAmey commented May 12, 2024

Enoch2090 commented May 12, 2024

AgrawalAmey commented May 15, 2024

lhpp1314 commented Oct 15, 2024

nitinkedia7 commented Oct 15, 2024

Support simulation of 4090 cards #8

Support simulation of 4090 cards #8

Comments

Enoch2090 commented May 10, 2024

AgrawalAmey commented May 10, 2024

teds-lin commented May 11, 2024

AgrawalAmey commented May 12, 2024

Enoch2090 commented May 12, 2024

AgrawalAmey commented May 15, 2024

lhpp1314 commented Oct 15, 2024

nitinkedia7 commented Oct 15, 2024