
Add instructions for using this app on a system with no GPU #19

Open

jmatthiesen opened this issue Jul 10, 2024 · 2 comments

jmatthiesen (Contributor) commented Jul 10, 2024

This sample works best on a system with a GPU, but it can run on a system without one if necessary. As we work on setup instructions for this app, we should include details on getting it working on GPU-less systems. The following are the steps I've had to take so far; I'll keep adding to this list until all the sample use cases work:

1. Update the Ollama bootstrapping code so it does not use a GPU. In the AppHost project's Program.cs, change this code (a fuller sketch of the file follows the snippet):
var chatCompletion = builder.AddOllama("chatcompletion").WithDataVolume();

to

var chatCompletion = builder.AddOllama("chatcompletion", enableGpu: false).WithDataVolume();
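For context, here is a minimal sketch of how the full AppHost Program.cs might look after this change; the builder boilerplate around the AddOllama call is an assumption based on the usual Aspire AppHost template, not copied from the sample:

var builder = DistributedApplication.CreateBuilder(args);

// enableGpu: false keeps the Ollama container from requesting GPU devices
// from Docker, so it can start on a host with no (supported) GPU.
var chatCompletion = builder.AddOllama("chatcompletion", enableGpu: false)
    .WithDataVolume();

builder.Build().Run();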
2. In the PythonInference project, change requirements.txt to use versions of the Torch libraries without CUDA (a quick way to verify the install follows the snippet). Change it from:
--extra-index-url https://download.pytorch.org/whl/cu118
torch==2.3.1+cu118
torchaudio==2.3.1+cu118
torchvision==0.18.1+cu118

to:

torch==2.3.1
torchaudio==2.3.1
torchvision==0.18.1
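After reinstalling the requirements, it's worth confirming that the CPU-only wheels are actually the ones in use. A quick check, assuming it runs inside the PythonInference project's environment:

import torch

# CPU-only wheels report a plain version string (no "+cu118" suffix),
# ship without a compiled-in CUDA runtime, and see no GPU.
print(torch.__version__)          # expected: 2.3.1, not 2.3.1+cu118
print(torch.version.cuda)         # expected: None on CPU-only builds
print(torch.cuda.is_available())  # expected: False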
3. Change the PythonInference project so that it does not use CUDA when loading models (an explicit-CPU variant follows the snippet). In the routers/classifier.py file, change this line from:
classifier = pipeline('zero-shot-classification', model='cross-encoder/nli-MiniLM2-L6-H768', device='cuda')

to:

classifier = pipeline('zero-shot-classification', model='cross-encoder/nli-MiniLM2-L6-H768')
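Leaving device unset lets transformers fall back to the CPU. If you'd rather make that explicit, the pipeline API also accepts device=-1 (or device='cpu') to pin inference to the CPU; a sketch of that variant:

from transformers import pipeline

# device=-1 is the transformers convention for CPU; on a machine with
# no usable GPU this is equivalent to omitting the argument.
classifier = pipeline(
    'zero-shot-classification',
    model='cross-encoder/nli-MiniLM2-L6-H768',
    device=-1,
)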
4. When running without a GPU, responses from the models will be slower, and the default timeout settings are not sufficient. I had to update the Extensions.cs file in my ServiceDefaults project to increase the standard resilience timeouts. The following worked for me, though some systems may need different values (a sketch of the surrounding file follows the snippet). In Extensions.AddServiceDefaults, change the StandardResilienceHandler registration from:
http.AddStandardResilienceHandler();

to:

http.AddStandardResilienceHandler(options =>
{
    options.AttemptTimeout = new HttpTimeoutStrategyOptions
    {
        Timeout = TimeSpan.FromMinutes(10)
    };
    options.TotalRequestTimeout = new HttpTimeoutStrategyOptions
    {
        Timeout = TimeSpan.FromMinutes(10)
    };
    options.CircuitBreaker.SamplingDuration = TimeSpan.FromMinutes(20);
});
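For reference, here is a sketch of where that registration sits in Extensions.cs; the surrounding method shape is an assumption based on the standard Aspire service-defaults template, not copied from the sample:

using Microsoft.Extensions.DependencyInjection;
using Microsoft.Extensions.Hosting;
using Microsoft.Extensions.Http.Resilience;

public static class Extensions
{
    public static IHostApplicationBuilder AddServiceDefaults(this IHostApplicationBuilder builder)
    {
        builder.Services.ConfigureHttpClientDefaults(http =>
        {
            http.AddStandardResilienceHandler(options =>
            {
                // CPU-only inference can take several minutes per request.
                options.AttemptTimeout = new HttpTimeoutStrategyOptions
                {
                    Timeout = TimeSpan.FromMinutes(10)
                };
                options.TotalRequestTimeout = new HttpTimeoutStrategyOptions
                {
                    Timeout = TimeSpan.FromMinutes(10)
                };

                // The options are validated so that the circuit breaker's
                // sampling window is at least twice the attempt timeout,
                // hence 20 minutes here.
                options.CircuitBreaker.SamplingDuration = TimeSpan.FromMinutes(20);
            });
        });

        return builder;
    }
}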
@kannan-cidc commented

@jmatthiesen thanks for this solution; it helped a lot.

@PureKrome commented

The README lists an NVIDIA GPU as a prerequisite. What about AMD GPUs, like the Radeon RX 7700S? Are AMD GPUs supported?
