Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CUDA error: CUBLAS_STATUS_NOT_SUPPORTED #37

Open
Captain-SeaL opened this issue Aug 23, 2024 · 10 comments
Open

CUDA error: CUBLAS_STATUS_NOT_SUPPORTED #37

Captain-SeaL opened this issue Aug 23, 2024 · 10 comments

Comments

@Captain-SeaL
Copy link

ERROR diffusion_model.output_blocks.5.1.transformer_blocks.1.ff.net.2.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.5.1.transformer_blocks.1.attn2.to_q.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.5.1.transformer_blocks.1.attn2.to_k.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.5.1.transformer_blocks.1.attn2.to_v.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.5.1.transformer_blocks.1.attn2.to_out.0.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.5.1.proj_out.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.5.2.conv.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.6.0.in_layers.2.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.6.0.emb_layers.1.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.6.0.out_layers.3.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.6.0.skip_connection.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.7.0.in_layers.2.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.7.0.emb_layers.1.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.7.0.out_layers.3.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.7.0.skip_connection.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.8.0.in_layers.2.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.8.0.emb_layers.1.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.8.0.out_layers.3.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
ERROR diffusion_model.output_blocks.8.0.skip_connection.weight CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)

@lshqqytiger
Copy link
Owner

lshqqytiger commented Aug 24, 2024

sgemm is already implemented. Please check whether ZLUDA BLAS is loaded and is being used. (cublas.dll on Windows, libcublas.so on Linux)

@Captain-SeaL
Copy link
Author

tasklist /m cublas.dll
i write this in cmd if i understood right and it says what can't find it
Do I need to replace some dlls in the python libraries?

@lshqqytiger
Copy link
Owner

What application are you trying?

@Captain-SeaL
Copy link
Author

Fooocus

@lshqqytiger
Copy link
Owner

lshqqytiger commented Aug 24, 2024

Follow the second paragraph of ZLUDA PyTorch instruction.

cublas64_*.dll
cusparse64_*.dll
nvrtc64_*_*.dll

@CS1o
Copy link

CS1o commented Sep 14, 2024

Fooocus

Here you find a install Guide for Fooocus with ZLUDA. I tested it a few minutes ago.
https://github.com/CS1o/Stable-Diffusion-Info/wiki/Installation-Guides

@HysterLc
Copy link

Follow the second paragraph of ZLUDA PyTorch instruction.

cublas64_*.dll
cusparse64_*.dll
nvrtc64_*_*.dll

Error loading caffe2_nvrtc.dll or its dependencied after replace the three dll

@lshqqytiger
Copy link
Owner

Make sure that you have

  1. AMD GPU driver (amdhip64.dll, or amdhip64_6.dll with HIP SDK 6.1)
  2. HIP SDK (rocblas.dll, rocsolver.dll, rocsparse.dll, and hiprtc0601.dll (0507 in HIP SDK 5.7))
  3. Microsoft Visual C Runtime (vcruntime140.dll)

@CS1o
Copy link

CS1o commented Nov 15, 2024

Error loading caffe2_nvrtc.dll or its dependencied after replace the three dll

That error is caused by any Python Version that got installed through the Microsoft Store.
To fix it, uninstall all Python Versions you have under the Windows Settings/APPS.
Then install Python 3.10.11 64bit with its normal installer from here: https://www.python.org/downloads/release/python-31011/
Check "Add Python to Path" and reboot the System.

Open up a cmd and type
python --version
and
where python

Verify that the Path to Python 3.10.11 is at the top.

@HysterLc
Copy link

Error loading caffe2_nvrtc.dll or its dependencied after replace the three dll

That error is caused by any Python Version that got installed through the Microsoft Store. To fix it, uninstall all Python Versions you have under the Windows Settings/APPS. Then install Python 3.10.11 64bit with its normal installer from here: https://www.python.org/downloads/release/python-31011/ Check "Add Python to Path" and reboot the System.

Open up a cmd and type python --version and where python

Verify that the Path to Python 3.10.11 is at the top.

It's a little better, but new error emerge
Error loading "C:\Users\M1175\AppData\Local\Programs\Python\Python310\lib\site-packages\torch\lib\cusolver64_11.dll" or one of its dependencies.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants