-
Notifications
You must be signed in to change notification settings - Fork 71
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Not enough SMs on RTX 2080 #117
Comments
Thank you for the quick reponse, the model runs fine with float or float16. The only thing that is strange is that the execution time seems to increase with the batch size when using float. unit is ms per image at image size of ~500x500 pixel float 32 - 10 images calculated float 16 - 10 images calculated Anyway thanks again. |
@qub3s - Ah, yes. That's expected, because a bigger batch means you need to allocate more memory throughout model execution. Also see the |
Hi all I was try to run amg_example.py on 2080TI too , I know the triton kernel is only support A100 ,so according the ReadMe file its need to set the environment variable SEGMENT_ANYTHING_FAST_USE_FLASH_4=0, here is my code import OS but its still have miss the triton module error , Did I do something wrong? Or have any suggestions? thanks you |
Hey
I know that you guys optimized this project for the A100, and i read that people got the 4090 and the 3090 running. I am only able to work with 2080s (University).
When i try to run your code (amg_example.py), im getting the following errors :
torch._inductor.utils: [WARNING] not enough SMs to use max_autotune_gemm mode
followed by a bunch of "code" and then:
BackendCompilerFailed: backend='inductor' raised:
RuntimeError: Internal Triton PTX codegen error:
ptxas /tmp/compile-ptx-src-76618e, line 149; error : Feature '.bf16' requires .target sm_80 or higher
ptxas /tmp/compile-ptx-src-76618e, line 149; error : Feature 'cvt with .f32.bf16' requires .target sm_80 or higher
(.....)
ptxas /tmp/compile-ptx-src-76618e, line 200; error : Feature '.bf16' requires .target sm_80 or higher
ptxas /tmp/compile-ptx-src-76618e, line 200; error : Feature 'cvt with .f32.bf16' requires .target sm_80 or higher
ptxas fatal : Ptx assembly aborted due to error
Is it just a shortcoming of my hardware or is there anything i am doing wrong.
PS: the Original model runs fine and your project runs as well if i use "sam_model_registry" (i guess that is just the meta implementation)
Thank you.
The text was updated successfully, but these errors were encountered: