Replies: 1 comment 1 reply
-
did you try reducing your n_ctx? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am attempting to get it to work with cuda on ubuntu linux but I am running out of vram even though I should have enough.
This was how I installed it
When I try to run the llm
I get the error even though I have an 8GB 3070 RTX and the model size is 6.14GB
It crashes at these lines
How do I fix this?
Beta Was this translation helpful? Give feedback.
All reactions