Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question] #3059

Open
alexdsh opened this issue Dec 6, 2024 · 1 comment
Open

[Question] #3059

alexdsh opened this issue Dec 6, 2024 · 1 comment
Labels
question Question about the usage

Comments

@alexdsh
Copy link

alexdsh commented Dec 6, 2024

❓ General Questions

add the ability to load other models, except for those that are by default. Make a choice from the local storage. Is it possible to somehow limit the level of loading of the graphics core, to at least 90%, since when the model is running, the phone freezes completely, including stopping the interface update (I generally just have a clean screen, white).

@alexdsh alexdsh added the question Question about the usage label Dec 6, 2024
@ereish64
Copy link

I don't think CPU offloading is available at the moment (someone please correct me if I am wrong on this), however you can compile the model quantized so that it takes less memory (and processing power) if you haven't already. Try q4fp16 / 4 bit, floating point 16.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Question about the usage
Projects
None yet
Development

No branches or pull requests

2 participants