[Model Request] 3 bit, 7B Luna-AI-Llama2-Uncensored-GGML #633
Comments
Hi @ArtemGordinsky, thanks for reaching out! Could you try following this tutorial to compile it?
I compiled it for you, just add this URL to download the model and it should work:
@acalatrava Thank you!
My bad... The problem is that the public MLC-LLM iPhone app does not include the library for this model, so loading the model crashes the app because it can't find that library. Every time you create a model, you have to package the app with the library for that model. If you have a Mac, you should be able to compile and install the app by following the instructions in the docs.
Now that I think more about it, it may work by modifying the Unfortunately I cannot test it since I only have an iPhone 12...
@acalatrava This worked, thank you! 🙌
Great! It seems that I should read the docs too since it’s explained here https://mlc.ai/mlc-llm/docs/get_started/mlc_chat_config.html#configure-mlc-chat-json 😅
Great model! Thank you!
Hi @acalatrava
⚙️ Request New Models
Additional context
Hi,
I've tried running this model on my iPhone 13 Pro Max, but it crashes instantly:
https://huggingface.co/mlc-ai/mlc-chat-llama2-7b-chat-uncensored-q4f16_1
However, this one runs really well:
https://huggingface.co/mlc-ai/mlc-chat-Llama-2-7b-chat-hf-q3f16_1
I think 7b-chat-uncensored would work as well if it were quantized at 3 bits instead of 4. Would you be able to add it to your HuggingFace space? I'm also open to doing this myself, but the last time I tried, quantization did not work on my M1 MacBook using llama.cpp.
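For a rough sense of how much going from 4 bits to 3 bits changes the weight footprint, here is a back-of-the-envelope sketch in Python. It assumes a nominal 7 billion parameters and counts only the quantized weights; quantization scales, the KV cache, and runtime overhead are ignored, so real memory use will be higher. The function name `weight_size_gib` is made up for illustration, and nothing here shows that the crash is actually memory related.

```python
def weight_size_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights alone, in GiB."""
    return num_params * bits_per_weight / 8 / 2**30


# Assumption: "7B" is treated as exactly 7e9 parameters.
NUM_PARAMS = 7e9

q3 = weight_size_gib(NUM_PARAMS, 3)  # 3-bit weights
q4 = weight_size_gib(NUM_PARAMS, 4)  # 4-bit weights

print(f"3-bit: {q3:.2f} GiB, 4-bit: {q4:.2f} GiB, saved: {q4 - q3:.2f} GiB")
```

Under these assumptions the weights come to roughly 2.4 GiB at 3 bits versus 3.3 GiB at 4 bits, so the 3-bit build is about a quarter smaller.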