
[Model Request] 3 bit, 7B Luna-AI-Llama2-Uncensored-GGML #633

Closed
artemgordinskiy opened this issue Aug 1, 2023 · 9 comments

Comments


artemgordinskiy commented Aug 1, 2023

⚙️ Request New Models

Additional context

Hi,
I've tried running this model on my iPhone 13 Pro Max, but it crashes instantly:
https://huggingface.co/mlc-ai/mlc-chat-llama2-7b-chat-uncensored-q4f16_1

However, this one runs really well:
https://huggingface.co/mlc-ai/mlc-chat-Llama-2-7b-chat-hf-q3f16_1

I think 7b-chat-uncensored would work as well if it were quantized at 3 bits instead of 4. Would you be able to add it to your HuggingFace space?
I'm also open to doing this myself, but the last time I tried, quantization did not work on my M1 MacBook using llama.cpp.

@CharlieFRuan
Contributor

Hi @artemgordinskiy, thanks for reaching out! Could you try following this tutorial to compile it?

@acalatrava
Contributor

I compiled it for you, just add this URL to download the model and it should work:
https://huggingface.co/acalatrava/mlc-chat-luna-ai-llama2-7b-chat-uncensored-q3f16_1

#692

@artemgordinskiy
Author

@acalatrava Thank you!
Unfortunately, it crashes instantly on load for me.
I don't know how to debug this, since I don't see any error message. I'm guessing it's an out-of-memory crash, but the "censored" model ran fine, so I'm not sure what it is...

@acalatrava
Contributor

My bad... The problem is that the public MLC-LLM iPhone app does not ship the compiled library for this model, so it crashes on load when it can't find one. Every time you create a new model, you have to package the app together with that model's library. If you have a Mac, you should be able to build and install the app yourself by following the instructions in the docs.

@acalatrava
Contributor

Now that I think about it more, it might work by modifying the mlc-chat-config.json file, so I did that and uploaded it to Hugging Face. @artemgordinskiy, please try removing the model and downloading it again; it may work now.

Unfortunately, I cannot test it myself, since I only have an iPhone 12...
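For context, the workaround above likely amounts to editing the model's mlc-chat-config.json so that it points at a model library the public app already bundles (the stock Llama-2 q3f16_1 one). A minimal sketch, assuming the config schema from the MLC-LLM docs of that period; the exact field values here are illustrative, not taken from the actual uploaded file:

```json
{
  "model_lib": "Llama-2-7b-chat-hf-q3f16_1",
  "local_id": "mlc-chat-luna-ai-llama2-7b-chat-uncensored-q3f16_1",
  "conv_template": "llama-2",
  "temperature": 0.7,
  "top_p": 0.95
}
```

The app looks up the compiled library by the `model_lib` name, so reusing a name the app already ships avoids the missing-library crash, as long as the substituted model was compiled with the same architecture and quantization.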

@artemgordinskiy
Author

@acalatrava This worked, thank you! 🙌

@acalatrava
Contributor

Great! It seems I should read the docs too, since it's explained here: https://mlc.ai/mlc-llm/docs/get_started/mlc_chat_config.html#configure-mlc-chat-json 😅

@dylanbeadle

Great model! Thank you!

@Re4mer

Re4mer commented Oct 18, 2023

Hi @acalatrava,
Could you please make a q4f16_1 version of luna-ai-llama2-7b-chat-uncensored?
I want to use it on an Android phone, and the Android app does not support 3-bit quantization, unlike the iOS app.
