[Tracking] New Model Requests #692
Comments
@CharlieFRuan I am very happy to contribute my code, but I need some information to get started. Can you provide some more information or documents?
@MrJungle1 Hi, thank you for the interest! Documentation can be found here: https://llm.mlc.ai/docs/ Specifically, instructions for compiling models are here: https://llm.mlc.ai/docs/compilation/compile_models.html
@CharlieFRuan Hello, thank you for your reply. I read this doc, but I don’t seem to find what I want. If I want to support a new model_type, such as "model_type": "qwen", where should I start? Or as you say, a new model architecture.
Ahh I see! We are currently migrating to a new workflow, and documentation should be up soon (this week probably). Meanwhile, you could refer to #1408, which adds support for GPT-NeoX. Adding a new model should be similar. The commands to use in this new workflow (for now, since it is not officially supported yet) are:
Then you can use the same steps as before to run the model at runtime. Made a tracker for the new workflow here: #1420; we will add a Colab tutorial on adding new models.
Great, thank you for your reply, LGTM!
Could you please add Microsoft Phi-1.5 and Phi-2 to the list (#905)? Thanks.
@niutech Phi-1.5 and Phi-2 are now supported: https://llm.mlc.ai/docs/prebuilt_models.html#level-1-supported-model-architectures-the-all-in-one-table
Replacing this page with https://github.com/orgs/mlc-ai/projects/2. Submit a request to the dashboard following #1042 |
Overview
This is a tracker for all new model requests from the community. The end goal is to have prebuilt models for all these models (either from us, or from the community -- see below).
Help Wanted
We would really appreciate it if you could contribute to the project by compiling any requested model listed below.
You could:
For more information (e.g. what architectures are currently supported) see: https://mlc.ai/mlc-llm/docs/prebuilt_models.html
To see what prebuilt models we currently have: https://huggingface.co/mlc-ai
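Before filing a request, it can help to check which of the categories below a model falls under. The following is only a rough sketch, assuming the model ships a Hugging Face style `config.json` with a `model_type` field (as in the `"model_type": "qwen"` example from the comments). The `SUPPORTED_MODEL_TYPES` set and the function names are hypothetical placeholders, not the project's official list or API; consult the supported-architectures page linked above for the real list.

```python
import json

# Hypothetical, hand-maintained set for illustration only.
# It is NOT the project's official list of supported architectures.
SUPPORTED_MODEL_TYPES = {"llama", "gpt_neox"}


def read_model_type(config_path):
    """Return the `model_type` field of a config.json, or None if absent."""
    with open(config_path, encoding="utf-8") as f:
        config = json.load(f)
    return config.get("model_type")


def classify_request(config_path, supported=SUPPORTED_MODEL_TYPES):
    """Map a model config to one of the tracker's three categories."""
    model_type = read_model_type(config_path)
    if model_type is None:
        # No architecture information available in the config.
        return "others"
    if model_type in supported:
        return "supported"
    return "unsupported"
```

For example, `classify_request("path/to/config.json")` on a config containing `"model_type": "qwen"` would return `"unsupported"` under the placeholder set above, suggesting the request belongs under the unsupported-architecture list.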
Thank you!
Models with Supported Architecture
georgesung/llama2_7b_chat_uncensored
FlagAlpha/Llama2-Chinese-7b-Chat
Tap-M/Luna-AI-Llama2-Uncensored (q3)
Guanaco
HuggingFaceH4/starchat-alpha
Moss
CodeLlama (f32 in wasm specifically)
Models with Unsupported Architecture
Others