Converting models from GPT-NeoX to HuggingFace format #33
Comments
Hi, that's great to hear. The basic steps should be the following:
Let me know if you have further questions or updates, and I hope it isn't too much trouble! -Vincent
Thanks for your directions, Vincent. I will follow them, and if I run into issues or have questions, I will let you know.
Hello @VHellendoorn, what are the specs of the server you used for training and running inference on the model? Also, could you point me to the provider you rented it from? Thanks.
Hi @sleekmike, I'm sure that Vincent will respond soon. As far as I remember, the models were trained on 8 Nvidia RTX 8000 GPUs on a single machine. But why is that a concern? Wouldn't you rather start with our smaller models, which can fit on any GPU?
@urialon Thanks for your response. I just wanted to get an idea; I will start with a much smaller model, as you suggested.
Hello everyone, I have tried converting the GPT-NeoX model to HuggingFace format and verified that it works properly on the 0.4B model. Please refer to this PR.
Thanks a lot, @NinedayWang! @VHellendoorn, what are the next steps needed? Do we need to push code to
Yes, thanks @NinedayWang! I'll try it out as soon as I have some time. In terms of next steps: if this just works with the HF classes, which it sounds like it does, the next step is to add a how-to to the README and clean up #34 a bit so that it works for all published models (the vocabulary will be identical, and the script probably too, so only the distilled config file needs to be replicated).
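For what it's worth, the "replicate the config file per model" step could look roughly like the sketch below. This is only an illustration under stated assumptions, not the script from #34: the field names, the values, and the model names (`example-small`, `example-large`) are all hypothetical placeholders, and the helpers `build_configs` and `write_configs` are made up for this example. The idea is simply that every model shares one base config (including the vocabulary size) and overrides only its size-specific fields.

```python
import copy
import json
import tempfile
from pathlib import Path

# Hypothetical base config. Field names and values are placeholders,
# NOT the real hyperparameters of any published model.
BASE_CONFIG = {
    "vocab_size": 50000,  # shared by every model, since the vocabulary is identical
    "hidden_size": 1024,
    "num_hidden_layers": 24,
    "num_attention_heads": 16,
}

# Hypothetical per-model overrides; only size-related fields differ.
MODEL_OVERRIDES = {
    "example-small": {"hidden_size": 768, "num_hidden_layers": 12, "num_attention_heads": 12},
    "example-large": {"hidden_size": 2048, "num_hidden_layers": 32, "num_attention_heads": 32},
}


def build_configs(base, overrides):
    """Return one config dict per model: a copy of the base plus its overrides."""
    configs = {}
    for name, changes in overrides.items():
        unknown = set(changes) - set(base)
        if unknown:
            # Catch typos early instead of silently adding new fields.
            raise KeyError(f"{name}: unknown config fields {sorted(unknown)}")
        cfg = copy.deepcopy(base)  # leave the base config untouched
        cfg.update(changes)
        configs[name] = cfg
    return configs


def write_configs(configs, out_dir):
    """Write <out_dir>/<model name>/config.json for each model; return the paths."""
    paths = []
    for name, cfg in configs.items():
        model_dir = Path(out_dir) / name
        model_dir.mkdir(parents=True, exist_ok=True)
        path = model_dir / "config.json"
        path.write_text(json.dumps(cfg, indent=2, sort_keys=True))
        paths.append(path)
    return paths


if __name__ == "__main__":
    # Demo: write the placeholder configs into a temporary directory.
    with tempfile.TemporaryDirectory() as tmp:
        for p in write_configs(build_configs(BASE_CONFIG, MODEL_OVERRIDES), tmp):
            print(p.relative_to(tmp))
```

The real field names would have to match whatever the HF model class expects, so treat the dictionary keys above as stand-ins.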
@VHellendoorn, note that it depends on @NinedayWang's PR to
Hello,
I am interested in converting the models from GPT-NeoX to HuggingFace format, and I am volunteering to do so.