-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Generate llama instead of downloading it #250
Conversation
cc8a7c3
to
de3ffda
Compare
This should work for 8B and 70B llama3 models. To test it tho there's 2 points where I would need tips:
|
We could try to use in the config to use the original checkpoints.
instead
|
42f3aed
to
6da9faa
Compare
Could you try to see if we are able to load the |
Sure I'll try but with the
But I think it should work as there's an option to offload on CPU :
|
6da9faa
to
eca81da
Compare
Tested with 8 GPUs and with 4 GPUs and ~600G RAM |
Would you use the cloud-ci to test this? |
8b35c3d
to
282e9c2
Compare
* rename huggingface token to MILABENCH_* to automatically forward the env var to a remote in such cases
282e9c2
to
39222b8
Compare
model-00028-of-00030.safetensors, | ||
model-00029-of-00030.safetensors, | ||
model-00030-of-00030.safetensors, | ||
model-00001-of-00062.safetensors, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Cela serait bien si les poids étaient interchangeables
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Qu'on fasse la génération du fichier .yaml après avoir généré les fichiers .safetensors?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
que la génération génère 30 safetensors pour qu'on puisse changer entre pretrained & generated sans problème
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done here : #278
No description provided.