You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Then I run python artifacts/main.py artifacts/config/config.yaml --type ACTOR. main.py calls ActorTrainer from actor.py, which calls load_model from llama_model.py, which calls load_checkpoints, which expects to find *.pth files and a params.json file in my ckpt_dir which is ./models. My models folder has neither . I wonder where these file have to come from.
The text was updated successfully, but these errors were encountered:
Hi @StrangeTcy, thanks for reaching out.
The first time that the model is loaded, you probably won't have the checkpoints dir in /models.
The folder is created when during training checkpoints are saved and the folder gets populated.
To specify the models from HF you just need to type in the config.yaml in the model field the name of the model from HF that is passed to transformer.AutoModel() when instantiating the model.
Be aware that HF itself had an issue when loading the tokenizer for llama. You may need to check that if it is still an issue.
The first time the model is loaded from ./models, there are indeed no checkpoints there, but they can be downloaded with the python or bash script from pyllama.
As for HF models and LLaMA, HF transformers are indeed handled by the
self.model = AutoModelForCausalLM.from_pretrained(
config.model,
)
``` in the `actor.py`, but pure llama models go through `load_model` from `llama_model.py`.
I guess I should try something like `decapoda-research/llama-7b-hf` as an HF model instead of the single-file llama checkpoint
I'm following the example from your README, and it works like this:
my
config.yaml
has these line for the actor:Then I run
python artifacts/main.py artifacts/config/config.yaml --type ACTOR
.main.py
callsActorTrainer
fromactor.py
, which callsload_model
fromllama_model.py
, which callsload_checkpoints
, which expects to find*.pth
files and aparams.json
file in myckpt_dir
which is./models
. Mymodels
folder has neither . I wonder where these file have to come from.The text was updated successfully, but these errors were encountered: