[Chatllama] what's supposed to be in the Actor checkpoint dir? #337

Open
StrangeTcy opened this issue Apr 18, 2023 · 3 comments

@StrangeTcy

I'm following the example from your README, and here is what happens:
my config.yaml has these lines for the actor:

actor_config:
  model: "decapoda-research/llama-7b-hf"
  model_folder: "./models"

Then I run `python artifacts/main.py artifacts/config/config.yaml --type ACTOR`.
main.py calls ActorTrainer from actor.py, which calls load_model from llama_model.py, which in turn calls load_checkpoints. That function expects to find *.pth files and a params.json file in my ckpt_dir, which is ./models. My models folder has neither, so I wonder where these files are supposed to come from.
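
For reference, this is the kind of layout I'd expect such a loader to want (a hedged sketch based on the original Meta LLaMA checkpoint release, not on ChatLLaMA's actual code; the `7B` subfolder and shard names are my assumptions):

```python
# Sketch of a Meta-style checkpoint directory and a quick sanity check.
# The ./models/7B nesting and shard names are assumptions taken from the
# original LLaMA release, not from ChatLLaMA's code.
import glob
import json
import os

ckpt_dir = "./models/7B"  # e.g. consolidated.00.pth would live here
shards = sorted(glob.glob(os.path.join(ckpt_dir, "*.pth")))
with open(os.path.join(ckpt_dir, "params.json")) as f:
    params = json.load(f)  # dim, n_layers, n_heads, ...
print(f"{len(shards)} shard(s) found, params: {params}")
```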

@StrangeTcy
Author

Oh. It seems like you mean the single file you'd get from running llama.download from pyllama. Let me try it out...

@PierpaoloSorbellini
Collaborator

PierpaoloSorbellini commented Apr 19, 2023

Hi @StrangeTcy, thanks for reaching out.
The first time the model is loaded, you probably won't have a checkpoints dir in /models.

The folder is created during training, when checkpoints are saved, and it gets populated from there.

To use a model from HF, you just need to set the model field in config.yaml to the name of the HF model; that name is passed to transformers.AutoModel() when the model is instantiated.

Be aware that HF itself had an issue loading the tokenizer for LLaMA. You may want to check whether that is still the case.
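
On the tokenizer point, a common workaround at the time was to load the LLaMA tokenizer class explicitly instead of going through AutoTokenizer (a hedged sketch, assuming the converted repo still names a non-existent `LLaMATokenizer` class in its tokenizer_config):

```python
# Hedged sketch: load the tokenizer class directly to sidestep the
# "LLaMATokenizer does not exist" error seen with some converted LLaMA repos.
from transformers import LlamaTokenizer

tokenizer = LlamaTokenizer.from_pretrained("decapoda-research/llama-7b-hf")
```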

@StrangeTcy
Author

The first time the model is loaded from ./models there are indeed no checkpoints there, but they can be downloaded with the Python or bash script from pyllama.
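
For completeness, the pyllama route is roughly the following (a sketch assuming pyllama's documented `llama.download` entry point; the exact flags may differ by version):

```sh
# Assumption: pyllama's download module; adjust --model_size and --folder as needed.
pip install pyllama
python -m llama.download --model_size 7B --folder ./models
```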

As for HF models and LLaMA, HF transformers are indeed handled by

```python
self.model = AutoModelForCausalLM.from_pretrained(
    config.model,
)
```

in `actor.py`, but pure llama models go through `load_model` from `llama_model.py`.
            
I guess I should try something like `decapoda-research/llama-7b-hf` as an HF model instead of the single-file llama checkpoint.
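
To make the distinction concrete, here's a rough sketch of the two paths as I understand them (a hypothetical helper for illustration, not the repo's actual logic):

```python
# Hypothetical sketch of the two loading paths discussed above; this is my
# reading of the thread, not ChatLLaMA's actual code.
import os

from transformers import AutoModelForCausalLM


def load_actor_model(model_name_or_dir: str):
    if os.path.isdir(model_name_or_dir):
        # A local directory of Meta-format checkpoints (*.pth + params.json)
        # would need the llama-style loader (load_model in llama_model.py).
        raise NotImplementedError("use llama_model.load_model for raw checkpoints")
    # Otherwise treat the string as an HF hub id, as actor.py does.
    return AutoModelForCausalLM.from_pretrained(model_name_or_dir)


# e.g. model = load_actor_model("decapoda-research/llama-7b-hf")
```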
