Adding new model #32

samarth1612 · 2024-09-02T09:31:48Z

The below mentioned point where the link for GPTModel is broken and can't get the exact config for the yaml file hence not able to add new model (llama2-13b) as not able to profile the data for that model.

Add a YAML model config for the new model in data/model_configs.

Use the model's HuggingFace model id for the file name eg. data/model_configs/meta-llama/Llama-2-70b-hf.yml.
Refer HuggingFace config.json for the model eg. https://huggingface.co/meta-llama/Llama-2-70b-hf/blob/main/config.json.
Ensure that correct parameters are set in the YAML file so that the reference transformer model GPTModel closely resembles the new model.
We use this reference model to profile only the MLP operations of all the models so the attention operations are no-op'ed here.

Would like to know if there is any solution for this as even the model_configs folder is not present in the base directory.

AgrawalAmey · 2024-09-05T19:24:43Z

@anmolagarwalcp810 please share the updated instructions? and let's also make sure that we reflect those in the docs.

samarth1612 · 2024-09-17T10:41:35Z

Any update on this front ?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding new model #32

Adding new model #32

samarth1612 commented Sep 2, 2024

AgrawalAmey commented Sep 5, 2024

samarth1612 commented Sep 17, 2024

Adding new model #32

Adding new model #32

Comments

samarth1612 commented Sep 2, 2024

AgrawalAmey commented Sep 5, 2024

samarth1612 commented Sep 17, 2024