Possible to train Llama 3.1? #133

mosh98 · 2024-07-30T09:29:02Z

Hi,

I tried training llama 3.1 with run_mntp.py but get an obsucre error

AttributeError: 'LlamaBiModel' object has no attribute 'rotary_emb'

What is that about ?

The text was updated successfully, but these errors were encountered:

bzantium · 2024-08-01T13:33:59Z

you can check this: 03382c3

mosh98 · 2024-08-02T11:08:53Z

hmm still not sure what to do...

stefanhgm · 2024-08-05T12:46:58Z

Hi everyone,

@bzantium thanks for pointing us to the commit. I added the respective lines and used a more recent version of transformers to make it work. MNTP training for Llama 3.1 seems to work now for me. However, I failed to do the MTEB evaluation locally so far, see #123.

Did you make any progress in training Llama 3.1 for LLM2Vec?

mosh98 · 2024-08-05T13:44:58Z

@bzantium Thanks i was able to get the embeddings after adding in the lines, haven't been able to train it yet through MNTP but i'll keep on trying

andupotorac · 2024-08-12T22:22:51Z

@stefanhgm Once Llama 3.1 (I presume the 8B parameters model) is trained, can you use it for generating images the way ELLA uses t5, with better prompt adherence?

stefanhgm · 2024-08-13T12:52:37Z

@andupotorac I am not familiar with the ELLA project, but you could use the model to create embeddings just as with the other LLM2Vec models.

However, the eval on MTEB currently hangs #135

andupotorac · 2024-08-13T15:40:10Z

Thanks, I will keep an eye on it as well.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Possible to train Llama 3.1? #133

Possible to train Llama 3.1? #133

mosh98 commented Jul 30, 2024

bzantium commented Aug 1, 2024 •

edited

Loading

mosh98 commented Aug 2, 2024

stefanhgm commented Aug 5, 2024

mosh98 commented Aug 5, 2024

andupotorac commented Aug 12, 2024

stefanhgm commented Aug 13, 2024

andupotorac commented Aug 13, 2024

Possible to train Llama 3.1? #133

Possible to train Llama 3.1? #133

Comments

mosh98 commented Jul 30, 2024

bzantium commented Aug 1, 2024 • edited Loading

mosh98 commented Aug 2, 2024

stefanhgm commented Aug 5, 2024

mosh98 commented Aug 5, 2024

andupotorac commented Aug 12, 2024

stefanhgm commented Aug 13, 2024

andupotorac commented Aug 13, 2024

bzantium commented Aug 1, 2024 •

edited

Loading