Hi,
I just wanted to know what the ideal tokenizer model_max_length/max_length would be during inference of the model.
Does max_length affect the generation quality of the questions? If yes, can you briefly explain why?
Thanks!
P.S. - I've been using 2048 as my max length.
The ideal max length doesn't depend on the parameter size of the model; it depends on the training data. You can increase the max length as far as your compute allows, but the best output is generated when it matches the max length of the training data.
So I prefer to use the training data's max length as the inference max length.
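For what it's worth, here's a minimal sketch of what that looks like with the Hugging Face transformers API. The checkpoint name and the 512 value are placeholders (the actual training max_length hasn't been confirmed in this thread), so swap in the real values for this model.

```python
# Minimal sketch: keep the inference max_length aligned with the training max_length.
# MODEL_NAME and TRAIN_MAX_LENGTH are placeholders, not values from this repo.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

MODEL_NAME = "your-question-generation-checkpoint"  # hypothetical checkpoint
TRAIN_MAX_LENGTH = 512  # replace with the max_length actually used during training

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME, model_max_length=TRAIN_MAX_LENGTH)
model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_NAME)

context = "The Eiffel Tower was completed in 1889 and is located in Paris."
inputs = tokenizer(context, truncation=True, max_length=TRAIN_MAX_LENGTH, return_tensors="pt")

# Generation length is also capped at the training max length for consistency.
outputs = model.generate(**inputs, max_length=TRAIN_MAX_LENGTH)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```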
Okay, so what max_length was used during training?
Are any training scripts provided?