How do I modify to run with gpt2-xl (1558M) parameters? #17

jmarsil · 2019-11-21T23:38:52Z

Any help would be greatly appreciated!

jasonzhou1 · 2019-12-01T03:19:35Z

I was able to find the s3 bucket locations of the pretrained GPT2 models here: https://github.com/huggingface/transformers/blob/master/transformers/modeling_gpt2.py (provided by HuggingFace).

To make this work, just download gpt2-xl model instead:

curl --output gpt2-pytorch_model.bin https://s3.amazonaws.com/models.huggingface.co/bert/gpt2-xl-pytorch_model.bin

paulbricman · 2019-12-17T17:08:11Z

@jasonzhou1 I only get gibberish output with the XL model, worse than the small version. Have you actually had any luck with it?

Update: also tried the other models linked to in the script you referenced, also without luck.

ZJiaBin · 2020-03-09T06:29:40Z

@jasonzhou1 I only get gibberish output with the XL model, worse than the small version. Have you actually had any luck with it?

Before you try gpt-2-ml model,some parameters in gpt-2-Pytorch/GPT2/config.py should be modified , like n-heads=25 , n_embd=1600 , n_layer=25, or you can see details here https://s3.amazonaws.com/models.huggingface.co/bert/gpt2-xl-config.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How do I modify to run with gpt2-xl (1558M) parameters? #17

How do I modify to run with gpt2-xl (1558M) parameters? #17

jmarsil commented Nov 21, 2019

jasonzhou1 commented Dec 1, 2019

paulbricman commented Dec 17, 2019 •

edited

Loading

ZJiaBin commented Mar 9, 2020 •

edited

Loading

How do I modify to run with gpt2-xl (1558M) parameters? #17

How do I modify to run with gpt2-xl (1558M) parameters? #17

Comments

jmarsil commented Nov 21, 2019

jasonzhou1 commented Dec 1, 2019

paulbricman commented Dec 17, 2019 • edited Loading

ZJiaBin commented Mar 9, 2020 • edited Loading

paulbricman commented Dec 17, 2019 •

edited

Loading

ZJiaBin commented Mar 9, 2020 •

edited

Loading