Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

large model release, would you update it? #8

Open
chikiuso opened this issue Sep 1, 2020 · 7 comments
Open

large model release, would you update it? #8

chikiuso opened this issue Sep 1, 2020 · 7 comments

Comments

@chikiuso
Copy link

chikiuso commented Sep 1, 2020

hi I see the large model of gpt2 and the pretrained dialogpt based on it is released, would you consider include in this project? thanks!

@polakowo
Copy link
Owner

polakowo commented Sep 1, 2020

Yes, I'm going to update the package to the new transformers version soon.

@chikiuso
Copy link
Author

chikiuso commented Sep 2, 2020

thanks a lot :D

@ghost
Copy link

ghost commented Sep 22, 2020

Yeah, that would be awesome !

@polakowo
Copy link
Owner

I released a new version that accepts any text generation model, including large dialogpt (use model = microsoft/DialoGPT-large in chatbot.cfg).

@ghost
Copy link

ghost commented Sep 28, 2020

@polakowo

I made a fork and added Dockerfiles with GPU/CPU support. It works awesome !

Would be awesome to use it with the deeppavlov-agent...

Thanks again !

@polakowo
Copy link
Owner

@lucmichalski looks great! I had a Dockerfile previously but it downloaded the same model every time I deployed the container, which made it too expensive to play with. Maybe I should have synced caching dirs to avoid this.

@ghost
Copy link

ghost commented Sep 28, 2020

@polakowo

do you want me to do a PR ? I used a docker volume for the model

I made some modifications to the worktree https://github.com/lucmichalski/gpt2bot

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants