-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
finetuning with MLM task #70
Comments
One more thing, |
Hi, thank you for your interest in our work and asking some intriguing questions!
Yes. Setting
Interesting question! I think it depends on what kind of MLM task the model was pre-trained on. For SaProt, we didn't force it to predict structure tokens and only amino acid tokens were predicted to compute loss. We discussed this thing in our paper Appendix F. In this case SaProt may not be endowed the capability to do structure prediction. On the contrary, it indeed could do protein sequence design given a structure backbone, see our latest SaprotHub papar Fig 1.g.
Yes. If you check our code https://github.com/westlake-repl/SaProt/blob/main/model/saprot/base.py you would find we have already implemented LoRA technique for model fine-tuning. You could simply enable it by setting Hope this could resolve your quesitons. Let me know if I could further help you :) |
Thanks again for your amazingly helpful and quick reply! |
Hi, again :) After I finished studying yours codes, However, the usage of .mdb file is quite tricky to me. So my question is,
Many thanks for your help! |
Hi, Apology for the late reply! I've been busy working on another project and didn't see the new issue:(
I think it is not suitble to do so. Our code is based on the
Sure! Please refer to this reply #72 (comment) and you could generate your own Hope this could resolve your problem:) |
Awesome work :)
I am thinking about finetuning this model on specific protein domain with MLM task.
As in, starting from the SaProt model weights, further finetuning on un-labeled dataset.
python scripts/training.py -c config/Pretrain/saprot.yaml
I only need to change the load_pretrained: True ??
I also want to ask your opinion on this
Would it be possible to use this model as kind of structure prediction?
(sequence given -> structure token recovery -> 3D structure reconstruction)
Or vise versa (i.e. sequence design given structure)
Thank you in advance for your comment!
The text was updated successfully, but these errors were encountered: