first sequence than structure or the other way around #141

l-i-g · 2024-11-15T13:19:06Z

Thanks for providing this comprehensive model open-source.

I was wondering what the correct sequence of predictions is:

In your examples, e.g. with the Carbonic Anhydrase (2vvb) the following order is used:

   masked sequence prompt => predicted sequence track => predicted backbone structure

For the GFP evolution gfp_design.ipynb (related to your publication) the order is:

  heavily masked sequence & masked structure => structure tokens =>  generated sequence tokens ==> purged structure tokens => freshly generated structure token

In my particular case I have a heavily masked sequence track with the unmasked amino acids with coordinates provided and a secondary structure track with only a few masked position. Is it better to to first predict the structure or first predict the sequence?

Or more general: Is their a general rule which track should be predicted first?

Thanks for any feed-back

The text was updated successfully, but these errors were encountered:

ebetica · 2024-12-04T22:00:40Z

Unfortunately, I don't have a general rule on which tracks to sample. It differs from prompt to prompt, and also depends on what you're trying to predict. If you're looking at a problem that's very structural, then first predicting structure tokens is the way to go.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

first sequence than structure or the other way around #141

first sequence than structure or the other way around #141

l-i-g commented Nov 15, 2024

ebetica commented Dec 4, 2024

first sequence than structure or the other way around #141

first sequence than structure or the other way around #141

Comments

l-i-g commented Nov 15, 2024

ebetica commented Dec 4, 2024