
How can we get attention weights from example sequence and structure? #119

Open
what-is-what opened this issue Oct 8, 2024 · 2 comments


How can we get attention weights from an example sequence and structure?
There are no arguments for returning attention weights from the transformer blocks, unlike in ESM2.


gelnesr commented Oct 20, 2024

also interested in this feature, if available!

ebetica (Contributor) commented Nov 11, 2024

Unfortunately, PyTorch's flash attention doesn't let you do this. You'll have to hack it in; we'll look into supporting it officially. Here's where the attention is computed. You'll just have to use a plain PyTorch implementation of attention to expose the attention matrix.

if seq_id is not None:
    # Where True, enable participation in attention.
    mask_BLL = seq_id.unsqueeze(-1) == seq_id.unsqueeze(-2)
    mask_BHLL = mask_BLL.unsqueeze(1)
    context_BHLD = F.scaled_dot_product_attention(
        query_BHLD, key_BHLD, value_BHLD, mask_BHLL
    )
else:
    # Shortcut: if we don't use attention biases, then torch
    # will autoselect flash attention as the implementation.
    context_BHLD = F.scaled_dot_product_attention(
        query_BHLD, key_BHLD, value_BHLD
    )
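
For reference, a minimal sketch of what such a replacement might look like. The tensor names follow the snippet above; the helper attention_with_weights is illustrative only and not part of the package:

import torch

def attention_with_weights(query_BHLD, key_BHLD, value_BHLD, mask_BHLL=None):
    """Unfused attention that mirrors F.scaled_dot_product_attention
    but also returns the softmax-ed attention weights."""
    d = query_BHLD.shape[-1]
    # Raw scores: (B, H, L, L)
    scores_BHLL = query_BHLD @ key_BHLD.transpose(-2, -1) / d**0.5
    if mask_BHLL is not None:
        # Boolean mask, True = allowed to attend (as in the snippet above).
        scores_BHLL = scores_BHLL.masked_fill(~mask_BHLL, float("-inf"))
    attn_BHLL = torch.softmax(scores_BHLL, dim=-1)
    context_BHLD = attn_BHLL @ value_BHLD
    return context_BHLD, attn_BHLL

Swapping this in for both F.scaled_dot_product_attention calls and stashing attn_BHLL somewhere reachable (for example as an attribute on the attention module) would let you read the weights out after a forward pass, at the cost of flash attention's speed and memory savings.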
