Added custom mamba op and fixed the mamba cache issue #1521
base: main
Conversation
@zzhang37 please update test to reflect the change
from transformers.utils import (
    ModelOutput,
    logging,
)

from pathlib import Path
import os
base_dir = "/workspace/custom_op_pscan_all"
@libinta what would be the best way to set this without hardcoding?
At least an env var?
Is this dir generated on the fly? Or is it supposed to be downloaded (e.g. as part of an example)?
I will change it to be based on our relative folder location.
Done.
Added HABANA_CUSTOM_OP_DIR to set the custom op lib folder; otherwise the current folder is used as the lib folder.
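A minimal sketch of the env-var-with-fallback pattern described above. The library file name hpu_custom_pscan_all.so is an assumption for illustration, not the PR's actual file name; only torch.ops.load_library is a known PyTorch API.

import os
from pathlib import Path

import torch

def load_custom_pscan_lib():
    # HABANA_CUSTOM_OP_DIR overrides the lib folder; otherwise use this file's folder.
    base_dir = os.environ.get("HABANA_CUSTOM_OP_DIR", str(Path(__file__).parent))
    lib_path = Path(base_dir) / "hpu_custom_pscan_all.so"  # hypothetical library name
    if lib_path.is_file():
        torch.ops.load_library(str(lib_path))  # registers the custom op with PyTorch
        return True
    return False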
A, D are input independent (see Mamba paper [1] Section 3.5.2 "Interpretation of A" for why A isn't selective)
∆, B, C are input-dependent (this is a key difference between Mamba and the linear time invariant S4,
and is why Mamba is called **selective** state spaces)
"""
@zzhang37 can you please add a comment in the code about the difference between this and the original impl?
Done
# fmt: off
def slow_forward(self, input_states, cache_params: Optional[MambaCache] = None, cache_position: Optional[torch.LongTensor] = None, attention_mask: Optional[torch.LongTensor] = None):
    batch_size, seq_len, _ = input_states.shape
@zzhang37, can you please add a brief code comment about the difference between this and the original?
Is it only Run_Mamba_Forward_Gaudi?
@zzhang37 Also, are all Synapse dependencies merged into the 1.19 release?
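For readers following the discussion, a hypothetical sketch of the dispatch the questions above imply: use the Gaudi custom pscan path when the custom op library loaded, otherwise fall back to the reference slow_forward. Run_Mamba_Forward_Gaudi's signature and the availability flag are assumptions, not the PR's actual code.

CUSTOM_PSCAN_AVAILABLE = load_custom_pscan_lib()  # hypothetical flag from the loader sketched earlier

def gaudi_mamba_mixer_forward(self, input_states, cache_params=None, cache_position=None, attention_mask=None):
    # Dispatch to the Gaudi custom pscan kernel when available; otherwise use the
    # reference slow path (which this PR also patches for the cache issue).
    if CUSTOM_PSCAN_AVAILABLE:
        return Run_Mamba_Forward_Gaudi(self, input_states, cache_params, cache_position, attention_mask)
    return self.slow_forward(input_states, cache_params, cache_position, attention_mask)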
What does this PR do?
Added custom mamba pscan op and fixed the mamba cache issue
Fixes # (issue)
Before submitting