
add Qwen2-VL static generation #1512

Open · wants to merge 23 commits into base: main
Conversation

@Spycsh (Contributor) commented Nov 22, 2024

What does this PR do?

Add Qwen2-VL static generation.
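For reviewers, a minimal usage sketch of what static Qwen2-VL generation looks like on Gaudi. This is illustrative only: it assumes the usual adapt_transformers_to_gaudi flow and the transformers Qwen2-VL API; the lazy_mode / hpu_graphs kwargs follow the generate() changes in this PR, and the model id and image path are placeholders.

```python
import torch
from PIL import Image
from transformers import AutoProcessor, Qwen2VLForConditionalGeneration
from optimum.habana.transformers.modeling_utils import adapt_transformers_to_gaudi

# Patch transformers with the Gaudi-optimized model code (including the Qwen2-VL
# changes from this PR), then load the model on HPU in bf16.
adapt_transformers_to_gaudi()
model_id = "Qwen/Qwen2-VL-7B-Instruct"
processor = AutoProcessor.from_pretrained(model_id)
model = Qwen2VLForConditionalGeneration.from_pretrained(model_id, torch_dtype=torch.bfloat16).to("hpu")

# Build a chat-style prompt with one image placeholder and one text instruction.
messages = [{"role": "user", "content": [{"type": "image"},
                                         {"type": "text", "text": "Describe this image."}]}]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=[prompt], images=[Image.open("demo.jpg")], return_tensors="pt").to("hpu")

# lazy_mode / hpu_graphs are the kwargs validated by the generate() changes in this PR;
# static shapes are assumed to be driven by the Gaudi generation config.
output = model.generate(**inputs, max_new_tokens=128, lazy_mode=True, hpu_graphs=True)
print(processor.batch_decode(output, skip_special_tokens=True)[0])
```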

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

@Spycsh (Contributor, Author) commented Nov 22, 2024

The pipeline test will not pass until optimum-habana matches the latest changes in transformers (huggingface/transformers#34769), namely the task-name change image-to-text ==> image-text-to-text in examples/image-to-text/run_pipeline.py for many of the VLMs. For now I have validated that generation passes using my own test script https://github.com/Spycsh/qwen-vl-hpu/blob/main/qwen2_vl.py.
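For context, roughly what that task-name change looks like (the model id here is only an example):

```python
from transformers import pipeline

# Before transformers v4.47, the optimum-habana example uses the older task name:
# pipe = pipeline("image-to-text", model="Qwen/Qwen2-VL-7B-Instruct")

# After huggingface/transformers#34769, VLMs that take an image plus a text prompt
# are served under the new task name instead:
pipe = pipeline("image-text-to-text", model="Qwen/Qwen2-VL-7B-Instruct")
```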

@jiminha (Collaborator) commented Nov 25, 2024

> ...namely the task-name change image-to-text ==> image-text-to-text in examples/image-to-text/run_pipeline.py for many of the VLMs. I have currently validated the pass using my own test script https://github.com/Spycsh/qwen-vl-hpu/blob/main/qwen2_vl.py.

Are you saying we need transformers 4.47 for this to work, or can you update examples/image-to-text/run_pipeline.py to support both cases?

@@ -974,14 +979,16 @@ def generate(
        # 1. Handle `generation_config` and kwargs that might update it, and validate the `.generate()` call
        self._validate_model_class()
        tokenizer = kwargs.pop("tokenizer", None)  # Pull this out first, we only use it for stopping criteria
        #assistant_tokenizer = kwargs.pop("assistant_tokenizer", None)  # only used for assisted generation
Collaborator:

Please remove all this commented-out code.

@Spycsh (Contributor, Author) Nov 27, 2024

I think this is also affected by transformers 4.47. I will remove or uncomment it once transformers has its v4.47.0 release and optimum-habana is updated to match 4.47.
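For illustration, one possible alternative to leaving the call commented out would be a version guard inside generate(). This is only a sketch, not part of the PR: it reuses the names (self, kwargs, tokenizer, assistant_model) visible in the diffs above and below, and the 4.47 signature is taken from the commented-out line itself.

```python
from packaging import version
from transformers import __version__ as transformers_version

if version.parse(transformers_version) >= version.parse("4.47.0"):
    # transformers >= 4.47 also validates the tokenizers used for assisted generation
    assistant_tokenizer = kwargs.pop("assistant_tokenizer", None)
    self._validate_assistant(assistant_model, tokenizer, assistant_tokenizer)
else:
    self._validate_assistant(assistant_model)
```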

        if hpu_graphs and not lazy_mode:
            raise ValueError(
                "`hpu_graphs` is True but `lazy_mode` is False. HPU graphs require `lazy_mode` to be set to True."
            )
        num_virtual_tokens = kwargs.pop("num_virtual_tokens", 0)
        generation_config, model_kwargs = self._prepare_generation_config(generation_config, **kwargs)
        self._validate_model_kwargs(model_kwargs.copy())
        self._validate_assistant(assistant_model)
        #self._validate_assistant(assistant_model, tokenizer, assistant_tokenizer)
        self._validate_assistant(assistant_model,)

Collaborator:

Is this change to utils.py needed? Please remove it if it is not.

Contributor (Author):

Same as above.

@jiminha (Collaborator) commented Nov 25, 2024

@tthakkal Could you also review this, please? Thanks.

@Spycsh (Contributor, Author) commented Nov 27, 2024

> Are you saying we need transformers 4.47 for this to work, or can you update examples/image-to-text/run_pipeline.py to support both cases?

Yes. An update to the latest transformers is needed here, and run_pipeline.py also needs to be updated correspondingly. I will look into this and get back to you later.
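One hypothetical shape for that run_pipeline.py update, gating the task name on the installed transformers version (the model id is only an example):

```python
import transformers
from packaging import version

# Keep the example working both before and after the 4.47 task rename.
if version.parse(transformers.__version__) >= version.parse("4.47.0"):
    task = "image-text-to-text"
else:
    task = "image-to-text"

pipe = transformers.pipeline(task, model="Qwen/Qwen2-VL-7B-Instruct")
```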
