generation utils update (minor) #1468
base: main
Conversation
- Fix the type hint: dtype cannot be a str
- Fix the device hint
- Remove the pad token id argument; the decoder_attention_mask is a binary mask of 0s and 1s
- Add an early return
- Extract `is_mqa_model` and `lazy_mode` into locals to avoid repeated dictionary lookups (see the sketch after this list)
- Use more descriptive variable names and simplify the nested loops for better readability
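To illustrate the dictionary-lookup refactor, here is a minimal sketch; the function name, the `model_kwargs` dictionary, and the loop body are hypothetical stand-ins for illustration, not the PR's actual code:

```python
# Hypothetical sketch: hoist repeated dictionary lookups into locals.
# `process_layers` and `model_kwargs` are assumed names, not the PR's code.

# Before: the same keys were looked up on every loop iteration, e.g.
#   for layer in layers:
#       if model_kwargs["is_mqa_model"] and model_kwargs["lazy_mode"]:
#           ...

# After: look each value up once, then reuse the locals inside the loop.
def process_layers(layers, model_kwargs):
    is_mqa_model = model_kwargs["is_mqa_model"]
    lazy_mode = model_kwargs["lazy_mode"]
    for layer in layers:
        if is_mqa_model and lazy_mode:
            ...  # per-layer cache handling
```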
The text-generation CI has been executed and will be compared with the main branch once the run is complete.
@yafshar, just a couple of comments below.
Please post the results of the CI, before and after the change.
@yafshar, makes sense.
@yafshar, could you post the CI results? Thanks.
LGTM!
Aligned with @emascarenhas, let's make sure there is no regression in generation tests and then I'll merge it 🙂
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
I am running the slow CI tests.
@yafshar, can you post the results from the CI tests here? Thanks.
I just finished the CI tests on both main and this PR on the same machine:

```
python -m pytest tests/test_text_generation_example.py tests/test_encoder_decoder.py -v -s
```

4 failed, 59 passed. I checked the failures:

-> test_text_generation_bf16_1x[token0-EleutherAI/gpt-j-6b-1-False-160.5823842101192-False]

```
pr:   FAILED tests/test_text_generation_example.py::test_text_generation_bf16_1x[token0-google/gemma-7b-1-False-109.70751574382221-True] - AssertionError: assert 'DeepSpeed is...be efficient,' == 'DeepSpeed is... PyTorch, and'
main: FAILED tests/test_text_generation_example.py::test_text_generation_bf16_1x[token0-google/gemma-7b-1-False-109.70751574382221-True] - AssertionError: assert 'DeepSpeed is...be efficient,' == 'DeepSpeed is... PyTorch, and'
```

-> test_text_generation_bf16_1x[token0-state-spaces/mamba-130m-hf-1536-False-5385.511100161605-False]

```
pr:   FAILED tests/test_text_generation_example.py::test_text_generation_bf16_1x[token0-state-spaces/mamba-130m-hf-1536-False-5385.511100161605-False] - assert 4895.173518373703 >= ((2 - 1.05) * 5385.511100161605)
main: FAILED tests/test_text_generation_example.py::test_text_generation_bf16_1x[token0-state-spaces/mamba-130m-hf-1536-False-5385.511100161605-False] - assert 4895.212904578489 >= ((2 - 1.05) * 5385.511100161605)
```

-> test_text_generation_bf16_1x[token0-Deci/DeciLM-7B-1-False-120-False]

```
pr:   FAILED tests/test_text_generation_example.py::test_text_generation_bf16_1x[token0-Deci/DeciLM-7B-1-False-120-False] - assert 107.58924903315328 >= ((2 - 1.05) * 120)
main: FAILED tests/test_text_generation_example.py::test_text_generation_bf16_1x[token0-Deci/DeciLM-7B-1-False-120-False] - assert 107.56332773820075 >= ((2 - 1.05) * 120)
```

-> test_text_generation_fp8[token0-tiiuae/falcon-180B-4-950-True-128-128-2506.68]

```
pr:   FAILED tests/test_text_generation_example.py::test_text_generation_fp8[token0-tiiuae/falcon-180B-4-950-True-128-128-2506.68] - AssertionError: The following command failed:
main: FAILED tests/test_text_generation_example.py::test_text_generation_fp8[token0-tiiuae/falcon-180B-4-950-True-128-128-2506.68] - AssertionError: The following command failed:
```

The failures are exactly the same.
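For context on the mamba and DeciLM assertions above: the tests compare measured throughput against a recorded baseline with a tolerance factor, and `(2 - 1.05)` reduces to `0.95`, i.e. a 5% allowance below baseline. A minimal sketch of that check, with hypothetical names; only the numbers and the assertion shape come from the logs above:

```python
def check_throughput(measured: float, baseline: float, tolerance: float = 1.05) -> None:
    # (2 - tolerance) == 0.95, so the run must reach >= 95% of baseline.
    assert measured >= (2 - tolerance) * baseline

# Reproduces the mamba-130m failure from the log above:
# 4895.17 < 0.95 * 5385.51 ≈ 5116.24, so this raises AssertionError.
check_throughput(4895.173518373703, 5385.511100161605)
```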
@regisss @emascarenhas I do not see any regression; the behavior is the same as far as I have tested.
What does this PR do?
- Fixes the import path: `transformers.streamers` -> `transformers.generation.streamers`
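A minimal sketch of the corrected import; `TextStreamer` is used here only as an example class from that module, the PR may import a different name:

```python
# The streamer classes live under transformers.generation, so the
# fully qualified module path is transformers.generation.streamers.
from transformers.generation.streamers import TextStreamer
```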
- `return x.index_fill(1, torch.tensor(0), 1)` creates the index tensor `torch.tensor(0)` on the wrong (default) device; it is fixed by creating the index on the correct device: `index = torch.tensor(0, device=device)`
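A minimal sketch of the device fix; the wrapper function name is hypothetical, but the buggy and fixed lines are the ones quoted above:

```python
import torch

def fill_first_column(x: torch.Tensor) -> torch.Tensor:
    # Buggy version: torch.tensor(0) is created on the default (CPU)
    # device, which breaks when x lives on an accelerator:
    #   return x.index_fill(1, torch.tensor(0), 1)

    # Fixed version: create the index on the same device as x.
    index = torch.tensor(0, device=x.device)
    return x.index_fill(1, index, 1)
```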
Before submitting