Commit

Update a comment per NVIDIA/TransformerEngine#1350
wujingyue committed Dec 14, 2024
1 parent d53be45 commit 1846efe
Showing 1 changed file with 3 additions and 2 deletions.
5 changes: 3 additions & 2 deletions tests/python/test_transformer_engine.py
@@ -77,8 +77,9 @@ def test_transformer_layer(setup_process_group, benchmark, compute_type, paralle
hidden_size,
ffn_hidden_size,
num_heads,
- # https://github.com/NVIDIA/TransformerEngine/issues/1350: the
- # benchmark fails to execute on H100 with the default format (SBHD).
+ # According to https://github.com/NVIDIA/TransformerEngine/issues/1350,
+ # `attn_input_format` has to match the format of `transformer_layer`'s
+ # input.
attn_input_format="bshd",
set_parallel_mode=True,
sequence_parallel=(parallelism == Parallelism.SEQUENCE_PARALLEL),
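
The updated comment says `attn_input_format` has to match the layout of the tensor passed to `transformer_layer`. A minimal sketch of what "match" means, assuming `"sbhd"` denotes a sequence-first input of shape `(seq_len, batch_size, hidden_size)` and `"bshd"` a batch-first input of shape `(batch_size, seq_len, hidden_size)`. The helper names `expected_input_shape` and `input_matches_format` are hypothetical, written for illustration only, and are not part of TransformerEngine or of this test file:

```python
def expected_input_shape(attn_input_format, seq_len, batch_size, hidden_size):
    """Return the input shape implied by a layout string (hypothetical helper).

    Assumption: "sbhd" is sequence-first and "bshd" is batch-first, with the
    head and head-size dimensions merged into hidden_size at the layer input.
    """
    if attn_input_format == "sbhd":
        return (seq_len, batch_size, hidden_size)
    if attn_input_format == "bshd":
        return (batch_size, seq_len, hidden_size)
    raise ValueError(f"unsupported attn_input_format: {attn_input_format!r}")


def input_matches_format(shape, attn_input_format, seq_len, batch_size, hidden_size):
    """Check that an input shape agrees with the declared layout.

    Note: when seq_len == batch_size the two layouts have the same shape,
    so this check cannot tell them apart.
    """
    expected = expected_input_shape(attn_input_format, seq_len, batch_size, hidden_size)
    return tuple(shape) == expected


# A batch-first input agrees with "bshd" but not with the default "sbhd".
batch_first = (2, 128, 1024)  # (batch_size, seq_len, hidden_size)
print(input_matches_format(batch_first, "bshd", 128, 2, 1024))
print(input_matches_format(batch_first, "sbhd", 128, 2, 1024))
```

Under these assumptions, a test that feeds a batch-first tensor would pass `attn_input_format="bshd"`, as the hunk above does, rather than rely on the default.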
