Matmul Python Benchmarks #2088

Priya2698 · 2024-04-17T02:14:16Z

Fixes #2082

jacobhinkle · 2024-04-17T10:58:35Z

benchmarks/python/test_matmul.py

+
+def matmul_fusion(fd: FusionDefinition, dtype: DataType) -> None:
+    # Decide contiguity based on layout
+    a = fd.define_tensor(


Maybe we should pass a and b here instead as torch.Tensors then use fd.from_pytorch to define these. The reason is that this definition as stated will not adapt to different layouts, but we will get the right stride order for the operands from fd.from_pytorch.

That of course assumes we have #2058 fixed first. until then we probably need to pass in the layout or better yet bools indicating whether we need to transpose each operand first. For transposed operands we should allocate them like for example randn(k, m) instead of randn(m, k).

So, fd.from_pytorch, should be able to allocate the inputs correctly, right?
But fd.define_tensor with the contiguity flags will not work.

benchmarks/python/test_matmul.py

Priya2698 · 2024-04-17T20:00:40Z

For future, we should probably have a separate benchmark for baselines (eager / torchcompile).
While we run through ATen right now, those will be required later. Wdyt? @jacobhinkle. I can add them to this PR.

jacobhinkle · 2024-04-17T20:03:49Z

Maybe we should do that in another PR since the current one is enough to benchmark aten vs nvfuser using env vars.

Priya2698 · 2024-04-19T22:23:32Z

@jacobhinkle Should we merge this PR?

jacobhinkle

Full speed ahead! Thanks for the changes.

matmul benchmark

78911c8

jacobhinkle reviewed Apr 17, 2024

View reviewed changes

jacobhinkle added 3 commits April 17, 2024 12:47

Add license header

e48a2bd

Add matmul_problems.csv

bb84892

Load CSV to find benchmark problems

445bf4d

jacobhinkle reviewed Apr 17, 2024

View reviewed changes

benchmarks/python/test_matmul.py Show resolved Hide resolved

benchmarks/python/test_matmul.py Outdated Show resolved Hide resolved

review comments

7bee5c1

Priya2698 marked this pull request as ready for review April 17, 2024 19:24

fix stride order

8a64219

jacobhinkle approved these changes Apr 19, 2024

View reviewed changes

Priya2698 merged commit 5be8b33 into main Apr 19, 2024
4 checks passed

Priya2698 deleted the pm/benchmark_matmul branch April 19, 2024 22:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Matmul Python Benchmarks #2088

Matmul Python Benchmarks #2088

Priya2698 commented Apr 17, 2024 •

edited by jacobhinkle

Loading

jacobhinkle Apr 17, 2024

jacobhinkle Apr 17, 2024

Priya2698 Apr 17, 2024 •

edited

Loading

Priya2698 commented Apr 17, 2024 •

edited

Loading

jacobhinkle commented Apr 17, 2024

Priya2698 commented Apr 19, 2024

jacobhinkle left a comment

Matmul Python Benchmarks #2088

Matmul Python Benchmarks #2088

Conversation

Priya2698 commented Apr 17, 2024 • edited by jacobhinkle Loading

jacobhinkle Apr 17, 2024

Choose a reason for hiding this comment

jacobhinkle Apr 17, 2024

Choose a reason for hiding this comment

Priya2698 Apr 17, 2024 • edited Loading

Choose a reason for hiding this comment

Priya2698 commented Apr 17, 2024 • edited Loading

jacobhinkle commented Apr 17, 2024

Priya2698 commented Apr 19, 2024

jacobhinkle left a comment

Choose a reason for hiding this comment

Priya2698 commented Apr 17, 2024 •

edited by jacobhinkle

Loading

Priya2698 Apr 17, 2024 •

edited

Loading

Priya2698 commented Apr 17, 2024 •

edited

Loading