
Resolve conflicts by recomputation #3625

Merged
naoyam merged 16 commits into main on Dec 31, 2024

Conversation

naoyam (Collaborator) commented on Dec 20, 2024

Stacked on top of #3611

This PR resolves the conflicts found by the analysis added at #3611 by recomputing slice/pad input tensors. With this, fusions like ResizeSchedulerTest.SliceRotateCatResidual can be scheduled as a single kernel by the resize scheduler.
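For illustration, a fusion in the spirit of ResizeSchedulerTest.SliceRotateCatResidual might look like the following sketch. This is nvFuser C++ test-style code, not the actual test: the makeConcreteTensor helper and the integer start/stop slice overload are assumptions used only to show the shape of the problem.

// Sketch only; assumes the nvFuser test headers (e.g. <fusion.h>, <ops/all_ops.h>)
// and namespace nvfuser.
Fusion fusion;
FusionGuard fg(&fusion);

auto tv0 = makeConcreteTensor({16, 128});
fusion.addInput(tv0);

// The same producer is consumed by two different slices...
auto tv1 = slice(tv0, {0, 0}, {16, 64});    // first half of dim 1
auto tv2 = slice(tv0, {0, 64}, {16, 128});  // second half of dim 1

// ...which are concatenated in swapped order ("rotate") and added back
// to the unsliced input (the residual path).
auto tv3 = cat({tv2, tv1}, /*dim=*/1);
auto tv4 = add(tv3, tv0);
fusion.addOutput(tv4);

Here tv0 is consumed by two slices with different resize extents as well as by the residual add, which is the kind of conflicting use the analysis from #3611 flags. Recomputing the slice input per consumer removes the conflict, so the resize scheduler can pick a single reference tensor and generate one kernel.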

Recomputation is not the only possible way to resolve conflicts. We could, for example, cache a block of an input tensor so that multiple uses of the input are served from that block. That would look more like a producer-based scheduling approach. I prototyped it here, but it didn't perform well for RoPE.

naoyam force-pushed the resize_scheduler_recomputation branch from 5d1d07e to 7380a40 on December 20, 2024
naoyam (Collaborator, Author) commented on Dec 20, 2024

!test

Base automatically changed from resize_scheduler_exclusiveness to main on December 20, 2024
naoyam (Collaborator, Author) commented on Dec 20, 2024

!test

naoyam (Collaborator, Author) commented on Dec 20, 2024

!test

@@ -133,6 +120,30 @@ bool ResizeScheduler::canScheduleCompileTime(Fusion* fusion) {
return false;
}

for (auto out_tv : ir_utils::filterByType<TensorView>(fusion->outputs())) {
naoyam (Collaborator, Author) commented:

This check is needed since the non-exclusivity check is dropped. It was redundant before.

naoyam (Collaborator, Author) commented:

Most of the changes here are mechanical, following the change to the output type of getNonExclusiveResizeInfo.

naoyam requested a review from jacobhinkle on December 20, 2024
naoyam marked this pull request as ready for review on December 20, 2024
naoyam added the rope label on Dec 20, 2024
naoyam marked this pull request as draft on December 24, 2024
naoyam (Collaborator, Author) commented on Dec 24, 2024

Found a bug. Will update soon.

naoyam (Collaborator, Author) commented on Dec 24, 2024

!test

naoyam marked this pull request as ready for review on December 24, 2024
jacobhinkle (Collaborator) left a comment:

LGTM

csrc/scheduler/resize.cpp (outdated review thread, resolved)
/*require_all_to_visited=*/false)
.first;
for (const auto& [expr_g, dir] : exprs) {
if (expr_g->front()->isA<Resize>()) {
jacobhinkle (Collaborator) commented:

Is my understanding correct that we will segment if we have one resized output and one not resized?

addInput(tv0);
tv1 = 2 * tv0;
tv2 = slice(tv1);
tv3 = 3 * tv2;
addOutput(tv2);
addOutput(tv3);

In that case we will still have a resize between tv2 and tv3, but it seems like we would potentially be able to schedule it like tv3.

naoyam (Collaborator, Author) replied:

Yes, it is definitely possible, but picking a reference tensor is a non-trivial problem. If the dimensions of tv2 are not that different from tv1, either tv2 or tv3 should be fine. However, if they are significantly different, it's unclear whether we should fuse them or not. For a trivial fusion like this, it's definitely better to fuse. For more complex fusions, we may be able to generate more efficient segmented kernels. At this point, this is not something I'm trying to address.

naoyam (Collaborator, Author) commented on Dec 31, 2024

!build

naoyam (Collaborator, Author) commented on Dec 31, 2024

!build

naoyam merged commit f9d0efa into main on Dec 31, 2024
14 of 15 checks passed
naoyam deleted the resize_scheduler_recomputation branch on December 31, 2024