Add nonnegative_int type #1533

victorli2002 · 2024-10-31T04:51:03Z

Description of changes:

renamed ff_dim to ff_dim_t (uses nonnegative_int value)
added a nonnegative_int type with some unit tests
added a relative_ff_dim_t type (uses int value)
modified FFOrdered and slice functions to support new ff_dim_t and relative_ff_dim_t

This change is

codecov · 2024-10-31T20:31:05Z

Codecov Report

Attention: Patch coverage is 89.12752% with 81 lines in your changes missing coverage. Please review.

Project coverage is 79.05%. Comparing base (5e7f207) to head (22eda14).
Report is 5 commits behind head on master.

Files with missing lines	Patch %	Lines
...tils/include/utils/archetypes/ordered_value_type.h	0.00%	18 Missing ⚠️
lib/local-execution/src/ops/transpose.cc	0.00%	9 Missing ⚠️
lib/local-execution/src/ops/pool_2d.cc	0.00%	8 Missing ⚠️
lib/local-execution/src/ops/reverse.cc	0.00%	7 Missing ⚠️
lib/local-execution/src/ops/linear.cc	0.00%	6 Missing ⚠️
lib/local-execution/src/ops/split.cc	0.00%	5 Missing ⚠️
lib/pcg/src/pcg/computation_graph_builder.cc	77.27%	5 Missing ⚠️
...p-attrs/include/op-attrs/dim_ordered/dim_ordered.h	93.47%	3 Missing ⚠️
lib/op-attrs/src/op-attrs/ops/attention.cc	0.00%	3 Missing ⚠️
lib/op-attrs/src/op-attrs/parallel_tensor_shape.cc	50.00%	3 Missing ⚠️
... and 8 more

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1533      +/-   ##
==========================================
+ Coverage   78.50%   79.05%   +0.55%     
==========================================
  Files         867      875       +8     
  Lines       28904    29472     +568     
  Branches      778      778              
==========================================
+ Hits        22690    23298     +608     
+ Misses       6214     6174      -40

Flag	Coverage Δ
unittests	`79.05% <89.12%> (+0.55%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines	Coverage Δ
lib/kernels/include/kernels/array_shape.h	`100.00% <ø> (ø)`
.../op-attrs/include/op-attrs/dim_ordered/enumerate.h	`85.71% <100.00%> (ø)`
...include/op-attrs/dim_ordered/ff_ordered_from_map.h	`100.00% <100.00%> (ø)`
...b/op-attrs/include/op-attrs/dim_ordered/get_idxs.h	`100.00% <100.00%> (ø)`
lib/op-attrs/include/op-attrs/dim_ordered/slice.h	`100.00% <100.00%> (ø)`
lib/op-attrs/src/op-attrs/ff_dim_t.cc	`100.00% <100.00%> (ø)`
...-attrs/ops/attention/multihead_attention_inputs.cc	`51.21% <100.00%> (ø)`
...s/attention/multihead_attention_parallel_inputs.cc	`50.64% <100.00%> (+2.00%)`	⬆️
lib/op-attrs/src/op-attrs/ops/batch_matmul.cc	`64.04% <100.00%> (ø)`
lib/op-attrs/src/op-attrs/ops/batch_norm.cc	`83.80% <100.00%> (+0.34%)`	⬆️
... and 54 more

... and 40 files with indirect coverage changes

Marsella8

great PR! left some small comments and fyis, nothing too important : )

Reviewed 15 of 32 files at r1, 70 of 70 files at r2, all commit messages.
Reviewable status: all files reviewed, 8 unresolved discussions (waiting on @lockshaw and @victorli2002)

lib/op-attrs/test/src/op-attrs/dim_ordered/slice.cc line 7 at r2 (raw file):

TEST_SUITE(FF_TEST_SUITE) {
  TEST_CASE("slice(FFOrdered<T>, ff_dim_t, ff_dim_t)") {

Better to have a test case with 6 subcases, since this way you have to define d only once and you can then reuse it across subcases) (Or also 2 tests with 3 subcases each)

lib/op-attrs/src/op-attrs/ops/concat.cc line 50 at r2 (raw file):

    }
    std::map<ff_dim_t, size_t> returned_wrapped;
    for (const auto &[key, value] : returned.value()) {

see map_keys in utils/containers/map_keys.h: )

lib/op-attrs/include/op-attrs/dim_ordered/dim_ordered.h line 342 at r2 (raw file):

std::vector<ff_dim_t> inner_to_outer_idxs(FFOrdered<T> const &ff_ordered) {
  std::vector<ff_dim_t> idxs;
  for (size_t i = 0; i < ff_ordered.size(); i++) {

alternatively, you can also use range() in 'utils/containers/range.h' (containers in general is pretty awesome, there's a lot of useful functions), though obviously like this is perfectly fine.

lib/op-attrs/include/op-attrs/relative_ff_dim_t.h line 7 at r2 (raw file):

#include "rapidcheck.h"

namespace rc {

for rapidcheck support for dtgen types you can add the "rapidcheck" string in the features array (inside of the .struct.toml file). Not sure if it applies here, just an fyi : )

lib/utils/test/src/utils/nonnegative_int/nonnegative_int.cc line 26 at r2 (raw file):

    nonnegative_int nn_int_2 = nonnegative_int{2};
    SUBCASE("LHS: nonnegative_int, RHS: nonnegative_int, equal") {
      CHECK((nn_int_1a == nn_int_1b) == true);

to add support for doctest fmting, take a look at utils/fmt/ (this way you can do direct comparisons inside the CHECK)

lib/op-attrs/src/op-attrs/ff_dim_t.cc line 9 at r2 (raw file):

               [](int value) { return FlexFlow::nonnegative_int(value); }));
}
} // namespace rc

see my rapidcheck comment

lib/pcg/src/pcg/computation_graph_builder.cc line 853 at r2 (raw file):

  stack_vector<ff_dim_t, MAX_TENSOR_DIM> axes_stack;
  std::transform(axes.begin(),

betterutils/containers/transform.h (if you haven't tried it, might misbehave with stack_vector)

lib/op-attrs/include/op-attrs/dim_ordered/slice.h line 123 at r2 (raw file):

      d,
      std::optional<relative_ff_dim_t>{start},
      std::optional<relative_ff_dim_t>{end});

template
FFOrdered slice(FFOrdered const& d,
std::optional<relative_ff_dim_t> const& start = std::nullopt,
std::optional<relative_ff_dim_t> const& end = std::nullopt) {
return relative_ff_dim_t_nonoverloaded_slice(d, start, end);
}

As an alternative to group together these 3 overloads? (ditto for the other 3 overloads). If you pass a value to a function that takes in an optional it will get implictly casted (and also handles the additional case of both being std::nullopt, not sure if you'd want that or not).

Code quote:

template <typename T>
FFOrdered<T> slice(FFOrdered<T> const &d,
                   std::nullopt_t const &start,
                   relative_ff_dim_t const &end) {
  return relative_ff_dim_t_nonoverloaded_slice(
      d,
      std::optional<relative_ff_dim_t>{start},
      std::optional<relative_ff_dim_t>{end});
}

template <typename T>
FFOrdered<T> slice(FFOrdered<T> const &d,
                   relative_ff_dim_t const &start,
                   std::nullopt_t const &end) {
  return relative_ff_dim_t_nonoverloaded_slice(
      d,
      std::optional<relative_ff_dim_t>{start},
      std::optional<relative_ff_dim_t>{end});
}

template <typename T>
FFOrdered<T> slice(FFOrdered<T> const &d,
                   relative_ff_dim_t const &start,
                   relative_ff_dim_t const &end) {
  return relative_ff_dim_t_nonoverloaded_slice(
      d,
      std::optional<relative_ff_dim_t>{start},
      std::optional<relative_ff_dim_t>{end});

lockshaw

Reviewed 15 of 32 files at r1, 70 of 70 files at r2, all commit messages.
Reviewable status: all files reviewed, 19 unresolved discussions (waiting on @Marsella8 and @victorli2002)

lib/op-attrs/include/op-attrs/relative_ff_dim_t.h line 7 at r2 (raw file):

Previously, Marsella8 wrote…

for rapidcheck support for dtgen types you can add the "rapidcheck" string in the features array (inside of the .struct.toml file). Not sure if it applies here, just an fyi : )

Since ff_dim_t and relative_ff_dim_t are bounded by the MAX_TENSOR_DIM, unfortunately I don't think we can use the dtgen support as convenient as it would be.

lib/op-attrs/include/op-attrs/dim_ordered/dim_ordered.h line 342 at r2 (raw file):

Previously, Marsella8 wrote…

alternatively, you can also use range() in 'utils/containers/range.h' (containers in general is pretty awesome, there's a lot of useful functions), though obviously like this is perfectly fine.

range would definitely be good, but honestly a bunch of these index manipulation functions are super out-of-date and should probably just be removed entirely. @victorli2002 can you do a quick pass and remove any that aren't used anymore?

lib/op-attrs/include/op-attrs/dim_ordered/slice.h line 123 at r2 (raw file):

Previously, Marsella8 wrote…

template
FFOrdered slice(FFOrdered const& d,
std::optional<relative_ff_dim_t> const& start = std::nullopt,
std::optional<relative_ff_dim_t> const& end = std::nullopt) {
return relative_ff_dim_t_nonoverloaded_slice(d, start, end);
}

As an alternative to group together these 3 overloads? (ditto for the other 3 overloads). If you pass a value to a function that takes in an optional it will get implictly casted (and also handles the additional case of both being std::nullopt, not sure if you'd want that or not).

IIRC we were having some issues with overload resolution functioning properly with the unified signature, though I unfortunately seem to have forgotten what the exact issue was. @victorli2002 if you want you could try changing to @Marsella8's proposed signature and see if anything breaks

lib/op-attrs/src/op-attrs/ff_dim_t.cc line 9 at r2 (raw file):

Previously, Marsella8 wrote…

see my rapidcheck comment

Link: https://reviewable.io/reviews/flexflow/FlexFlow/1533#-OBmK6xk5zBEDvogOzc_

lib/utils/test/src/utils/nonnegative_int/nonnegative_int.cc line 26 at r2 (raw file):

Previously, Marsella8 wrote…

to add support for doctest fmting, take a look at utils/fmt/ (this way you can do direct comparisons inside the CHECK)

IIRC all ostream-able types should automatically be printable by doctest. What error are you running into @victorli2002?

@Marsella8 is correct that this code should be refactored to CHECK(nn_int_1a == nn_int_1b)

lib/local-execution/src/legion_tensor_shape.cc line 10 at r2 (raw file):

}

legion_dim_t legion_dim_from_ff_dim(ff_dim_t ff_dim, TensorShape const &shape) {

Might be better to just forward the call to legion_dim_from_ff_dim(ff_dim_t, int) in legion_dim.cc (doesn't really have to do with your changes, but while we're looking at this file)

lib/local-execution/src/ops/split.cc line 109 at r2 (raw file):

                    out_block_size[i],
                    output_grad.shape,
                    attrs.axis.value.get_value());

Might be better to instead change the signature of calc_block_size?

Code quote:

attrs.axis.value.get_value());

lib/op-attrs/include/op-attrs/relative_ff_dim_t.h line 9 at r2 (raw file):

namespace rc {
template <>
struct Arbitrary<FlexFlow::relative_ff_dim_t> {

Not a big deal, but absolute namespaces are a bit less ambiguous

Suggestion:

struct Arbitrary<::FlexFlow::relative_ff_dim_t> {

lib/op-attrs/include/op-attrs/relative_ff_dim_t.struct.toml line 10 at r2 (raw file):

  "json",
  "fmt",
]

Suggestion:

features = [
  "eq",
  "ord",
  "hash",
  "json",
  "fmt",
  "rapidcheck",
]

lib/utils/include/utils/nonnegative_int/nonnegative_int.h line 48 at r2 (raw file):

  int value_;
};
} // namespace FlexFlow

Add support for fmt (https://fmt.dev/11.0/)

Suggestion:

private:
  int value_;
};

int format_as(nonnegative_int);

} // namespace FlexFlow

lib/op-attrs/include/op-attrs/dim_ordered/enumerate.h line 21 at r2 (raw file):

std::map<ff_dim_t, T> enumerate(FFOrdered<T> const &ff_ordered) {
  std::map<ff_dim_t, T> result;
  for (int raw_ff_dim : count(ff_ordered.size())) {

We could probably even change count to return nonnegative_ints (doesn't have to be done now, but could be an option in the future)

Code quote:

count(ff_ordered.size())

lib/pcg/src/pcg/computation_graph_builder.cc line 795 at r2 (raw file):

    std::optional<std::string> const &maybe_name) {

  ConcatAttrs attrs = ConcatAttrs{ff_dim_t{nonnegative_int{axis}}};

We actually probably want axis to be treated as relative here (i.e., this function should take in axis, wrap it as a relative_ff_dim_t, and then do the conversion to ff_dim_t and store the ff_dim_t in attrs)

Code quote:

  ConcatAttrs attrs = ConcatAttrs{ff_dim_t{nonnegative_int{axis}}};

lib/pcg/src/pcg/computation_graph_builder.cc line 813 at r2 (raw file):

tensor_guid_t ComputationGraphBuilder::flat(
    tensor_guid_t const &input,
    int start_dim,

We actually probably want start_dim and end_dim to be treated as relative here

lib/pcg/src/pcg/computation_graph_builder.cc line 900 at r2 (raw file):

tensor_guid_t ComputationGraphBuilder::softmax(
    tensor_guid_t const &input,
    std::optional<int> maybe_dim,

maybe_dim here should be treated as relative (generally any dim in the ComputationGraphBuilder interface should be treated as relative)

lib/local-execution/src/ops/linear.cc line 99 at r2 (raw file):

  auto attrs = acc.get_argument<LinearAttrs>(ATTRS);

  int in_dim = input.shape.at(ff_dim_t{nonnegative_int{0}}) + 1;

@reyna-abhyankar Why is there a + 1 here?

lib/local-execution/src/ops/linear.cc line 143 at r2 (raw file):

  }

  int in_dim = input.shape.at(ff_dim_t{nonnegative_int{0}}) + 1;

@reyna-abhyankar Why is there a + 1 here?

Code quote:

int in_dim = input.shape.at(ff_dim_t{nonnegative_int{0}}) + 1;

lib/op-attrs/include/op-attrs/parallel_tensor_dim_idx_t.variant.toml line 17 at r2 (raw file):

[[values]]
type = "::FlexFlow::relative_ff_dim_t"

I'm not sure this should be relative (just style-wise/semantically). What breaks if you make it ff_dim_t?

lib/op-attrs/include/op-attrs/dim_ordered/dim_ordered.h line 157 at r2 (raw file):

template <typename T>
struct DimOrdered<ff_dim_t, T> {

FYI I'm not necessarily a huge fan of doing this via template specialization, but for this PR it's fine.

lib/utils/test/src/utils/nonnegative_int/nonnegative_int.cc line 45 at r2 (raw file):

  }

  TEST_CASE("nonnegative_int != comparisons") {

Some of these operator tests might(?) be cleaner with rapidcheck, e.g., you could check that <= is the same as < or ==. What you have here is fine too though

lib/utils/test/src/utils/nonnegative_int/nonnegative_int.cc line 201 at r2 (raw file):

  }

  TEST_CASE("nonnegative_int adl_serializer") {

Suggestion:

 TEST_CASE("adl_serializer<nonnegative_int>") {

lib/op-attrs/src/op-attrs/ops/combine.cc line 32 at r2 (raw file):

  ParallelTensorShape output = input;
  shard_dim_at_idx(output,

Might not be the worst idea to add a conversion function from ff_dim_t to relative_ff_dim_t?

lib/kernels/test/src/test_transpose_kernel.cc line 11 at r2 (raw file):

    std::vector<ff_dim_t> perm = {ff_dim_t{nonnegative_int{0}},
                                  ff_dim_t{nonnegative_int{0}}};

Shouldn't this be 1 based on the old code?

Suggestion:

                             ff_dim_t{nonnegative_int{1}}};

victorli2002

Reviewable status: all files reviewed, 19 unresolved discussions (waiting on @lockshaw and @Marsella8)

lib/kernels/test/src/test_transpose_kernel.cc line 11 at r2 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Shouldn't this be 1 based on the old code?

Done.

lib/op-attrs/include/op-attrs/dim_ordered/dim_ordered.h line 342 at r2 (raw file):

Previously, lockshaw (Colin Unger) wrote…

range would definitely be good, but honestly a bunch of these index manipulation functions are super out-of-date and should probably just be removed entirely. @victorli2002 can you do a quick pass and remove any that aren't used anymore?

Done.

lib/op-attrs/include/op-attrs/dim_ordered/slice.h line 123 at r2 (raw file):

Previously, lockshaw (Colin Unger) wrote…

IIRC we were having some issues with overload resolution functioning properly with the unified signature, though I unfortunately seem to have forgotten what the exact issue was. @victorli2002 if you want you could try changing to @Marsella8's proposed signature and see if anything breaks

Tried it, got the following error message (which iirc was the same message I got earlier).

Code snippet:

/home/vli42/ff/nn_int/lib/op-attrs/include/op-attrs/dim_ordered/slice.h:17:21: error: could not convert 'FlexFlow::transform<FlexFlow::nonoverloaded_slice<FlexFlow::ff_dim_t, int>(const FlexFlow::DimOrdered<FlexFlow::ff_dim_t, int>&, const std::optional<FlexFlow::ff_dim_t>&, const std::optional<FlexFlow::ff_dim_t>&)::<lambda(const std::optional<FlexFlow::ff_dim_t>&)>::<lambda(const FlexFlow::ff_dim_t&)>, FlexFlow::ff_dim_t>((* & idx), <lambda closure object>FlexFlow::nonoverloaded_slice<FlexFlow::ff_dim_t, int>(const FlexFlow::DimOrdered<FlexFlow::ff_dim_t, int>&, const std::optional<FlexFlow::ff_dim_t>&, const std::optional<FlexFlow::ff_dim_t>&)::<lambda(const std::optional<FlexFlow::ff_dim_t>&)>::<lambda(const FlexFlow::ff_dim_t&)>{})' from 'optional<FlexFlow::nonnegative_int>' to 'optional<int>'
   17 |     return transform(idx, [](Idx const &i) { return i.value; });
      |            ~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |                     |
      |                     optional<FlexFlow::nonnegative_int>

lib/op-attrs/src/op-attrs/ops/combine.cc line 32 at r2 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Might not be the worst idea to add a conversion function from ff_dim_t to relative_ff_dim_t?

Done.

lib/op-attrs/src/op-attrs/ops/concat.cc line 50 at r2 (raw file):

Previously, Marsella8 wrote…

see map_keys in utils/containers/map_keys.h: )

That map_keys takes std::unordered_map, if I'm not mistaken. Is there a good way to use it with std::map?

lib/op-attrs/test/src/op-attrs/dim_ordered/slice.cc line 7 at r2 (raw file):

Previously, Marsella8 wrote…

Better to have a test case with 6 subcases, since this way you have to define d only once and you can then reuse it across subcases) (Or also 2 tests with 3 subcases each)

Done.

lib/pcg/src/pcg/computation_graph_builder.cc line 795 at r2 (raw file):

Previously, lockshaw (Colin Unger) wrote…

We actually probably want axis to be treated as relative here (i.e., this function should take in axis, wrap it as a relative_ff_dim_t, and then do the conversion to ff_dim_t and store the ff_dim_t in attrs)

Done.

lib/pcg/src/pcg/computation_graph_builder.cc line 813 at r2 (raw file):

Previously, lockshaw (Colin Unger) wrote…

We actually probably want start_dim and end_dim to be treated as relative here

Done.

lib/pcg/src/pcg/computation_graph_builder.cc line 853 at r2 (raw file):

Previously, Marsella8 wrote…

betterutils/containers/transform.h (if you haven't tried it, might misbehave with stack_vector)

Done.

lib/pcg/src/pcg/computation_graph_builder.cc line 900 at r2 (raw file):

Previously, lockshaw (Colin Unger) wrote…

maybe_dim here should be treated as relative (generally any dim in the ComputationGraphBuilder interface should be treated as relative)

Done.

lib/utils/test/src/utils/nonnegative_int/nonnegative_int.cc line 26 at r2 (raw file):

Previously, lockshaw (Colin Unger) wrote…

IIRC all ostream-able types should automatically be printable by doctest. What error are you running into @victorli2002?

@Marsella8 is correct that this code should be refactored to CHECK(nn_int_1a == nn_int_1b)

Done.

lib/utils/test/src/utils/nonnegative_int/nonnegative_int.cc line 201 at r2 (raw file):

  }

  TEST_CASE("nonnegative_int adl_serializer") {

Done.

lib/op-attrs/include/op-attrs/relative_ff_dim_t.struct.toml line 10 at r2 (raw file):

  "json",
  "fmt",
]

The original doesn't have ff_dim.struct.toml; why do we need it here?

victorli2002

Reviewable status: 67 of 88 files reviewed, 19 unresolved discussions (waiting on @lockshaw and @Marsella8)

lib/op-attrs/include/op-attrs/parallel_tensor_dim_idx_t.variant.toml line 17 at r2 (raw file):

Previously, lockshaw (Colin Unger) wrote…

I'm not sure this should be relative (just style-wise/semantically). What breaks if you make it ff_dim_t?

Done.

lib/utils/include/utils/nonnegative_int/nonnegative_int.h line 48 at r2 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Add support for fmt (https://fmt.dev/11.0/)

Done.

lockshaw

Reviewed 21 of 21 files at r3, 2 of 2 files at r4, all commit messages.
Reviewable status: all files reviewed, 20 unresolved discussions (waiting on @Marsella8 and @victorli2002)

lib/op-attrs/include/op-attrs/relative_ff_dim_t.struct.toml line 10 at r2 (raw file):

Previously, victorli2002 (Victor Li) wrote…

The original doesn't have ff_dim.struct.toml; why do we need it here?

True, this shouldn't be here due to https://reviewable.io/reviews/flexflow/flexflow-train/1533#-OBmK6xk5zBEDvogOzc_

lib/op-attrs/include/op-attrs/dim_ordered/slice.h line 123 at r2 (raw file):

Previously, victorli2002 (Victor Li) wrote…

Tried it, got the following error message (which iirc was the same message I got earlier).

That type error seems unsurprising: line 17 declares that it will return a std::optional<int>, but instead of returning i.value.get_value() you return i.value, which would result in returning a std::optional<nonnegative_int>. There very well may be a type issue, but this doesn't seem like one of them.

I'd recommend using utils/archetypes/value_type.h to force these overloads to be typechecked, e.g., https://github.com/flexflow/FlexFlow/blob/89c143d33b9e02faf8d6818a9c47b0d5da68a610/lib/utils/src/utils/containers/get_all_assignments.cc#L6-L10

lib/op-attrs/src/op-attrs/ops/concat.cc line 50 at r2 (raw file):

Previously, victorli2002 (Victor Li) wrote…

That map_keys takes std::unordered_map, if I'm not mistaken. Is there a good way to use it with std::map?

Add an overload for std::map--we often just materialize the overloads lazily as needed rather than trying to get them all up-front

lib/op-attrs/test/src/op-attrs/dim_ordered/slice.cc line 7 at r2 (raw file):

Previously, victorli2002 (Victor Li) wrote…

Done.

In general we prefer one test-case per function, but in cases like this where there's a huge degree of overlap combining them can be a good idea

lib/pcg/src/pcg/computation_graph_builder.cc line 853 at r2 (raw file):

Previously, victorli2002 (Victor Li) wrote…

Done.

I'm still seeing std::transform?

lib/op-attrs/test/src/op-attrs/ff_dim_t.cc line 7 at r4 (raw file):

TEST_SUITE(FF_TEST_SUITE) {
  TEST_CASE("FF_DIM_T_TO_RELATIVE_FF_DIM_T") {

Remove all caps?

Code quote:

 TEST_CASE("FF_DIM_T_TO_RELATIVE_FF_DIM_T") {

lib/op-attrs/include/op-attrs/relative_ff_dim_t.h line 9 at r4 (raw file):

namespace FlexFlow {
ff_dim_t relative_ff_dim_t_to_ff_dim_t(relative_ff_dim_t ff_dim, int input_dim);

Could make this a nonnegative_int if you wanted

Code quote:

int input_dim)

lib/op-attrs/test/src/op-attrs/dim_ordered/slice.cc line 7 at r4 (raw file):

TEST_SUITE(FF_TEST_SUITE) {
  TEST_CASE("slice(FFOrdered<T>, ..., ...") {

Test error cases (e.g., out-of-bounds behavior, start being > stop, etc.)

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 7 at r4 (raw file):

TEST_SUITE(FF_TEST_SUITE) {
  TEST_CASE("RELATIVE_FF_DIM_T_TO_FF_DIM_T") {

Remove all caps?

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 34 at r4 (raw file):

      relative_ff_dim_t relative_ff_dim = relative_ff_dim_t{-10};
      CHECK_THROWS(relative_ff_dim_t_to_ff_dim_t(relative_ff_dim, input_dim));
    }

Also check the other out of range (i.e., too high)

Code quote:

    SUBCASE("OUT OF RANGE") {
      relative_ff_dim_t relative_ff_dim = relative_ff_dim_t{-10};
      CHECK_THROWS(relative_ff_dim_t_to_ff_dim_t(relative_ff_dim, input_dim));
    }

lib/op-attrs/src/op-attrs/relative_ff_dim_t.cc line 5 at r4 (raw file):

namespace FlexFlow {
ff_dim_t relative_ff_dim_t_to_ff_dim_t(relative_ff_dim_t ff_dim,

What's the expected behavior if ff_dim.value > input_dim? I'm fine with the behavior in the current code below, just make sure it's tested

lib/op-attrs/src/op-attrs/ops/layer_norm.cc line 169 at r4 (raw file):

      transform(non_layer_norm_dim_idxs, [&](ff_dim_t const &dim_idx) {
        return shard_dim_at_idx(input_shape,
                                ff_dim_t_to_relative_ff_dim_t(dim_idx));

I'd prefer relative_ff_dim_t_from_ff_dim_t as the sides of the name match up nicer (i.e., the argument type is next to where the argument actually is) (and usually same for other conversion functions)

Code quote:

ff_dim_t_to_relative_ff_dim_t

lib/op-attrs/include/op-attrs/dim_ordered/dim_ordered.h line 335 at r4 (raw file):

template <typename T>
std::vector<ff_dim_t> inner_to_outer_idxs(FFOrdered<T> const &ff_ordered) {

Out of curiosity, where is this used?

lib/op-attrs/include/op-attrs/dim_ordered/dim_ordered.h line 337 at r4 (raw file):

std::vector<ff_dim_t> inner_to_outer_idxs(FFOrdered<T> const &ff_ordered) {
  std::vector<ff_dim_t> idxs;
  for (int i : range(0, ff_ordered.size())) {

Suggestion:

  for (int i : range(ff_ordered.size())) {

lib/op-attrs/src/op-attrs/parallel_tensor_shape.cc line 144 at r4 (raw file):

  std::unordered_set<parallel_tensor_dim_idx_t> indices;
  extend(indices, transform(range(num_shard_dims(shape.dims)), [](int idx) {
           return parallel_tensor_dim_idx_t(ff_dim_t{nonnegative_int{idx}});

Prefer brace initialization

Suggestion:

           return parallel_tensor_dim_idx_t{ff_dim_t{nonnegative_int{idx}}};

lib/op-attrs/src/op-attrs/parallel_tensor_shape.cc line 146 at r4 (raw file):

           return parallel_tensor_dim_idx_t(ff_dim_t{nonnegative_int{idx}});
         }));
  indices.insert(parallel_tensor_dim_idx_t(ReplicaType::SUM));

Suggestion:

  indices.insert(parallel_tensor_dim_idx_t{ReplicaType::SUM});

lib/op-attrs/src/op-attrs/parallel_tensor_shape.cc line 147 at r4 (raw file):

         }));
  indices.insert(parallel_tensor_dim_idx_t(ReplicaType::SUM));
  indices.insert(parallel_tensor_dim_idx_t(ReplicaType::DISCARD_COPY));

Suggestion:

  indices.insert(parallel_tensor_dim_idx_t{ReplicaType::DISCARD_COPY});

lib/utils/test/src/utils/nonnegative_int/nonnegative_int.cc line 250 at r4 (raw file):

  }

  TEST_CASE("nonnegative int fmt::to_string") {

Suggestion:

  TEST_CASE("fmt::to_string(nonnegative_int)") {

victorli2002

Reviewable status: 72 of 88 files reviewed, 20 unresolved discussions (waiting on @lockshaw and @Marsella8)

lib/op-attrs/include/op-attrs/relative_ff_dim_t.h line 9 at r4 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Could make this a nonnegative_int if you wanted

I think this change would fight better in the later PR where I put nonnegative_int in more places.

lib/op-attrs/include/op-attrs/dim_ordered/dim_ordered.h line 335 at r4 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Out of curiosity, where is this used?

it's in local-execution/src/ops/transpose.cc

lib/op-attrs/src/op-attrs/parallel_tensor_shape.cc line 144 at r4 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Prefer brace initialization

Done.

lib/op-attrs/src/op-attrs/relative_ff_dim_t.cc line 5 at r4 (raw file):

Previously, lockshaw (Colin Unger) wrote…

What's the expected behavior if ff_dim.value > input_dim? I'm fine with the behavior in the current code below, just make sure it's tested

Added a test; currently it just makes a ff_dim_t with the same value as ff_dim.value.

lib/op-attrs/src/op-attrs/ops/concat.cc line 50 at r2 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Add an overload for std::map--we often just materialize the overloads lazily as needed rather than trying to get them all up-front

Done.

lib/op-attrs/src/op-attrs/ops/layer_norm.cc line 169 at r4 (raw file):

Previously, lockshaw (Colin Unger) wrote…

I'd prefer relative_ff_dim_t_from_ff_dim_t as the sides of the name match up nicer (i.e., the argument type is next to where the argument actually is) (and usually same for other conversion functions)

Done.

lib/op-attrs/test/src/op-attrs/ff_dim_t.cc line 7 at r4 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Remove all caps?

Done.

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 7 at r4 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Remove all caps?

Done.

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 34 at r4 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Also check the other out of range (i.e., too high)

Done.

lib/op-attrs/test/src/op-attrs/dim_ordered/slice.cc line 7 at r4 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Test error cases (e.g., out-of-bounds behavior, start being > stop, etc.)

Done.

lib/pcg/src/pcg/computation_graph_builder.cc line 853 at r2 (raw file):

Previously, lockshaw (Colin Unger) wrote…

I'm still seeing std::transform?

Done.

lib/op-attrs/include/op-attrs/dim_ordered/dim_ordered.h line 337 at r4 (raw file):

std::vector<ff_dim_t> inner_to_outer_idxs(FFOrdered<T> const &ff_ordered) {
  std::vector<ff_dim_t> idxs;
  for (int i : range(0, ff_ordered.size())) {

Done.

lib/utils/test/src/utils/nonnegative_int/nonnegative_int.cc line 250 at r4 (raw file):

  }

  TEST_CASE("nonnegative int fmt::to_string") {

Done.

lib/op-attrs/src/op-attrs/parallel_tensor_shape.cc line 146 at r4 (raw file):

           return parallel_tensor_dim_idx_t(ff_dim_t{nonnegative_int{idx}});
         }));
  indices.insert(parallel_tensor_dim_idx_t(ReplicaType::SUM));

Done.

lib/op-attrs/src/op-attrs/parallel_tensor_shape.cc line 147 at r4 (raw file):

         }));
  indices.insert(parallel_tensor_dim_idx_t(ReplicaType::SUM));
  indices.insert(parallel_tensor_dim_idx_t(ReplicaType::DISCARD_COPY));

Done.

victorli2002

Reviewable status: 71 of 88 files reviewed, 20 unresolved discussions (waiting on @lockshaw and @Marsella8)

lib/op-attrs/include/op-attrs/dim_ordered/slice.h line 123 at r2 (raw file):

Previously, lockshaw (Colin Unger) wrote…

That type error seems unsurprising: line 17 declares that it will return a std::optional<int>, but instead of returning i.value.get_value() you return i.value, which would result in returning a std::optional<nonnegative_int>. There very well may be a type issue, but this doesn't seem like one of them.

I'd recommend using utils/archetypes/value_type.h to force these overloads to be typechecked, e.g., https://github.com/flexflow/FlexFlow/blob/89c143d33b9e02faf8d6818a9c47b0d5da68a610/lib/utils/src/utils/containers/get_all_assignments.cc#L6-L10

Done.

lockshaw

Reviewed 16 of 16 files at r5, 1 of 1 files at r7, all commit messages.
Dismissed @Marsella8 from 8 discussions.
Reviewable status: all files reviewed, 17 unresolved discussions (waiting on @victorli2002)

lib/op-attrs/include/op-attrs/dim_ordered/dim_ordered.h line 335 at r4 (raw file):

Previously, victorli2002 (Victor Li) wrote…

it's in local-execution/src/ops/transpose.cc

If that's the only place, I'd just inline this function there and then remove the definition here--I don't think this is an operation that necessarily makes a whole lot of sense to have on FFOrdered

lib/op-attrs/include/op-attrs/dim_ordered/slice.h line 13 at r7 (raw file):

namespace FlexFlow {

using T = value_type<0>;

Should be done in the slice.cc file, not in the .h file

Code quote:

using T = value_type<0>;

lib/op-attrs/test/src/op-attrs/ff_dim_t.cc line 7 at r7 (raw file):

TEST_SUITE(FF_TEST_SUITE) {
  TEST_CASE("ff_dim_to_to_relative_ff_dim_t") {

Suggestion:

  TEST_CASE("relative_ff_dim_t_from_ff_dim_t") {

lib/op-attrs/test/src/op-attrs/ff_dim_t.cc line 8 at r7 (raw file):

TEST_SUITE(FF_TEST_SUITE) {
  TEST_CASE("ff_dim_to_to_relative_ff_dim_t") {
    SUBCASE("Zero") {

Suggestion:

    SUBCASE("absolute index is zero") {

lib/op-attrs/test/src/op-attrs/ff_dim_t.cc line 15 at r7 (raw file):

    }

    SUBCASE("Positive") {

Suggestion:

    SUBCASE("absolute index is positive") {

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 10 at r7 (raw file):

    int input_dim = 5;

    SUBCASE("Zero") {

Suggestion:

    SUBCASE("relative index is zero") {

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 13 at r7 (raw file):

      relative_ff_dim_t relative_ff_dim = relative_ff_dim_t{0};
      ff_dim_t ff_dim =
          ff_dim_t_from_relative_ff_dim_t_ff_dim_t(relative_ff_dim, input_dim);

Suggestion:

ff_dim_t_from_relative_ff_dim_t

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 17 at r7 (raw file):

    }

    SUBCASE("Positive") {

Suggestion:

   SUBCASE("relative index is positive") {

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 24 at r7 (raw file):

    }

    SUBCASE("Negative") {

Suggestion:

    SUBCASE("relative index is negative") {

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 31 at r7 (raw file):

    }

    SUBCASE("Negative out of range") {

Ideally two nested subcases under "relative index is negative"--one where it is in range and one where it is out of range

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 42 at r7 (raw file):

          ff_dim_t_from_relative_ff_dim_t_ff_dim_t(relative_ff_dim, input_dim);
      CHECK(ff_dim == ff_dim_t{nonnegative_int{10}});
    }

Ideally two nested subcases under "relative index is positive"--one where it is in range, and one where it is out of range

Code quote:

    SUBCASE("Positive out of range") {
      relative_ff_dim_t relative_ff_dim = relative_ff_dim_t{10};
      ff_dim_t ff_dim =
          ff_dim_t_from_relative_ff_dim_t_ff_dim_t(relative_ff_dim, input_dim);
      CHECK(ff_dim == ff_dim_t{nonnegative_int{10}});
    }

lib/op-attrs/test/src/op-attrs/dim_ordered/slice.cc line 68 at r7 (raw file):

      CHECK(result == correct);
    }
    SUBCASE("start index > stop index") {

Is there a reason we don't throw in this case?

lib/op-attrs/src/op-attrs/ff_dim_t.cc line 13 at r7 (raw file):

  return gen::construct<::FlexFlow::ff_dim_t>(
      gen::map(gen::inRange<int>(0, MAX_TENSOR_DIM),
               [](int value) { return FlexFlow::nonnegative_int(value); }));

Suggestion:

               [](int value) { return FlexFlow::nonnegative_int{value}; }));

lib/pcg/src/pcg/computation_graph_builder.cc line 865 at r7 (raw file):

            std::back_inserter(axes_stack),
            [&input_shape](int axis) {
              return ff_dim_t_from_relative_ff_dim_t_ff_dim_t(

Normalization should be done before the bounds-checking on axes, not afterward

lib/pcg/src/pcg/computation_graph_builder.cc line 867 at r7 (raw file):

              return ff_dim_t_from_relative_ff_dim_t_ff_dim_t(
                  relative_ff_dim_t{axis}, num_dims(input_shape));
            });

Suggestion:

  stack_vector<ff_dim_t, MAX_TENSOR_DIM> axes_stack = 
    transform(axes, [&](int axis) {
      return ff_dim_t_from_relative_ff_dim_t_ff_dim_t(
        relative_ff_dim_t{axis}, num_dims(input_shape));
    });

lib/pcg/src/pcg/computation_graph_builder.cc line 926 at r7 (raw file):

  }

  SoftmaxAttrs attrs = SoftmaxAttrs{ff_dim_t_from_relative_ff_dim_t_ff_dim_t(

Normalization should be ideally done before the bounds checking on dim

Code quote:

ff_dim_t_from_relative_ff_dim_t_ff_dim_t(

victorli2002

Reviewable status: all files reviewed, 17 unresolved discussions (waiting on @lockshaw)

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 31 at r7 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Ideally two nested subcases under "relative index is negative"--one where it is in range and one where it is out of range

Done.

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 42 at r7 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Ideally two nested subcases under "relative index is positive"--one where it is in range, and one where it is out of range

Done.

lib/op-attrs/test/src/op-attrs/dim_ordered/slice.cc line 68 at r7 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Is there a reason we don't throw in this case?

That's just how the behavior works atm, can fix if needed. (It uses the subvec function)

lib/pcg/src/pcg/computation_graph_builder.cc line 865 at r7 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Normalization should be done before the bounds-checking on axes, not afterward

Done.

lib/pcg/src/pcg/computation_graph_builder.cc line 926 at r7 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Normalization should be ideally done before the bounds checking on dim

Done.

lib/op-attrs/test/src/op-attrs/ff_dim_t.cc line 7 at r7 (raw file):

TEST_SUITE(FF_TEST_SUITE) {
  TEST_CASE("ff_dim_to_to_relative_ff_dim_t") {

Done.

lib/op-attrs/test/src/op-attrs/ff_dim_t.cc line 8 at r7 (raw file):

TEST_SUITE(FF_TEST_SUITE) {
  TEST_CASE("ff_dim_to_to_relative_ff_dim_t") {
    SUBCASE("Zero") {

Done.

lib/op-attrs/test/src/op-attrs/ff_dim_t.cc line 15 at r7 (raw file):

    }

    SUBCASE("Positive") {

Done.

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 10 at r7 (raw file):

    int input_dim = 5;

    SUBCASE("Zero") {

Done.

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 13 at r7 (raw file):

      relative_ff_dim_t relative_ff_dim = relative_ff_dim_t{0};
      ff_dim_t ff_dim =
          ff_dim_t_from_relative_ff_dim_t_ff_dim_t(relative_ff_dim, input_dim);

Done.

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 17 at r7 (raw file):

    }

    SUBCASE("Positive") {

Done.

lib/op-attrs/test/src/op-attrs/relative_ff_dim_t.cc line 24 at r7 (raw file):

    }

    SUBCASE("Negative") {

Done.

lib/op-attrs/src/op-attrs/ff_dim_t.cc line 13 at r7 (raw file):

  return gen::construct<::FlexFlow::ff_dim_t>(
      gen::map(gen::inRange<int>(0, MAX_TENSOR_DIM),
               [](int value) { return FlexFlow::nonnegative_int(value); }));

Done.

lockshaw

Reviewed 28 of 28 files at r8, all commit messages.
Reviewable status: all files reviewed, 13 unresolved discussions (waiting on @victorli2002)

lib/utils/src/utils/stack_vector/stack_vector_of.cc line 1 at r8 (raw file):

#include "utils/stack_vector/stack_vector_of.h"

Add explicit instantiation of stack_vector_of for stack_vector<value_type, ...>

lib/local-execution/src/ops/transpose.cc line 43 at r8 (raw file):

  auto const &attrs = acc.get_argument<TransposeAttrs>(ATTRS);
  std::vector<ff_dim_t> perm;
  size_t const &size = attrs.perm.size();

Don't use variables with reference type unless you're certain they're useful (which in this case it's not, the pointer is probably no smaller than the size_t anyway)

Suggestion:

size_t size = attrs.perm.size();

lib/local-execution/src/ops/transpose.cc line 46 at r8 (raw file):

  for (size_t const i : range(size)) {
    perm.push_back(ff_dim_t{nonnegative_int{size - i - 1}});
  }

Avoid unnecessary const

Suggestion:

  for (size_t i : range(size)) {
    perm.push_back(ff_dim_t{nonnegative_int{size - i - 1}});
  }

lib/local-execution/src/ops/transpose.cc line 46 at r8 (raw file):

  for (size_t const i : range(size)) {
    perm.push_back(ff_dim_t{nonnegative_int{size - i - 1}});
  }

Prefer performing multi-step initialization in its own block unless it's essentially the only thing in the current block

Suggestion:

  std::vector<ff_dim_t> perm = [&] {
    std::vector<ff_dim_t> result;
    size_t size = attrs.perm.size();
    for (size_t const i : range(size)) {
      result.push_back(ff_dim_t{nonnegative_int{size - i - 1}});
    }
    return result;
  }();

lib/utils/src/utils/stack_vector/stack_vector.cc line 1 at r8 (raw file):

#include "utils/stack_vector/stack_vector.h"

Add explicit instantiation of stack_vector<value_type, ...>

lib/op-attrs/src/op-attrs/dim_ordered/slice.cc line 11 at r8 (raw file):

using T = value_type<0>;

} // namespace FlexFlow

You actually have to instantiate the template functions (the using T is just declaring a type alias)

Suggestion:

namespace FlexFlow {

using T = value_type<0>;

template
  FFOrdered<T> ff_dim_t_nonoverloaded_slice(FFOrdered<T> const &d,
                                            std::optional<ff_dim_t> const &start,
                                            std::optional<ff_dim_t> const &end);

template
  FFOrdered<T> relative_ff_dim_t_nonoverloaded_slice(
    FFOrdered<T> const &d,
    std::optional<relative_ff_dim_t> const &start,
    std::optional<relative_ff_dim_t> const &end);

template
  FFOrdered<T> slice(FFOrdered<T> const &d,
                     std::optional<ff_dim_t> const &start,
                     std::optional<ff_dim_t> const &end);

template
  FFOrdered<T> slice(FFOrdered<T> const &d,
                     std::optional<relative_ff_dim_t> const &start,
                     std::optional<relative_ff_dim_t> const &end);

} // namespace FlexFlow

lib/pcg/src/pcg/computation_graph_builder.cc line 860 at r8 (raw file):

  if (any_of(axes_stack, [&](ff_dim_t axis) {
        return axis.value >= num_dims(input_shape);
      })) {

Suggestion:

  auto resolve_dim_idx = [&](int dim_idx) {
    return ff_dim_t_from_relative_ff_dim_t(relative_ff_dim_t{axis},
                                           num_dims(input_shape));
  };
  
  stack_vector<ff_dim_t, MAX_TENSOR_DIM> axes_stack =
      stack_vector_of<MAX_TENSOR_DIM>(transform(axes, resolve_dim_idx));

  if (any_of(axes_stack, [&](ff_dim_t axis) {
        return axis.value >= num_dims(input_shape);
      })) {

lib/pcg/src/pcg/computation_graph_builder.cc line 860 at r8 (raw file):

  if (any_of(axes_stack, [&](ff_dim_t axis) {
        return axis.value >= num_dims(input_shape);
      })) {

Suggestion:

tensor_guid_t ComputationGraphBuilder::layer_norm(
    tensor_guid_t const &input,
    std::vector<int> const &relative_axes,
    bool elementwise_affine,
    float eps,
    std::optional<std::string> const &maybe_name) {

  TensorShape input_shape = this->get_shape(input);

  stack_vector<ff_dim_t, MAX_TENSOR_DIM> axes =
      stack_vector_of<MAX_TENSOR_DIM>(transform(relative_axes, [&](int axis) {
        return ff_dim_t_from_relative_ff_dim_t(relative_ff_dim_t{axis},
                                               num_dims(input_shape));
      }));

  if (any_of(axes, [&](ff_dim_t axis) {
        return axis.value >= num_dims(input_shape);
      })) {

lib/utils/include/utils/stack_vector/stack_vector_of.h line 8 at r8 (raw file):

namespace FlexFlow {

template <size_t max_size, typename C, typename E = typename C::value_type>

We generally have template params in ALL_CAPS

Suggestion:

template <size_t MAX_SIZE, typename C, typename E = typename C::value_type>

lib/utils/test/src/utils/stack_vector/stack_vector_of.cc line 15 at r8 (raw file):

    CHECK(result[0] == 1);
    CHECK(result[1] == 2);
    CHECK(result[2] == 3);

Prefer .at for bounds-checking

Suggestion:

    CHECK(result.at(0) == 1);
    CHECK(result.at(1) == 2);
    CHECK(result.at(2) == 3);

lib/utils/test/src/utils/stack_vector/stack_vector_of.cc line 15 at r8 (raw file):

    CHECK(result[0] == 1);
    CHECK(result[1] == 2);
    CHECK(result[2] == 3);

Suggestion:

    stack_vector<int, MAXSIZE> result = stack_vector_of<MAXSIZE>(input);
    stack_vector<int, MAXSIZE> correct = {1, 2, 3};

    CHECK(result == correct);

victorli2002

Reviewable status: all files reviewed, 13 unresolved discussions (waiting on @lockshaw)

lib/local-execution/src/ops/transpose.cc line 43 at r8 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Don't use variables with reference type unless you're certain they're useful (which in this case it's not, the pointer is probably no smaller than the size_t anyway)

Done.

lib/local-execution/src/ops/transpose.cc line 46 at r8 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Avoid unnecessary const

Done.

lib/local-execution/src/ops/transpose.cc line 46 at r8 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Prefer performing multi-step initialization in its own block unless it's essentially the only thing in the current block

Done.

lib/op-attrs/src/op-attrs/dim_ordered/slice.cc line 11 at r8 (raw file):

Previously, lockshaw (Colin Unger) wrote…

You actually have to instantiate the template functions (the using T is just declaring a type alias)

Getting these errors

lib/utils/include/utils/stack_vector/stack_vector_of.h line 8 at r8 (raw file):

Previously, lockshaw (Colin Unger) wrote…

We generally have template params in ALL_CAPS

Done.

lib/utils/src/utils/stack_vector/stack_vector.cc line 1 at r8 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Add explicit instantiation of stack_vector<value_type, ...>

What do you mean explicit instantiation?

lib/utils/test/src/utils/stack_vector/stack_vector_of.cc line 15 at r8 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Prefer .at for bounds-checking

Done.

lib/utils/test/src/utils/stack_vector/stack_vector_of.cc line 15 at r8 (raw file):

    CHECK(result[0] == 1);
    CHECK(result[1] == 2);
    CHECK(result[2] == 3);

Done.

lib/pcg/src/pcg/computation_graph_builder.cc line 860 at r8 (raw file):

  if (any_of(axes_stack, [&](ff_dim_t axis) {
        return axis.value >= num_dims(input_shape);
      })) {

Done.

lib/pcg/src/pcg/computation_graph_builder.cc line 860 at r8 (raw file):

  if (any_of(axes_stack, [&](ff_dim_t axis) {
        return axis.value >= num_dims(input_shape);
      })) {

Done.

lib/pcg/src/pcg/computation_graph_builder.cc line 867 at r7 (raw file):

              return ff_dim_t_from_relative_ff_dim_t_ff_dim_t(
                  relative_ff_dim_t{axis}, num_dims(input_shape));
            });

Done.

… uses int

lockshaw

Reviewed 5 of 5 files at r9, all commit messages.
Reviewable status: all files reviewed, 5 unresolved discussions (waiting on @victorli2002)

lib/op-attrs/src/op-attrs/dim_ordered/slice.cc line 11 at r8 (raw file):

Previously, victorli2002 (Victor Li) wrote…

Getting these errors

I think the issue here is the ifndefs in this file--you should only have header guards in headers

lib/utils/src/utils/stack_vector/stack_vector.cc line 1 at r8 (raw file):

Previously, victorli2002 (Victor Li) wrote…

What do you mean explicit instantiation?

using T = value_type<0>;

template struct stack_vector<T, 5>;

victorli2002

Reviewable status: all files reviewed, 5 unresolved discussions (waiting on @lockshaw)

lib/op-attrs/src/op-attrs/dim_ordered/slice.cc line 11 at r8 (raw file):

Previously, lockshaw (Colin Unger) wrote…

I think the issue here is the ifndefs in this file--you should only have header guards in headers

oops lol

lib/utils/src/utils/stack_vector/stack_vector.cc line 1 at r8 (raw file):

Previously, lockshaw (Colin Unger) wrote…

using T = value_type<0>;

template struct stack_vector<T, 5>;

value_type looks like it's missing some operators that are used in stack_vector

lockshaw

Reviewable status: all files reviewed, 4 unresolved discussions (waiting on @victorli2002)

lib/utils/src/utils/stack_vector/stack_vector.cc line 1 at r8 (raw file):

Previously, victorli2002 (Victor Li) wrote…

value_type looks like it's missing some operators that are used in stack_vector

In that case try using ordered_value_type instead

victorli2002

Reviewable status: all files reviewed, 4 unresolved discussions (waiting on @lockshaw)

lib/utils/src/utils/stack_vector/stack_vector.cc line 1 at r8 (raw file):

Previously, lockshaw (Colin Unger) wrote…

In that case try using ordered_value_type instead

I can't find that type. Is it in the main branch?

lockshaw

Reviewable status: all files reviewed, 4 unresolved discussions (waiting on @victorli2002)

lib/utils/src/utils/stack_vector/stack_vector.cc line 1 at r8 (raw file):

Previously, victorli2002 (Victor Li) wrote…

I can't find that type. Is it in the main branch?

Good point, go ahead and copy it into this branch: implementation here

victorli2002

Reviewable status: all files reviewed, 4 unresolved discussions (waiting on @lockshaw)

lib/utils/src/utils/stack_vector/stack_vector.cc line 1 at r8 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Good point, go ahead and copy it into this branch: implementation here

Is there a way to give it support for std::optional?

lockshaw

Reviewable status: all files reviewed, 4 unresolved discussions (waiting on @victorli2002)

lib/utils/src/utils/stack_vector/stack_vector.cc line 1 at r8 (raw file):

Previously, victorli2002 (Victor Li) wrote…

Is there a way to give it support for std::optional?

That's actually a correctly-detected error: ordered_value_type is not default-constructible, so element_type is std::optional<ordered_value_type>, but then the type of the .data() method is incorrect. For now just remove that method

victorli2002

Reviewable status: all files reviewed, 4 unresolved discussions (waiting on @lockshaw)

lib/utils/src/utils/stack_vector/stack_vector.cc line 1 at r8 (raw file):

Previously, lockshaw (Colin Unger) wrote…

That's actually a correctly-detected error: ordered_value_type is not default-constructible, so element_type is std::optional<ordered_value_type>, but then the type of the .data() method is incorrect. For now just remove that method

Done.

lib/utils/src/utils/stack_vector/stack_vector_of.cc line 1 at r8 (raw file):

Previously, lockshaw (Colin Unger) wrote…

Add explicit instantiation of stack_vector_of for stack_vector<value_type, ...>

Done.

victorli2002

Reviewable status: 99 of 106 files reviewed, 6 unresolved discussions (waiting on @lockshaw and @Marsella8)

lib/utils/include/utils/stack_vector/stack_vector.h line 295 at r10 (raw file):

  }

  std::array<element_type, MAXSIZE> stack_contents() const {

Changed this due to the type errors with ordered_value_type. stack_string.h still called the data function, so I created this to give it access to contents

lib/utils/include/utils/stack_string.h line 62 at r10 (raw file):

  friend fmt::basic_string_view<Char>
      format_as(stack_basic_string<Char, MAXSIZE> const &s) {
    return {s.contents.stack_contents().data(), s.length()};

changed to match stack_vector change

lockshaw

Reviewed 5 of 7 files at r10, 5 of 5 files at r11, all commit messages.
Reviewable status: all files reviewed, 4 unresolved discussions (waiting on @Marsella8 and @victorli2002)

Addressed

victorli2002 closed this Oct 31, 2024

victorli2002 reopened this Oct 31, 2024

victorli2002 requested a review from lockshaw October 31, 2024 04:54

victorli2002 marked this pull request as draft October 31, 2024 04:55

victorli2002 force-pushed the nn_int branch from f2b6d2a to f9fe89b Compare October 31, 2024 20:00

goliaro added the repo-refactor label Nov 5, 2024

victorli2002 force-pushed the nn_int branch from f9fe89b to 0a784d4 Compare November 8, 2024 01:47

Changed ff_dim to ff_dim_t, added in nonnegative_int type

670fb62

victorli2002 force-pushed the nn_int branch from 0a784d4 to 670fb62 Compare November 8, 2024 22:41

lockshaw mentioned this pull request Nov 9, 2024

Add DLRM Model Computational Graph #1532

Open

victorli2002 marked this pull request as ready for review November 15, 2024 22:56

victorli2002 requested a review from Marsella8 November 15, 2024 23:21

Marsella8 previously requested changes Nov 16, 2024

View reviewed changes

lockshaw requested changes Nov 19, 2024

View reviewed changes

victorli2002 commented Nov 27, 2024

View reviewed changes

victorli2002 force-pushed the nn_int branch from 35bd5d9 to 3cae680 Compare November 27, 2024 19:33

victorli2002 commented Nov 28, 2024

View reviewed changes

victorli2002 force-pushed the nn_int branch from 3cae680 to e56aeb7 Compare November 28, 2024 05:03

lockshaw removed the repo-refactor label Dec 16, 2024

lockshaw changed the base branch from repo-refactor to master December 16, 2024 08:36

lockshaw mentioned this pull request Dec 19, 2024

Local execution e2e training #1472

Open

lockshaw requested changes Dec 20, 2024

View reviewed changes

victorli2002 force-pushed the nn_int branch from 332a119 to f08041e Compare December 23, 2024 23:49

victorli2002 commented Dec 24, 2024

View reviewed changes

victorli2002 force-pushed the nn_int branch from f08041e to b617632 Compare December 25, 2024 04:44

victorli2002 commented Dec 25, 2024

View reviewed changes

victorli2002 force-pushed the nn_int branch from b617632 to 4dcd72e Compare December 27, 2024 01:43

lockshaw requested changes Jan 10, 2025

View reviewed changes

victorli2002 commented Jan 10, 2025

View reviewed changes

victorli2002 force-pushed the nn_int branch from 4dcd72e to 5e08c11 Compare January 11, 2025 09:33

lockshaw requested changes Jan 15, 2025

View reviewed changes

victorli2002 commented Jan 15, 2025

View reviewed changes

Changed ff_dim_t to use nonnegative_int, added relative_ff_dim_t that…

7881a82

… uses int

victorli2002 force-pushed the nn_int branch from 5e08c11 to 7881a82 Compare January 15, 2025 05:12

lockshaw requested changes Jan 15, 2025

View reviewed changes

victorli2002 commented Jan 15, 2025

View reviewed changes

victorli2002 force-pushed the nn_int branch from 76cc39b to 7881a82 Compare January 15, 2025 05:52

lockshaw requested changes Jan 15, 2025

View reviewed changes

victorli2002 commented Jan 15, 2025

View reviewed changes

lockshaw requested changes Jan 15, 2025

View reviewed changes

Merge branch 'master' into nn_int

0c93abe

victorli2002 commented Jan 15, 2025

View reviewed changes

lockshaw requested changes Jan 15, 2025

View reviewed changes

victorli2002 commented Jan 16, 2025

View reviewed changes

Adding value_type and ordered_value_type to some files

727fd85

victorli2002 commented Jan 16, 2025

View reviewed changes

Temporarily remove stack_contents

ad05f5c

lockshaw changed the title ~~Changed ff_dim to ff_dim_t, added in nonnegative_int type~~ Add nonnegative_int type Jan 16, 2025

lockshaw approved these changes Jan 16, 2025

View reviewed changes

lockshaw enabled auto-merge (squash) January 16, 2025 23:30

Format

22eda14

lockshaw merged commit 8f64a0f into flexflow:master Jan 17, 2025
7 of 8 checks passed

victorli2002 deleted the nn_int branch January 22, 2025 00:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add nonnegative_int type #1533

Add nonnegative_int type #1533

victorli2002 commented Oct 31, 2024 •

edited

Loading

codecov bot commented Oct 31, 2024 •

edited

Loading

Marsella8 left a comment

lockshaw left a comment

victorli2002 left a comment

victorli2002 left a comment

lockshaw left a comment

victorli2002 left a comment

victorli2002 left a comment

lockshaw left a comment

victorli2002 left a comment

lockshaw left a comment

victorli2002 left a comment

lockshaw left a comment

victorli2002 left a comment

lockshaw left a comment

victorli2002 left a comment

lockshaw left a comment

victorli2002 left a comment

lockshaw left a comment

victorli2002 left a comment

victorli2002 left a comment

lockshaw left a comment

Add nonnegative_int type #1533

Add nonnegative_int type #1533

Conversation

victorli2002 commented Oct 31, 2024 • edited Loading

codecov bot commented Oct 31, 2024 • edited Loading

Codecov Report

Marsella8 left a comment

Choose a reason for hiding this comment

lockshaw left a comment

Choose a reason for hiding this comment

victorli2002 left a comment

Choose a reason for hiding this comment

victorli2002 left a comment

Choose a reason for hiding this comment

lockshaw left a comment

Choose a reason for hiding this comment

victorli2002 left a comment

Choose a reason for hiding this comment

victorli2002 left a comment

Choose a reason for hiding this comment

lockshaw left a comment

Choose a reason for hiding this comment

victorli2002 left a comment

Choose a reason for hiding this comment

lockshaw left a comment

Choose a reason for hiding this comment

victorli2002 left a comment

Choose a reason for hiding this comment

lockshaw left a comment

Choose a reason for hiding this comment

victorli2002 left a comment

Choose a reason for hiding this comment

lockshaw left a comment

Choose a reason for hiding this comment

victorli2002 left a comment

Choose a reason for hiding this comment

lockshaw left a comment

Choose a reason for hiding this comment

victorli2002 left a comment

Choose a reason for hiding this comment

lockshaw left a comment

Choose a reason for hiding this comment

victorli2002 left a comment

Choose a reason for hiding this comment

victorli2002 left a comment

Choose a reason for hiding this comment

lockshaw left a comment

Choose a reason for hiding this comment

victorli2002 commented Oct 31, 2024 •

edited

Loading

codecov bot commented Oct 31, 2024 •

edited

Loading