
Code Contribution: [Hard] [Operator Development] cummaxmin_backward #397

Open · StrongSpoon opened this issue Jan 3, 2025 · 2 comments

@StrongSpoon (Collaborator)
Description 任务介绍

Develop the backward function for the cummax and cummin operators.
开发cummax和cummin算子的反向功能。

Requirements 任务要求

Interface 接口
cummaxmin_backward(Tensor grad, Tensor input, Tensor indices, int dim) -> Tensor
Function reference 功能参考
https://pytorch.org/docs/stable/generated/torch.cummax.html#torch-cummax
https://pytorch.org/docs/stable/generated/torch.cummin.html#torch-cummin
Implementation reference 实现参考
https://github.com/FlagOpen/FlagGems/blob/master/src/flag_gems/ops/cummin.py

The operator should support all optional arguments defined in the interface.
算子应支持接口中定义的所有参数选项。
Please provide both accuracy test and performance test code.
请同时提供实现正确性测试与性能测试代码。
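
Since both accuracy and performance tests are requested, a rough harness sketch follows (a sketch only: the `impl` argument stands in for whatever gems-side entry point the eventual PR exposes, and the test shape, iteration count, and wall-clock timing are assumptions of mine; FlagGems' existing benchmark utilities would likely be preferred in the actual PR):

import time
import torch

def check_cummaxmin_backward(impl, shape=(64, 1024), dim=1, dtype=torch.float32, device="cuda"):
    # Accuracy: compare `impl` against the gradient PyTorch's autograd produces
    # for torch.cummax (which routes through aten::cummaxmin_backward).
    inp = torch.randn(shape, dtype=dtype, device=device, requires_grad=True)
    out, indices = torch.cummax(inp, dim=dim)
    grad = torch.randn_like(out)
    out.backward(grad)
    ref = inp.grad
    res = impl(grad, inp.detach(), indices, dim)
    torch.testing.assert_close(res, ref)

    # Performance: crude wall-clock timing as a placeholder for a proper benchmark.
    torch.cuda.synchronize()
    start = time.perf_counter()
    for _ in range(100):
        impl(grad, inp.detach(), indices, dim)
    torch.cuda.synchronize()
    print(f"avg latency: {(time.perf_counter() - start) / 100 * 1e6:.1f} us")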

DDL 提交时间

Please submit a Pull Request within three weeks of accepting the assignment.
请于接取任务后三周内提交PR。

StrongSpoon converted this from a draft issue · Jan 3, 2025
@2niuhe (Contributor) commented Jan 13, 2025

I'd like to claim this task.

Tango2018cc moved this from Todo to In Progress in Triton China Community · Jan 13, 2025
@2niuhe (Contributor) commented Jan 13, 2025

While reviewing the PyTorch code, I noticed that cummaxmin_backward is declared device-agnostic in native_functions.yaml:

- func: cummaxmin_backward(Tensor grad, Tensor input, Tensor indices, int dim) -> Tensor
  variants: function
  device_check: NoCheck
  device_guard: False

The implementation of cummaxmin_backward is as follows:

Tensor cummaxmin_backward(const Tensor& grad, const Tensor& input, const Tensor& indices, int64_t dim) {
  if (input.sym_numel() == 0) {
    return input;
  }
  auto result = at::zeros_symint(input.sym_sizes(), input.options());

  // for composite compliance, use out-of-place variant of
  // `scatter_add` if `indices` or `grad` is a Tensor Subclass.
  if (areAnyTensorSubclassLike({indices, grad})) {
    return result.scatter_add(dim, indices, grad);
  }
  return result.scatter_add_(dim, indices, grad);
}

The actual kernel being called is scatter_add_, and I noticed that the flag_gems library already implements the scatter operator. This raises the question: is there still a need to implement cummaxmin_backward, or could we simply wrap the scatter operation to provide an in-place version?
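
For concreteness, the wrap-only option would come down to something like the sketch below (plain ATen calls only; whether scatter_add_ here actually dispatches to FlagGems' scatter kernel depends on how the library registers it, which I haven't verified):

import torch

def cummaxmin_backward(grad, inp, indices, dim):
    # Same structure as the ATen implementation above: allocate zeros, then
    # scatter the incoming gradient back to the argmax/argmin positions along `dim`.
    if inp.numel() == 0:
        return inp
    result = torch.zeros(inp.shape, dtype=inp.dtype, device=inp.device)
    return result.scatter_add_(dim, indices, grad)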

Looking forward to your thoughts!
