
Reference implementation of ONNX ReduceSumSquare mismatches the ONNX spec when noop_with_empty_axes == 1 #6103

Open
RunnerZhong opened this issue Apr 29, 2024 · 5 comments
Labels
bug contributions welcome documentation Issues related to ONNX documentation spec clarification Clarification of the ONNX spec needed

Comments

@RunnerZhong

Bug Report

Is the issue related to model conversion?

No

Describe the bug

When noop_with_empty_axes == 1 and axes is empty, the ONNX spec says the op should return the input tensor directly.
However, the reference implementation in onnx mismatches the spec: it returns np.square of the input tensor instead.

[screenshot: reference implementation output]

System information

Reproduction instructions

Expected behavior

return input tensor directly
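A minimal numpy sketch of the spec-mandated semantics as described above (the helper name, signature, and defaults here are illustrative, not the actual ONNX reference code):

```python
import numpy as np

def reduce_sum_square(data, axes=None, keepdims=1, noop_with_empty_axes=0):
    """Illustrative sketch of the spec semantics for ReduceSumSquare."""
    if (axes is None or len(axes) == 0) and noop_with_empty_axes == 1:
        # Spec: no-op, return the input unchanged -- NOT np.square(data)
        return data
    # Default behavior: empty/absent axes means reduce over all axes
    axes_t = tuple(axes) if axes else None
    return np.sum(np.square(data), axis=axes_t, keepdims=bool(keepdims))

x = np.array([[1.0, 2.0], [3.0, 4.0]])
print(reduce_sum_square(x, axes=[], noop_with_empty_axes=1))  # x, unchanged
print(reduce_sum_square(x))  # [[30.]] -- sum of squares over all axes
```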

Notes

@amankshihab
Contributor

amankshihab commented May 1, 2024

https://github.com/onnx/onnx/blob/d6f87121ba256ac6cc4d1da0463c300c278339d2/docs/Changelog.md?plain=1#L22221-L22222

The expected behavior is mentioned here as well.
Can I work on this? @justinchuby

@justinchuby
Contributor

Absolutely. Contributions are welcome and appreciated

@gramalingam
Contributor

This is complicated. I agree that there is a mismatch, but is the bug in the specification or the implementation?

My personal interpretation is that this is a bug in the specification, not implementation, for the following reason: the attributes serve to define the set of axes being reduced: specifically, it is a flag to allow the empty list to indicate that all axes must be reduced (or that no axes must be reduced). Now, even if zero axes are reduced, it makes sense to compute the square. ReduceSumSquare is not actually a reduction-op: it is a reduction-op Sum applied to the square of the input.

I think the bug was in reusing the ReduceSum documentation for all reduction ops: it is correct for the basic reduction ops, but not for ReduceSumSquare.

Of course, we can test with other backends/implementations (like onnxruntime, or even pytorch/tensorflow etc. IF they have such an option).
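The two readings discussed in this comment can be sketched in numpy (hypothetical helpers, not ONNX code; 1-D input with keepdims omitted for brevity):

```python
import numpy as np

def as_noop(data):
    """Current spec wording: empty axes + noop_with_empty_axes=1 -> identity."""
    return data

def as_sum_of_square(data, axes=()):
    """Decomposition reading: ReduceSumSquare = Sum(Square(x)).
    The elementwise Square step always applies; only Sum is skipped
    when zero axes are reduced."""
    squared = np.square(data)
    if len(axes) == 0:
        return squared  # zero axes reduced: Sum over no axes is the identity
    return np.sum(squared, axis=tuple(axes), keepdims=True)

x = np.array([1.0, 2.0, 3.0])
print(as_noop(x))           # [1. 2. 3.]
print(as_sum_of_square(x))  # [1. 4. 9.] -- what the reference implementation returns
```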

github-merge-queue bot pushed a commit that referenced this issue May 3, 2024
### Description
This PR aligns the op implementation for `ReduceSumSquare18` when `axes`
are not specified and `noop_with_empty_axes != 0` with that of the
reference.

### Motivation and Context
The current implementation of `ReduceSumSquare18` when axes are empty and
`noop_with_empty_axes == 1` squares the input data and then returns it,
when according to the reference it should just return the input data
unchanged.

This addresses issue #6103

Signed-off-by: Aman K Shihab <amanshihab276@gmail.com>
isdanni pushed a commit to isdanni/onnx that referenced this issue May 6, 2024
isdanni pushed a commit to isdanni/onnx that referenced this issue May 6, 2024
@justinchuby justinchuby added documentation Issues related to ONNX documentation spec clarification Clarification of the ONNX spec needed and removed reference implementation labels May 8, 2024
@justinchuby
Contributor

Following the discussion, I think it's reasonable to correct the spec. Out of curiosity, was there a reason you expected the behavior to be different from the current one, @RunnerZhong?

@RunnerZhong
Author

I agree with the idea below, so maybe we need to modify the spec of ops like ReduceSumSquare (and ReduceLogSum, ReduceLogSumExp).

> This is complicated. Agree that there is a mismatch, but is the bug in the specification or implementation?
>
> My personal interpretation is that this is a bug in the specification, not implementation, for the following reason: the attributes serve to define the set of axes being reduced: specifically, it is a flag to allow the empty list to indicate that all axes must be reduced (or that no axes must be reduced). Now, even if zero axes are reduced, it makes sense to compute the square. ReduceSumSquare is not actually a reduction-op: it is a reduction-op Sum applied to the square of the input.
>
> I think the bug was in reusing the ReduceSum documentation for all ops ... it is correct for basic Reduction ops, but not ReduceSumSquare.
>
> Of course, we can test with other backends/implementations (like onnxruntime, or even pytorch/tensorflow etc. IF they have such an option).
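The other ops mentioned share the same ambiguity, since each composes a Sum reduction with elementwise steps. An illustrative numpy sketch of their usual decompositions (hypothetical helpers, not ONNX code; keepdims fixed to True for brevity):

```python
import numpy as np

def reduce_log_sum(data, axes):
    """ReduceLogSum = Log(Sum(x)): the trailing Log is elementwise."""
    return np.log(np.sum(data, axis=tuple(axes), keepdims=True))

def reduce_log_sum_exp(data, axes):
    """ReduceLogSumExp = Log(Sum(Exp(x))): Exp and Log are elementwise."""
    return np.log(np.sum(np.exp(data), axis=tuple(axes), keepdims=True))

x = np.array([1.0, 2.0, 3.0])
print(reduce_log_sum(x, axes=(0,)))      # log(6)
print(reduce_log_sum_exp(x, axes=(0,)))  # log(e^1 + e^2 + e^3)
```

Under the "no axes are reduced" reading, it is unclear whether the elementwise Exp/Log steps should still apply, which is the same question raised for ReduceSumSquare's Square step.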
