【UnitTestFix No.8】fix test_mean_op.py #75457

WanRui37 · 2025-09-22T14:57:58Z

PR Category

Execute Infrastructure

PR Types

Bug fixes

Description

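Filled in the missing public_python_api.
Filled in the missing prim_type.
Removed the unnecessary mean_all.
Removed the unnecessary mean_wrapper and the reduce classes.
Fixed the complex64 and complex128 cases, where NaN in the input caused grad errors.
Fixed the float64 case, where NaN in the input caused grad errors.

Further cleanup is needed later, using inheritance to simplify the code.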
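
The NaN-related grad errors mentioned above can be illustrated with a small NumPy sketch. It is not the check that OpTest runs; `numeric_grad_of_mean` is a hypothetical helper written only for this example, assuming a central-difference gradient check. It shows that a single NaN in the input turns every numeric gradient into NaN, so a comparison against the analytic gradient 1/n cannot pass.

```python
import numpy as np


def numeric_grad_of_mean(x, eps=1e-3):
    """Central-difference gradient of mean(x) w.r.t. each element (hypothetical helper)."""
    grad = np.zeros_like(x)
    for i in range(x.size):
        plus, minus = x.copy(), x.copy()
        plus.flat[i] += eps
        minus.flat[i] -= eps
        grad.flat[i] = (np.mean(plus) - np.mean(minus)) / (2 * eps)
    return grad


clean = np.array([1.0, 2.0, 3.0, 4.0])
with_nan = np.array([1.0, np.nan, 3.0, 4.0])
analytic = np.full(4, 1.0 / 4)  # d mean / d x_i = 1 / n

# Without NaN, the numeric and analytic gradients agree.
clean_ok = np.allclose(numeric_grad_of_mean(clean), analytic)
# With one NaN, every mean is NaN, so every numeric gradient is NaN.
nan_grad = numeric_grad_of_mean(with_nan)
print(clean_ok, np.isnan(nan_grad).all())
```

This is why test inputs for a gradient check are usually kept free of NaN, or the NaN cases are checked for forward output only.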


paddle-bot · 2025-09-22T14:58:04Z

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

YqGe585 · 2025-09-23T09:36:44Z

test/legacy_test/test_mean_op.py

-        self.outputs = {'Out': out_np}
-
-
-class TestMeanOp_Int32ZeroSize(OpTest):


Could you explain why these cases were removed?

As I recall, grad is not supported for int types. I'll keep working on this.

YqGe585 · 2025-09-23T09:40:34Z

test/legacy_test/test_mean_op.py


    def test_checkout_grad(self):
-        self.check_grad(['X'], 'Out', check_pir=True, check_prim_pir=True)
+        place = core.CUDAPlace(0)


Prefer get_device_place from op_test to obtain the place. That way the unit test can support more hardware without affecting correctness on GPU.

OK, will do.
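
The reviewer's point can be sketched in plain Python. This is not Paddle's get_device_place implementation; `choose_place`, its parameters, and the device labels are hypothetical stand-ins used only to show why a selection helper is more portable than a hard-coded `core.CUDAPlace(0)`.

```python
def choose_place(compiled_with_cuda, custom_device_types):
    """Hypothetical helper: pick a device label instead of hard-coding the GPU."""
    if compiled_with_cuda:
        return "gpu:0"
    if custom_device_types:
        # Fall back to the first registered non-CUDA device type.
        return f"{custom_device_types[0]}:0"
    return "cpu"


# The same test code then runs on whatever hardware is present.
print(choose_place(True, []))
print(choose_place(False, ["npu"]))
print(choose_place(False, []))
```

A test written against such a helper keeps its GPU behavior unchanged while no longer failing on builds without CUDA.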

YqGe585 · 2025-09-23T09:42:22Z

test/legacy_test/test_mean_op.py

-    or not core.is_float16_supported(get_device_place()),
-    "core is not compiled with CUDA",
-)
-class TestReduceMeanOp(OpTest):


Please don't delete the ReduceMean-related unit tests.

Understood. I'll keep working on this and fix it.

WanRui37 · 2025-09-24T04:46:22Z

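Hello, R&D mentor,

Reasons for removing TestMeanOp_Int32ZeroSize and the other int-type ZeroSize cases:
- The mean_all kernel has no int-type support, so it raises the following error:
```
NotFoundError: The kernel (mean) with key (GPU, Undefined(AnyLayout), int64) is not found and GPU kernel cannot fallback to CPU one
```
- Even after adding that support, the results still don't match:
```
x: array(-9223372036854775808, dtype=int64)
y: array(nan)
```
NaN is a special floating-point value. An int type can only substitute its minimum value for it, so the two can't be compared this way. That is why I removed these cases.
I've replaced place = core.CUDAPlace(0) with get_device_place.
The Reduce tests have been added back, with check_prim adjusted.

@YqGe585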
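
The mismatch described in this comment can be reproduced on the reference side with NumPy alone. The sketch below does not call Paddle; `empty_mean` is a hypothetical helper, and the int64 minimum stands in for the kernel output reported above. It shows that the NumPy reference for a zero-size int64 input is a float64 NaN, which an int64 result can never equal.

```python
import warnings

import numpy as np


def empty_mean(dtype):
    """NumPy reference mean of a zero-size array of the given dtype (hypothetical helper)."""
    x = np.empty((0,), dtype=dtype)
    with warnings.catch_warnings():
        # NumPy warns about taking the mean of an empty slice; silence it here.
        warnings.simplefilter("ignore", RuntimeWarning)
        return np.mean(x)


ref = empty_mean(np.int64)          # promoted to float64, value is NaN
int64_min = np.iinfo(np.int64).min  # the value reported for x in the thread

print(ref, ref.dtype, int64_min)

try:
    np.testing.assert_array_equal(np.array(int64_min), np.array(ref))
    matched = True
except AssertionError:
    # NaN appears on one side only, so the comparison fails.
    matched = False
print("outputs match:", matched)
```

Since int64 has no NaN representation, no integer kernel output can satisfy this comparison, which is consistent with dropping the int zero-size cases rather than changing the kernel.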

luotao1 · 2025-09-24T06:22:12Z

> Hello, teaching assistant

That should be "R&D mentor" 😄

WanRui37 · 2025-09-24T06:37:28Z

Hello, R&D mentor. I used the wrong form of address, my apologies.

YqGe585 · 2025-09-24T06:41:58Z

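> Hello, teaching assistant
> Reasons for removing TestMeanOp_Int32ZeroSize and the other int-type ZeroSize cases:
>
> The mean_all kernel has no int-type support, so it raises the following error:
> NotFoundError: The kernel (mean) with key (GPU, Undefined(AnyLayout), int64) is not found and GPU kernel cannot fallback to CPU one
> Even after adding that support, the results still don't match:
> x: array(-9223372036854775808, dtype=int64)
> y: array(nan)
> NaN is a special floating-point value. An int type can only substitute its minimum value for it, so the two can't be compared this way. That is why I removed these cases.
> I've replaced place = core.CUDAPlace(0) with get_device_place.
>
> The Reduce tests have been added back, with check_prim adjusted.
> @YqGe585

Understood. In that case the int zero-size cases can be removed. Please don't modify the data types in the kernel.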

WanRui37 · 2025-09-24T06:46:11Z

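> > Hello, teaching assistant
> > Reasons for removing TestMeanOp_Int32ZeroSize and the other int-type ZeroSize cases:
> >
> > The mean_all kernel has no int-type support, so it raises the following error:
> > NotFoundError: The kernel (mean) with key (GPU, Undefined(AnyLayout), int64) is not found and GPU kernel cannot fallback to CPU one
> > Even after adding that support, the results still don't match:
> > x: array(-9223372036854775808, dtype=int64)
> > y: array(nan)
> > NaN is a special floating-point value. An int type can only substitute its minimum value for it, so the two can't be compared this way. That is why I removed these cases.
> > I've replaced place = core.CUDAPlace(0) with get_device_place.
> >
> > The Reduce tests have been added back, with check_prim adjusted.
> > @YqGe585
>
> Understood. In that case the int zero-size cases can be removed. Please don't modify the data types in the kernel.

OK. Once CI passes, I'll remove the data types I added.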

WanRui37 · 2025-09-24T14:12:13Z

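Hello, R&D mentor. Neither of the two CI failures below is related to mean. Going forward, is it enough for me to just remove the int data types from the kernel?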

- CI / Linux-DCU / Test (pull_request)
```
test_no_grad (Failed)
========================================
There are failed tests, which have been executed re-run,but success rate is less than 50%:
Summary Failed Tests... 
========================================
The following tests FAILED: 
                1121 - test_cdist (Timeout)
                497 - test_standalone_cross_step_overlap (Timeout)
Error: Process completed with exit code 8.
```
- CI-Build / Slice / Slice test (pull_request)
```
slice测试失败, 存在性能下降case, 失败case性能变化: {'Setitem - forward - Scalar - Tuple of Integers - float16 - paddle': -0.3087517129091814}
Update successful
Traceback (most recent call last):
File "/paddle/PaddleTest/framework/slice_benchmark/run.py", line 224, in <module>
    test.ci_test()
File "/paddle/PaddleTest/framework/slice_benchmark/run.py", line 164, in ci_test
    raise Exception("slice测试失败")
Exception: slice测试失败
```

YqGe585 · 2025-09-25T03:28:56Z

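Yes. After removing the types, push a new commit to re-trigger CI. The failures may be caused by random factors and are probably unrelated to your changes. If CI fails again, you can try commenting /re-run all-failed to re-trigger the failed CI pipelines. If it still fails after that, you'll need to find out which change caused it.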

WanRui37 · 2025-09-25T04:13:04Z

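> Yes. After removing the types, push a new commit to re-trigger CI. The failures may be caused by random factors and are probably unrelated to your changes. If CI fails again, you can try commenting /re-run all-failed to re-trigger the failed CI pipelines. If it still fails after that, you'll need to find out which change caused it.

Thank you. I've pushed a new commit.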

YqGe585

LGTM

WanRui37 and others added 6 commits September 20, 2025 17:36

fix TestMeanOp_Complex64ZeroSize

8f8f274

v2: Increased the number of input elements in TestMeanOp_ImagNanInput

6af69f8

v3: Fix gradient check for MeanOp with NaN inputs in ImagNanInput, Re…

96932fe

…alNanInput, RealValuedNanInput, ZeroSize

v4: Removed all related content

7719c42

v5: Remove all mean_all and fix RealValuedNanInput

66d9693

Merge branch 'PaddlePaddle:develop' into fix_001

008dc20

paddle-bot bot added the contributor External developers label Sep 22, 2025

luotao1 mentioned this pull request Sep 23, 2025

【启航计划】PaddlePaddle GPU单测修复 #75208

Open

luotao1 added the HappyOpenSource 快乐开源活动issue与PR label Sep 23, 2025

luotao1 assigned luotao1 and YqGe585 Sep 23, 2025

YqGe585 suggested changes Sep 23, 2025


WanRui37 and others added 2 commits September 24, 2025 04:37

v6: Fixed reduce error and mean not supporting int type error

a45a4bf

Merge branch 'PaddlePaddle:develop' into fix_001

a8b4c53

v7: Remove redundant int types

8dec6c5

YqGe585 approved these changes Sep 25, 2025


luotao1 approved these changes Sep 26, 2025


luotao1 merged commit 7fb1efb into PaddlePaddle:develop Sep 26, 2025
70 of 74 checks passed


3 participants
