Skip to content

Conversation

@zhengshengning
Copy link
Contributor

PR Category

Operator Mechanism

PR Types

New features

Description

将该pr:#75965 ([Precision Depth Alignment] implement torch compatible max_pool2d grad kernel),cherry-pick 到 Fleety_12

该pr的修改描述:

由于paddle现有实现具有明显的性能优势,所以添加了 FLAGS_torch_compatible_pool_grad,用于开启和关闭兼容 torch 模式。
实现了一套新的兼容 torch 模式的 kernel,主要区别如下:
对于 f32 精度以下的数据进行精度提升。
从遍历输出维度改成了遍历输入维度。
添加了额外的单测。

由于cherry-pick时存在冲突,所以单独提一个PR

@paddle-bot
Copy link

paddle-bot bot commented Oct 27, 2025

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@zhengshengning zhengshengning merged commit d844a80 into PaddlePaddle:fleety_12 Oct 28, 2025
108 of 112 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants