Skip to content

Conversation

tianyuzhou668
Copy link
Contributor

修复了softmax kernel中bf16的精度问题;
新增了index_elementwise_put_kernel的适配;
新增了显存不足时oom的warning;
支持了flash-attention同时传入is_casual和mask的情况,并针对mask中的最小值精度问题进行了修复;
支持了fft;

Copy link

paddle-bot bot commented Oct 16, 2025

Thanks for your contribution!

Copy link
Collaborator

@YqGe585 YqGe585 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@YqGe585 YqGe585 merged commit 6f01062 into PaddlePaddle:develop Oct 17, 2025
11 of 12 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants