-
Notifications
You must be signed in to change notification settings - Fork 660
【Hackathon 9th No.21、23】add unit tests for fused_hadamard_quant_fp8, moe_fused_hadamard_quant_fp8 #4094
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
【Hackathon 9th No.21、23】add unit tests for fused_hadamard_quant_fp8, moe_fused_hadamard_quant_fp8 #4094
Conversation
|
Thanks for your contribution! |
| paddle_output_fp32.numpy(), | ||
| actual_output_fp32.numpy(), | ||
| atol=1e-1, | ||
| rtol=1e-1, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
只能在1e-1的精度上对齐吗?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
我之前没有完全按照 kernel 计算逻辑,低精度计算顺序什么的可能影响比较大,我重新实现一下
…into ut_fused_hadamard_quant_fp8
…o-create/FastDeploy into ut_fused_hadamard_quant_fp8
|
融合算子中的hadamard只有一种实现,hadamard matrix的创建来自函数https://github.com/PaddlePaddle/FastDeploy/blob/develop/fastdeploy/model_executor/layers/utils.py#L197 |

add unit tests for fused_hadamard_quant_fp8, moe_fused_hadamard_quant_fp8