[Accuracy diff No.70] Fix accuracy diff for topk API #217
Conversation
Thanks for your contribution!
Cutelemon6 left a comment:

There are a few details to pay attention to.
tester/api_config/config_analyzer.py (Outdated)

```python
elif self.dtype in {"int32", "int64"}:
    self.numpy_tensor = numpy.random.choice(numpy.arange(-x_numel, x_numel), size=self.shape, replace=False).astype(self.dtype)
else:
    raise ValueError(f"Unsupported dtype {self.dtype} for paddle.topk")
```
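The integer branch above works because sampling without replacement from a range wider than the element count guarantees every element is distinct. A standalone sketch (NumPy only, with an arbitrary example shape):

```python
import numpy as np

# Sampling without replacement from a range of 2 * numel candidate
# integers guarantees all numel drawn elements are distinct.
shape = (4, 5)
numel = 4 * 5
arr = np.random.choice(np.arange(-numel, numel), size=shape, replace=False).astype(np.int64)

print(arr.shape, np.unique(arr).size)  # unique count equals numel
```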
Suggested message:

```python
f"Unsupported dtype {self.dtype} for paddle.topk / paddle.Tensor.topk"
```

```python
if self.dtype in {"bfloat16", "float32", "float64"}:
    dtype = "float32" if self.dtype == "bfloat16" else self.dtype
    self.numpy_tensor = numpy.linspace(-x_numel, x_numel, x_numel, dtype=dtype).reshape(self.shape)
    if numpy.unique(self.numpy_tensor).size < x_numel:
        self.numpy_tensor = generate_unique_array(x_numel, dtype).reshape(self.shape)
elif self.dtype == "float16":
    self.numpy_tensor = generate_unique_array(x_numel, self.dtype).reshape(self.shape)
```
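`generate_unique_array` is a project helper whose body is not quoted in this thread. Purely as an illustration of the guarantee such a helper must provide (all elements distinct after the cast to the target dtype), a hypothetical minimal version might look like:

```python
import numpy as np

# Hypothetical stand-in for the project's generate_unique_array helper;
# the real implementation is not shown in this thread.
def generate_unique_array(numel, dtype):
    # Small consecutive integers are exactly representable in every float
    # dtype discussed here (float16 holds integers up to 2048 exactly).
    vals = np.arange(numel, dtype=np.float64) - numel / 2.0
    out = vals.astype(dtype)
    if np.unique(out).size < numel:
        raise ValueError(f"{dtype} cannot hold {numel} distinct values")
    return out

print(np.unique(generate_unique_array(1000, "float16")).size)
```

The uniqueness check at the end matters: as discussed below in this thread, float16 simply cannot hold arbitrarily many distinct values.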
Can non-float16 dtypes also end up with duplicate elements after numpy.linspace (due to rounding)? Consider generating within the dtype's full representable range instead:

```python
numpy.linspace(numpy.finfo(dtype).min, numpy.finfo(dtype).max, x_numel, dtype=dtype).reshape(self.shape)
```

Try this for float16 as well. I noticed some float16 configs have a very large numel, but float16's representable range is quite limited.
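The reviewer's concern can be checked empirically (a sketch, NumPy only): generate with linspace for a given numel and dtype, then count unique elements after the cast.

```python
import numpy as np

# Does linspace(-n, n, n) stay duplicate-free once stored in the target dtype?
def has_duplicates(n, dtype):
    arr = np.linspace(-n, n, n, dtype=dtype)
    return np.unique(arr).size < n

# Near 10000 the gap between adjacent float16 values is 8, while the
# linspace step is about 2, so values collide; float32 resolves the
# same range easily (its spacing near 10000 is about 0.001).
print(has_duplicates(10_000, np.float32), has_duplicates(10_000, np.float16))
```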
Update: float16 can represent at most 63488 finite values; a tensor with more elements than that will have random output.

It's worth reading up on float16's representable range. In theory, any float16 tensor with fewer than 100 million elements can pass, with no randomness:

```python
print(f"float16 max: {numpy.finfo(numpy.float16).max}, float16 min: {numpy.finfo(numpy.float16).min}, float16 eps: {numpy.finfo(numpy.float16).eps}")
print(f"max tensor numel est: {(numpy.finfo(numpy.float16).max.astype(numpy.float64) - numpy.finfo(numpy.float16).min.astype(numpy.float64)) / numpy.finfo(numpy.float16).eps}")
```

Output:

```
float16 max: 65504.0, float16 min: -65504.0, float16 eps: 0.0009765625
max tensor numel est: 134152192.0
```

```
paddle.put_along_axis(Tensor([7, 8000],"float32"), Tensor([7, 799],"int64"), Tensor([7, 799],"float32"), 1, )
paddle.put_along_axis(Tensor([8, 8000],"float32"), Tensor([8, 799],"int64"), Tensor([8, 799],"float32"), 1, )
paddle.put_along_axis(Tensor([9, 8000],"float32"), Tensor([9, 799],"int64"), Tensor([9, 799],"float32"), 1, )
paddle.topk(Tensor([128, 1000],"float16"), k=5, )
```
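The 63488 figure can be verified directly by enumerating every 16-bit pattern and counting the finite ones (a sketch, NumPy only): 65536 encodings total, minus 2046 NaN patterns and 2 infinities.

```python
import numpy as np

# Every float16 value corresponds to exactly one 16-bit pattern.
all_values = np.arange(2**16, dtype=np.uint16).view(np.float16)
finite = all_values[np.isfinite(all_values)]

print(finite.size)             # 63488 finite encodings
print(np.unique(finite).size)  # distinct values (+0.0 and -0.0 compare equal)
```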
For this case, adding the following initialization in config_analyzer.py passes consistently across multiple runs:

```python
elif api_config.api_name == "paddle.topk":
    if self.check_arg(api_config, 0, "x"):
        self.numpy_tensor = numpy.linspace(numpy.finfo(self.dtype).min, numpy.finfo(self.dtype).max, num=self.numel()).astype(self.dtype).reshape(self.shape)
```
```python
import numpy
dtype = numpy.float16
numel = 128 * 1000
out = numpy.linspace(numpy.finfo(dtype).min, numpy.finfo(dtype).max, num=numel).astype(dtype)
print(len(numpy.unique(out)))
```

Printing this shows there are actually only two distinct values. Decimal numbers are not the same as floating-point numbers: floats are finite (the number of binary digits is fixed) while decimals are infinite. Floats have limited precision, so two decimals whose gap falls below that precision cannot be represented as two distinct floats, and the larger a float is, the larger the gap between adjacent representable values.
You're right, sorry for not thinking it through. float16 can represent only sixty-odd thousand distinct numbers at most, so this case will indeed behave randomly.
Have all the topk cases passed?

Yep, they have. The only one that didn't pass has been put in random_calculation.txt.
Cutelemon6 left a comment:
LGTM
wanghuancoder left a comment:
LGTM
Note that the indices topk returns are not guaranteed to be stable when elements compare equal; pay more attention to the values.
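The same caveat can be shown with plain NumPy sorting (a sketch; paddle.topk itself is not exercised here): with tied values, different algorithms may order the tied indices differently, but the selected top-k values agree either way.

```python
import numpy as np

x = np.array([3.0, 1.0, 3.0, 2.0])
k = 2

# Two sort algorithms may break the tie between the two 3.0s differently...
idx_stable = np.argsort(-x, kind="stable")[:k]
idx_quick = np.argsort(-x, kind="quicksort")[:k]

# ...but the selected top-k values are identical, so accuracy checks
# should compare values (and sort before comparing), not raw indices.
print(np.sort(x[idx_stable]), np.sort(x[idx_quick]))  # both [3. 3.]
```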