Add bincount op #36317

smallv0221 · 2021-10-09T14:21:24Z

PR types

New features

PR changes

OPs

Describe

Add bincount op
中文文档pr：PaddlePaddle/docs#3959
英文文档：

paddle-bot-old · 2021-10-09T14:21:27Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

zhiqiu

LGTM for op_function_generator.cc

TCChenlong · 2021-10-12T01:47:44Z

python/paddle/tensor/linalg.py

+
+    Args:
+        x (Tensor): A Tensor with non-negative integer. Should be 1-D tensor.
+        weights (Tensor, optional): Weight for each value in the input tensor. Should have the same shape as input.


Default is None.

TCChenlong · 2021-10-12T01:47:54Z

python/paddle/tensor/linalg.py

+    Args:
+        x (Tensor): A Tensor with non-negative integer. Should be 1-D tensor.
+        weights (Tensor, optional): Weight for each value in the input tensor. Should have the same shape as input.
+        minlength (int): Minimum number of bins. Should be non-negative integer.


int -> int, optional
Default is 0.

guoshengCS · 2021-10-18T12:07:40Z

paddle/fluid/operators/bincount_op.h

+          static_cast<double>(0));
+      for (int64_t i = 0; i < input_numel; i++) {
+        output_data[input_data[i]] += static_cast<double>(weights_data[i]);
+      }


这个float和double的分支代码是否可以使用T合并，也和int类型的Weights更适配

这边是为了保证与竞品一致。weights可以是任意数据类型，只有当weights float32时，output的类型才是float32。其他三种情况下output都是float64。

guoshengCS · 2021-10-18T12:20:40Z

paddle/fluid/operators/bincount_op.cu

+
+      KernelBincount<T, InputT, double><<<GET_BLOCKS(input_numel),
+                                          PADDLE_CUDA_NUM_THREADS, 0, stream>>>(
+          input_data, input_numel, has_weights, weights_data, output_data);


同CPU，float和double的分支代码是否可以使用T合并

guoshengCS · 2021-10-18T12:27:42Z

python/paddle/tensor/linalg.py

+
+    Args:
+        x (Tensor): A Tensor with non-negative integer. Should be 1-D tensor.
+        weights (Tensor, optional): Weight for each value in the input tensor. Should have the same shape as input. Default is None.


weight的dtype是否也要说明

jeff41404 · 2021-10-19T13:34:28Z

python/paddle/tensor/__init__.py

@@ -44,6 +44,7 @@
 from .linalg import cholesky  # noqa: F401
 from .linalg import bmm  # noqa: F401
 from .linalg import histogram  # noqa: F401
+from .linalg import bincount  # noqa: F401


we shall also add bincount in tensor_method_func list below to get paddle.Tensor.bincount

Done, thanks!

jeff41404 · 2021-10-19T13:47:43Z

paddle/fluid/operators/bincount_op.cu

+  PADDLE_ENFORCE_GE(
+      input_min, static_cast<InputT>(0),
+      platform::errors::InvalidArgument(
+          "The elements in input tensor must be non-negative ints"));


the check of PADDLE_ENFORCE* should in InferShape rather than in Compute, so we can detect illegal input earlier

Done, thanks!

jeff41404 · 2021-10-19T13:48:27Z

paddle/fluid/operators/bincount_op.cu

+    PADDLE_ENFORCE_EQ(input_type_match, true,
+                      platform::errors::InvalidArgument(
+                          "Input(X) holds the wrong type, it holds %s, but "
+                          "desires to be %s or %s",
+                          paddle::framework::DataTypeToString(input_type),
+                          paddle::framework::DataTypeToString(
+                              framework::proto::VarType::INT32),
+                          paddle::framework::DataTypeToString(
+                              framework::proto::VarType::INT64)));


the check of PADDLE_ENFORCE* should in InferShape rather than in Compute, so we can detect illegal input earlier

Done, thanks!

jeff41404 · 2021-10-19T13:48:47Z

paddle/fluid/operators/bincount_op.cu

+    PADDLE_ENFORCE_EQ(
+        platform::is_gpu_place(context.GetPlace()), true,
+        platform::errors::InvalidArgument("It must use CUDAPlace."));


the check of PADDLE_ENFORCE* should in InferShape rather than in Compute, so we can detect illegal input earlier. but in this case, because register BincountCUDAKernel in cuda, no need this check.

Done, thanks!

jeff41404 · 2021-10-19T13:50:35Z

paddle/fluid/operators/bincount_op.h

+  PADDLE_ENFORCE_GE(
+      *std::min_element(input_data, input_data + input_numel),
+      static_cast<InputT>(0),
+      platform::errors::InvalidArgument(
+          "The elements in input tensor must be non-negative ints"));


the check of PADDLE_ENFORCE* should in InferShape rather than in Compute, so we can detect illegal input earlier

Done, thanks!

jeff41404 · 2021-10-19T13:50:59Z

paddle/fluid/operators/bincount_op.h

+    PADDLE_ENFORCE_EQ(input_type_match, true,
+                      platform::errors::InvalidArgument(
+                          "Input(X) holds the wrong type, it holds %s, but "
+                          "desires to be %s or %s",
+                          paddle::framework::DataTypeToString(input_type),
+                          paddle::framework::DataTypeToString(
+                              framework::proto::VarType::INT32),
+                          paddle::framework::DataTypeToString(
+                              framework::proto::VarType::INT64)));


the check of PADDLE_ENFORCE* should in InferShape rather than in Compute, so we can detect illegal input earlier

Done, thanks!

jeff41404 · 2021-10-20T02:51:07Z

python/paddle/tensor/linalg.py

+    if paddle.max(x) < 0:
+        raise ValueError("Elements in Input(x) should all be non-negative")


x (Tensor): A Tensor with non-negative integer. Should be 1-D tensor.
shall we must check:

paddle.min(x) >= 0:

x.ndim == 1;

x.numel() != 0

jeff41404

lgtm

XiaoguangHu01

LG API

TCChenlong · 2021-10-22T04:43:33Z

python/paddle/tensor/linalg.py

+    Args:
+        x (Tensor): A Tensor with non-negative integer. Should be 1-D tensor.
+        weights (Tensor, optional): Weight for each value in the input tensor. Should have the same shape as input. Default is None.
+        minlength (int, optional): Minimum number of bins. Should be non-negative integer. Default is 0.


少了name参数

XiaoguangHu01

LG API

zhiqiu

LGTM for op_function_generator.cc

* Add bincount op * upload cpu version * fix unitest * fix unittest * fix unittest * fix en doc * add more test * fix en doc * add more test case * fix test * fix input vailidation * fix input check * fix unittest * fix test * fix en doc cherry-pick

Add bincount op

c042720

smallv0221 added 6 commits October 10, 2021 13:49

upload cpu version

421c118

fix unitest

d26d729

fix unittest

78ac5e0

fix unittest

2b518ed

fix en doc

b4347e2

add more test

4feec4f

zhiqiu reviewed Oct 11, 2021

View reviewed changes

TCChenlong reviewed Oct 12, 2021

View reviewed changes

fix en doc

994829c

guoshengCS reviewed Oct 18, 2021

View reviewed changes

smallv0221 added 2 commits October 19, 2021 09:17

add more test case

0b9ecb4

fix test

b75b67d

jeff41404 reviewed Oct 19, 2021

View reviewed changes

fix input vailidation

4cdc2e8

jeff41404 reviewed Oct 20, 2021

View reviewed changes

fix input check

aa83dd0

jeff41404 previously approved these changes Oct 20, 2021

View reviewed changes

smallv0221 added 2 commits October 20, 2021 07:45

fix unittest

74b18ea

fix test

401062e

smallv0221 dismissed jeff41404’s stale review via 401062e October 20, 2021 09:33

jeff41404 previously approved these changes Oct 21, 2021

View reviewed changes

XiaoguangHu01 previously approved these changes Oct 21, 2021

View reviewed changes

TCChenlong reviewed Oct 22, 2021

View reviewed changes

fix en doc

647aa0d

smallv0221 dismissed stale reviews from XiaoguangHu01 and jeff41404 via 647aa0d October 22, 2021 04:48

TCChenlong previously approved these changes Oct 22, 2021

View reviewed changes

Merge branch 'develop' into yxp210927

9f4590d

smallv0221 dismissed TCChenlong’s stale review via 9f4590d October 22, 2021 07:54

XiaoguangHu01 approved these changes Oct 25, 2021

View reviewed changes

zhiqiu approved these changes Oct 25, 2021

View reviewed changes

TCChenlong approved these changes Oct 25, 2021

View reviewed changes

jeff41404 merged commit 39f1912 into PaddlePaddle:develop Oct 25, 2021

smallv0221 mentioned this pull request Oct 25, 2021

[cherry-pick-2.2]Add bincount op #36709

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add bincount op #36317

Add bincount op #36317

smallv0221 commented Oct 9, 2021 •

edited

Loading

paddle-bot-old bot commented Oct 9, 2021

zhiqiu left a comment

TCChenlong Oct 12, 2021

smallv0221 Oct 12, 2021

TCChenlong Oct 12, 2021

smallv0221 Oct 12, 2021

guoshengCS Oct 18, 2021

smallv0221 Oct 19, 2021

guoshengCS Oct 18, 2021

smallv0221 Oct 19, 2021

guoshengCS Oct 18, 2021

jeff41404 Oct 19, 2021

smallv0221 Oct 19, 2021

jeff41404 Oct 19, 2021

smallv0221 Oct 19, 2021

jeff41404 Oct 19, 2021

smallv0221 Oct 19, 2021

jeff41404 Oct 19, 2021 •

edited

Loading

smallv0221 Oct 19, 2021

jeff41404 Oct 19, 2021

smallv0221 Oct 19, 2021

jeff41404 Oct 19, 2021

smallv0221 Oct 19, 2021

jeff41404 Oct 20, 2021 •

edited

Loading

jeff41404 left a comment

XiaoguangHu01 left a comment

TCChenlong Oct 22, 2021

XiaoguangHu01 left a comment

zhiqiu left a comment

		if paddle.max(x) < 0:
		raise ValueError("Elements in Input(x) should all be non-negative")

Add bincount op #36317

Add bincount op #36317

Conversation

smallv0221 commented Oct 9, 2021 • edited Loading

PR types

PR changes

Describe

paddle-bot-old bot commented Oct 9, 2021

zhiqiu left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jeff41404 Oct 19, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jeff41404 Oct 20, 2021 • edited Loading

Choose a reason for hiding this comment

jeff41404 left a comment

Choose a reason for hiding this comment

XiaoguangHu01 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

XiaoguangHu01 left a comment

Choose a reason for hiding this comment

zhiqiu left a comment

Choose a reason for hiding this comment

smallv0221 commented Oct 9, 2021 •

edited

Loading

jeff41404 Oct 19, 2021 •

edited

Loading

jeff41404 Oct 20, 2021 •

edited

Loading