[SPARK-28275][SQL][PYTHON][TESTS] Convert and port 'count.sql' into UDF test base #25089

vinodkc · 2019-07-10T03:02:25Z

What changes were proposed in this pull request?

This PR adds tests converted from 'count.sql' to test UDFs.

Diff compared to 'count.sql':

diff --git a/sql/core/src/test/resources/sql-tests/results/count.sql.out b/sql/core/src/test/resources/sql-tests/results/udf/udf-count.sql.out
index b8a86d4c44..9476937abd 100644
--- a/sql/core/src/test/resources/sql-tests/results/count.sql.out
+++ b/sql/core/src/test/resources/sql-tests/results/udf/udf-count.sql.out
@@ -14,42 +14,42 @@ struct<>
 
 -- !query 1
 SELECT
-  count(*), count(1), count(null), count(a), count(b), count(a + b), count((a, b))
+  udf(count(*)), udf(count(1)), udf(count(null)), udf(count(a)), udf(count(b)), udf(count(a + b)), udf(count((a, b)))
 FROM testData
 -- !query 1 schema
-struct<count(1):bigint,count(1):bigint,count(NULL):bigint,count(a):bigint,count(b):bigint,count((a + b)):bigint,count(named_struct(a, a, b, b)):bigint>
+struct<udf(count(1)):string,udf(count(1)):string,udf(count(null)):string,udf(count(a)):string,udf(count(b)):string,udf(count((a + b))):string,udf(count(named_struct(a, a, b, b))):string>
 -- !query 1 output
 7	7	0	5	5	4	7
 
 
 -- !query 2
 SELECT
-  count(DISTINCT 1),
-  count(DISTINCT null),
-  count(DISTINCT a),
-  count(DISTINCT b),
-  count(DISTINCT (a + b)),
-  count(DISTINCT (a, b))
+  udf(count(DISTINCT 1)),
+  udf(count(DISTINCT null)),
+  udf(count(DISTINCT a)),
+  udf(count(DISTINCT b)),
+  udf(count(DISTINCT (a + b))),
+  udf(count(DISTINCT (a, b)))
 FROM testData
 -- !query 2 schema
-struct<count(DISTINCT 1):bigint,count(DISTINCT NULL):bigint,count(DISTINCT a):bigint,count(DISTINCT b):bigint,count(DISTINCT (a + b)):bigint,count(DISTINCT named_struct(a, a, b, b)):bigint>
+struct<udf(count(distinct 1)):string,udf(count(distinct null)):string,udf(count(distinct a)):string,udf(count(distinct b)):string,udf(count(distinct (a + b))):string,udf(count(distinct named_struct(a, a, b, b))):string>
 -- !query 2 output
 1	0	2	2	2	6
 
 
 -- !query 3
-SELECT count(a, b), count(b, a), count(testData.*) FROM testData
+SELECT udf(count(a, b)), udf(count(b, a)), udf(count(testData.*)) FROM testData
 -- !query 3 schema
-struct<count(a, b):bigint,count(b, a):bigint,count(a, b):bigint>
+struct<udf(count(a, b)):string,udf(count(b, a)):string,udf(count(a, b)):string>
 -- !query 3 output
 4	4	4
 
 
 -- !query 4
 SELECT
-  count(DISTINCT a, b), count(DISTINCT b, a), count(DISTINCT *), count(DISTINCT testData.*)
+  udf(count(DISTINCT a, b)), udf(count(DISTINCT b, a)), udf(count(DISTINCT *)), udf(count(DISTINCT testData.*))
 FROM testData
 -- !query 4 schema
-struct<count(DISTINCT a, b):bigint,count(DISTINCT b, a):bigint,count(DISTINCT a, b):bigint,count(DISTINCT a, b):bigint>
+struct<udf(count(distinct a, b)):string,udf(count(distinct b, a)):string,udf(count(distinct a, b)):string,udf(count(distinct a, b)):string>
 -- !query 4 output
 3	3	3	3
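
In the diff above, the output rows are context lines, so the expected counts are identical in both files; only the query text and the schema lines change. As a reading aid, those counts can be reproduced in plain Python. This is a minimal sketch, not Spark code: the rows of `testData` do not appear in this PR and are assumed here, and `add`, `count`, `count_distinct`, and `count_multi` are hypothetical helpers that model SQL NULL with `None`.

```python
# ASSUMPTION: contents of the testData view as (a, b) rows; None stands for SQL NULL.
ROWS = [(1, 1), (1, 2), (2, 1), (1, 1), (None, 2), (1, None), (None, None)]


def add(a, b):
    """SQL-style a + b: the result is NULL if either operand is NULL."""
    return None if a is None or b is None else a + b


def count(values):
    """count(expr): number of non-NULL values."""
    return sum(v is not None for v in values)


def count_distinct(values):
    """count(DISTINCT expr): number of distinct non-NULL values."""
    return len({v for v in values if v is not None})


def count_multi(rows, distinct=False):
    """count(x, y): only rows where every argument is non-NULL are counted."""
    kept = [r for r in rows if all(v is not None for v in r)]
    return len(set(kept)) if distinct else len(kept)


a_col = [a for a, _ in ROWS]
b_col = [b for _, b in ROWS]
sums = [add(a, b) for a, b in ROWS]
swapped = [(b, a) for a, b in ROWS]
ones = [1] * len(ROWS)
nulls = [None] * len(ROWS)

# Query 1: a struct such as (a, b) is never NULL itself, so every row counts.
query1 = [len(ROWS), count(ones), count(nulls), count(a_col),
          count(b_col), count(sums), count(ROWS)]

# Query 2: the DISTINCT variants of the same expressions.
query2 = [count_distinct(ones), count_distinct(nulls), count_distinct(a_col),
          count_distinct(b_col), count_distinct(sums), count_distinct(ROWS)]

# Query 3: multi-argument count, including testData.* expanded to (a, b).
query3 = [count_multi(ROWS), count_multi(swapped), count_multi(ROWS)]

# Query 4: multi-argument count with DISTINCT.
query4 = [count_multi(ROWS, distinct=True), count_multi(swapped, distinct=True),
          count_multi(ROWS, distinct=True), count_multi(ROWS, distinct=True)]

print(query1)  # [7, 7, 0, 5, 5, 4, 7]
print(query2)  # [1, 0, 2, 2, 2, 6]
print(query3)  # [4, 4, 4]
print(query4)  # [3, 3, 3, 3]
```

Under the assumed rows, the four printed lists match the `!query 1` to `!query 4` output lines in the diff. Note the difference between `count((a, b))`, which counts a single struct argument and so returns 7, and `count(a, b)`, which takes two arguments and skips any row containing a NULL.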

How was this patch tested?

Tested as guided in SPARK-27921.

HyukjinKwon · 2019-07-10T03:11:01Z

Hey @vinodkc, thanks for taking a look at this one. Actually, we should add the output files too :-). Mind double-checking the guide I wrote at SPARK-27921, step by step?

vinodkc · 2019-07-10T03:35:15Z

@HyukjinKwon, thanks for the guidance. I've added the output file now.

HyukjinKwon · 2019-07-10T04:28:16Z

sql/core/src/test/resources/sql-tests/inputs/udf/udf-count.sql

+
+-- count with single expression
+SELECT
+  udf(count(*)), udf(count(1)), udf(count(null)), udf(count(a)), udf(count(b)), udf(count(a + b)), udf(count((a, b)))


@vinodkc, can you add some other combinations, like udf(count(*)), count(udf(a)), and udf(count(udf(a)))?
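
The combinations requested here differ in where the UDF sits relative to the aggregate: around the result, around the aggregated column, or both. A rough Python sketch of the three placements follows. It assumes the test UDF stringifies its input and passes NULL through (consistent with the `string` columns in the expected schema, but not confirmed in this thread), and it uses hypothetical `testData` rows that do not appear in this PR.

```python
# ASSUMPTION: contents of the testData view as (a, b) rows; None stands for SQL NULL.
ROWS = [(1, 1), (1, 2), (2, 1), (1, 1), (None, 2), (1, None), (None, None)]


def udf(x):
    """ASSUMED model of the test UDF: cast to string, keep NULL as NULL."""
    return None if x is None else str(x)


def count(values):
    """count(expr): number of non-NULL values."""
    return sum(v is not None for v in values)


a_col = [a for a, _ in ROWS]

# udf(count(*)): the UDF wraps the aggregate result.
outer = udf(len(ROWS))

# count(udf(a)): the UDF runs per row, inside the aggregate.
inner = count(udf(a) for a in a_col)

# udf(count(udf(a))): the UDF is applied at both levels.
both = udf(count(udf(a) for a in a_col))

print(repr(outer), repr(inner), repr(both))  # '7' 5 '5'
```

In this model the inner placement leaves the count unchanged because NULLs stay NULL, while the outer placement only changes the result type. Exercising both positions in the tests checks that the UDF is evaluated correctly before and after aggregation.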

HyukjinKwon · 2019-07-10T04:28:46Z

sql/core/src/test/resources/sql-tests/inputs/udf/udf-count.sql

+  udf(count(DISTINCT a)),
+  udf(count(DISTINCT b)),
+  udf(count(DISTINCT (a + b))),
+  udf(count(DISTINCT (a, b)))


Here too :-)

HyukjinKwon · 2019-07-10T04:29:06Z

Looks fine if there is no output diff compared to the original file.

SparkQA · 2019-07-10T05:13:22Z

Test build #107427 has finished for PR 25089 at commit c7b4c46.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2019-07-10T06:40:02Z

Test build #107429 has finished for PR 25089 at commit b173330.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

HyukjinKwon · 2019-07-11T00:39:27Z

Merged to master.

sql/core/src/test/resources/sql-tests/inputs/udf/udf-count.sql

c7b4c46

added udf-count.sql.out

b173330

HyukjinKwon changed the title ~~[SPARK-28275][SQL] Convert and port 'count.sql' into UDF test base~~ [SPARK-28275][SQL][PYTHON][TESTS] Convert and port 'count.sql' into UDF test base Jul 10, 2019

HyukjinKwon reviewed Jul 10, 2019

View reviewed changes

dongjoon-hyun added PYSPARK SQL TESTS labels Jul 10, 2019

HyukjinKwon closed this in b598dfd Jul 11, 2019

4 participants
