Skip to content

Conversation

@itholic
Copy link
Contributor

@itholic itholic commented Sep 14, 2022

What changes were proposed in this pull request?

This PR proposes to integrate Grouped Aggregate Pandas UDF tests into *.sql test cases.

This PR includes the fixes below:

  • Add UDAFTestCase into SQLQueryTestSuite.scala to test the UDAF related functions in sql.
  • Add udaf directory and create related sql test cases into this directory.
  • Generate golden files for new added sql test files.
  • Skip from ThriftServerQueryTestSuite.scala for now.
  • Fix minor typos.

Why are the changes needed?

To improve the test coverage, so prevent the potential bug in the future.

Does this PR introduce any user-facing change?

No, it's test-only.

How was this patch tested?

Added sql test files and corresponding golden files.

@github-actions github-actions bot added the SQL label Sep 14, 2022
@itholic
Copy link
Contributor Author

itholic commented Sep 19, 2022

cc @HyukjinKwon could you help me on review when you find some time ?

@HyukjinKwon
Copy link
Member

Merged to master.

LuciferYang pushed a commit to LuciferYang/spark that referenced this pull request Sep 20, 2022
…o *.sql test cases

### What changes were proposed in this pull request?

This PR proposes to integrate Grouped Aggregate Pandas UDF tests into *.sql test cases.

This PR includes the fixes below:
- Add `UDAFTestCase` into `SQLQueryTestSuite.scala` to test the UDAF related functions in sql.
- Add `udaf` directory and create related sql test cases into this directory.
- Generate golden files for new added sql test files.
- Skip from `ThriftServerQueryTestSuite.scala` for now.
- Fix minor typos.

### Why are the changes needed?

To improve the test coverage, so prevent the potential bug in the future.

### Does this PR introduce _any_ user-facing change?

No, it's test-only.

### How was this patch tested?

Added sql test files and corresponding golden files.

Closes apache#37873 from itholic/SPARK-40419.

Authored-by: itholic <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
HyukjinKwon pushed a commit that referenced this pull request Sep 20, 2022
… unavailable

### What changes were proposed in this pull request?

This PR follow-up for #37873.

The UDAF test should be skipped when pyspark, pandas and/or pyarrow is unavailable with proper message.

### Why are the changes needed?

Skip the test properly when it's unavailable.

### Does this PR introduce _any_ user-facing change?

No, it's test only.

### How was this patch tested?

Manually test to be skipped when missing package.

![Screen Shot 2022-09-20 at 6 11 30 PM](https://user-images.githubusercontent.com/44108233/191218078-254605b4-157e-4e22-8015-22399ff5e0b2.png)

Closes #37946 from itholic/SPARK-40419-followup.

Authored-by: itholic <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
dongjoon-hyun pushed a commit that referenced this pull request Dec 5, 2022
### What changes were proposed in this pull request?
Remove the file `results/udaf.sql.out` because it is not generated anymore after #37873.

### Why are the changes needed?
The duplicate file might confuse while searching some text in golden files.

### Does this PR introduce _any_ user-facing change?
No.

### How was this patch tested?
By running the affected test suite:
```
$ PYSPARK_PYTHON=python3 build/sbt "sql/testOnly org.apache.spark.sql.SQLQueryTestSuite"
```

Closes #38919 from MaxGekk/rm-udaf.sql.out.

Authored-by: Max Gekk <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
beliefer pushed a commit to beliefer/spark that referenced this pull request Dec 18, 2022
### What changes were proposed in this pull request?
Remove the file `results/udaf.sql.out` because it is not generated anymore after apache#37873.

### Why are the changes needed?
The duplicate file might confuse while searching some text in golden files.

### Does this PR introduce _any_ user-facing change?
No.

### How was this patch tested?
By running the affected test suite:
```
$ PYSPARK_PYTHON=python3 build/sbt "sql/testOnly org.apache.spark.sql.SQLQueryTestSuite"
```

Closes apache#38919 from MaxGekk/rm-udaf.sql.out.

Authored-by: Max Gekk <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
@itholic itholic deleted the SPARK-40419 branch April 22, 2023 05:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants