Skip to content

Conversation

@HeartSaVioR
Copy link
Contributor

@HeartSaVioR HeartSaVioR commented Nov 24, 2022

What changes were proposed in this pull request?

This PR proposes to add a new test suite specifically for self-union tests on streaming query. The test cases are acceptance tests for 4 different cases, DSv1 vs DSv2 / DataStreamReader API vs table API.

Why are the changes needed?

This PR brings more test coverage on streaming workloads. We should have caught an issue during the work of SPARK-39564 if we had this test suite.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

New test suite.

@HeartSaVioR
Copy link
Contributor Author

cc. @zsxwing @viirya Please take a look. Thanks!

@viirya viirya changed the title [SPARK-41249][SS] Add acceptance test for self-union on streaming query [SPARK-41249][SS][TEST] Add acceptance test for self-union on streaming query Nov 24, 2022
val unionedDf = streamDf.union(streamDf)

testStream(unionedDf) (
AddData(stream, 1, 2, 3),
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If the data is written into the table location, instead of through the stream (i.e. like self-union, DSv1, read via table API), is it also working?

Copy link
Contributor Author

@HeartSaVioR HeartSaVioR Nov 24, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think so - here we also read from table via table API. (We use readStream.table.) It's just that the table is backed by MemoryStream.

Copy link
Member

@viirya viirya left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks. Great to see improved test coverage.

@HeartSaVioR
Copy link
Contributor Author

Thanks! Merging to master.

beliefer pushed a commit to beliefer/spark that referenced this pull request Dec 15, 2022
…ng query

### What changes were proposed in this pull request?

This PR proposes to add a new test suite specifically for self-union tests on streaming query. The test cases are acceptance tests for 4 different cases, DSv1 vs DSv2 / DataStreamReader API vs table API.

### Why are the changes needed?

This PR brings more test coverage on streaming workloads. We should have caught an issue during the work of [SPARK-39564](https://issues.apache.org/jira/browse/SPARK-39564) if we had this test suite.

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

New test suite.

Closes apache#38785 from HeartSaVioR/SPARK-41249.

Authored-by: Jungtaek Lim <[email protected]>
Signed-off-by: Jungtaek Lim <[email protected]>
beliefer pushed a commit to beliefer/spark that referenced this pull request Dec 18, 2022
…ng query

### What changes were proposed in this pull request?

This PR proposes to add a new test suite specifically for self-union tests on streaming query. The test cases are acceptance tests for 4 different cases, DSv1 vs DSv2 / DataStreamReader API vs table API.

### Why are the changes needed?

This PR brings more test coverage on streaming workloads. We should have caught an issue during the work of [SPARK-39564](https://issues.apache.org/jira/browse/SPARK-39564) if we had this test suite.

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

New test suite.

Closes apache#38785 from HeartSaVioR/SPARK-41249.

Authored-by: Jungtaek Lim <[email protected]>
Signed-off-by: Jungtaek Lim <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants