Conversation

@zsxwing
Member

@zsxwing zsxwing commented Nov 18, 2016

What changes were proposed in this pull request?

The issue in ForeachSink is that the newly created Dataset still uses the old QueryExecution. When foreachPartition is called, QueryExecution.toString is invoked and fails because it does not know how to plan EventTimeWatermark.

This PR just replaces the QueryExecution with IncrementalExecution to fix the issue.
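
To make the failure mode concrete, here is a minimal, dependency-free sketch in Scala. It is a toy model, not Spark's code: `ToyQueryExecution`, `ToyIncrementalExecution`, `Source`, and the string-based "physical plan" are hypothetical stand-ins that only mimic the roles named above. The assumption being illustrated is that the batch-style execution has no planning rule for the watermark node, while the incremental one does.

```scala
object WatermarkPlanningSketch {
  // Toy logical plan nodes (hypothetical; not Spark's real classes).
  sealed trait LogicalPlan
  final case class Source(name: String) extends LogicalPlan
  final case class EventTimeWatermark(column: String, child: LogicalPlan) extends LogicalPlan

  // Stand-in for a batch-style execution: it has no rule for watermark nodes.
  class ToyQueryExecution(val logical: LogicalPlan) {
    protected def planNode(p: LogicalPlan): String = p match {
      case Source(name) => s"Scan($name)"
      case other        => throw new IllegalStateException(s"No plan for $other")
    }

    // Planning is lazy, so the failure only surfaces when the plan is needed.
    lazy val physical: String = planNode(logical)

    // Rendering the execution forces planning, which is where the toy failure shows up.
    override def toString: String = s"== Physical Plan ==\n$physical"
  }

  // Stand-in for a streaming-aware execution: adds the missing rule and
  // delegates everything else to the parent.
  class ToyIncrementalExecution(plan: LogicalPlan) extends ToyQueryExecution(plan) {
    override protected def planNode(p: LogicalPlan): String = p match {
      case EventTimeWatermark(col, child) => s"WatermarkExec($col, ${planNode(child)})"
      case other                          => super.planNode(other)
    }
  }

  def main(args: Array[String]): Unit = {
    val plan = EventTimeWatermark("eventTime", Source("events"))

    // Reusing the batch-style execution fails as soon as toString is called.
    val failed = scala.util.Try(new ToyQueryExecution(plan).toString)
    println(failed.isFailure) // prints "true"

    // The streaming-aware execution plans the same logical plan successfully.
    println(new ToyIncrementalExecution(plan))
  }
}
```

In this model the fix has the same shape as the one described in the PR: the logical plan is left alone, and only the execution object wrapped around it is swapped for one that knows how to plan the watermark node.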

How was this patch tested?

test("foreach with watermark").

@zsxwing
Member Author

zsxwing commented Nov 18, 2016

cc @marmbrus

@marmbrus
Contributor

LGTM

@SparkQA

SparkQA commented Nov 19, 2016

Test build #68873 has finished for PR 15934 at commit 0f3e4af.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@tdas
Contributor

tdas commented Nov 19, 2016

Merging to master, 2.1.

@tdas
Contributor

tdas commented Nov 19, 2016

Seems like this is a correctness bug, as queries that use aggregations will give wrong results. This should ideally be fixed in 2.0 as well, independent of watermarks.

asfgit pushed a commit that referenced this pull request Nov 19, 2016
Author: Shixiong Zhu <[email protected]>

Closes #15934 from zsxwing/SPARK-18497.

(cherry picked from commit 2a40de4)
Signed-off-by: Tathagata Das <[email protected]>
@asfgit asfgit closed this in 2a40de4 Nov 19, 2016
@zsxwing zsxwing deleted the SPARK-18497 branch January 4, 2017 19:59
uzadude pushed a commit to uzadude/spark that referenced this pull request Jan 27, 2017
Closes apache#15934 from zsxwing/SPARK-18497.