[SPARK-20894][SS]Resolve the checkpoint location in driver and use the resolved path in state store by zsxwing · Pull Request #18149 · apache/spark

zsxwing · 2017-05-30T22:23:58Z

What changes were proposed in this pull request?

When the user runs a Structured Streaming query in a cluster, if the driver uses the local file system, StateStore running in executors will throw a file-not-found exception. However, the current error is not obvious.

This PR makes StreamExecution resolve the path in driver and uses the full path including the scheme part (such as hdfs:/, file:/) in StateStore.

Then if the above error happens, StateStore will throw an error with this full path which starts with file:/, and it makes this error obvious: the checkpoint location is on the local file system.

One potential minor issue is that the user cannot use different default file system settings in driver and executors (e.g., use a public HDFS address in driver and a private HDFS address in executors) after this change. However, since the batch query also has this issue (See

spark/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala

Line 402 in 4bb6a53

path.makeQualified(fs.getUri, fs.getWorkingDirectory)

), it doesn't make things worse.

How was this patch tested?

The new added test.

…n state store

SparkQA · 2017-05-31T00:35:39Z

Test build #77558 has finished for PR 18149 at commit 133f0dd.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

tdas · 2017-05-31T19:56:01Z


  /**
   * Processes any data available between `availableOffsets` and `committedOffsets`.
+   *


NIT: extra line

tdas · 2017-05-31T19:58:38Z

+      val query = MemoryStream[Int].toDF
+        .writeStream
+        .option("checkpointLocation", checkpointLocation)
+        .format("console").start()


NIT: .start() on the next line.

tdas · 2017-05-31T20:00:54Z

roughly LGTM, as long as you resolve conflicts and tests pass.

SparkQA · 2017-05-31T22:25:37Z

Test build #77605 has finished for PR 18149 at commit b099c56.

This patch passes all tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
case class DayOfWeek(child: Expression) extends UnaryExpression with ImplicitCastInputTypes
case class StringReplace(srcExpr: Expression, searchExpr: Expression, replaceExpr: Expression)
trait Command extends LogicalPlan
case class ExecutedCommandExec(cmd: RunnableCommand, children: Seq[SparkPlan]) extends SparkPlan
case class StateStoreId(
class UnsafeRowPair(var key: UnsafeRow = null, var value: UnsafeRow = null)
trait StateStoreWriter extends StatefulOperator

zsxwing · 2017-06-01T00:23:56Z

Thanks! Merging to master and ~~2.2~~.

…he resolved path in state store When the user runs a Structured Streaming query in a cluster, if the driver uses the local file system, StateStore running in executors will throw a file-not-found exception. However, the current error is not obvious. This PR makes StreamExecution resolve the path in driver and uses the full path including the scheme part (such as `hdfs:/`, `file:/`) in StateStore. Then if the above error happens, StateStore will throw an error with this full path which starts with `file:/`, and it makes this error obvious: the checkpoint location is on the local file system. One potential minor issue is that the user cannot use different default file system settings in driver and executors (e.g., use a public HDFS address in driver and a private HDFS address in executors) after this change. However, since the batch query also has this issue (See https://github.com/apache/spark/blob/4bb6a53ebd06de3de97139a2dbc7c85fc3aa3e66/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala#L402), it doesn't make things worse. The new added test. Author: Shixiong Zhu <shixiong@databricks.com> Closes apache#18149 from zsxwing/SPARK-20894.

Resolve the checkpoint location in driver and use the resolved path i…

133f0dd

…n state store

tdas reviewed May 31, 2017

View reviewed changes

Merge remote-tracking branch 'origin/master' into SPARK-20894

b099c56

asfgit closed this in 2bc3272 Jun 1, 2017

zsxwing deleted the SPARK-20894 branch June 1, 2017 00:29

zsxwing mentioned this pull request Jun 1, 2017

[SPARK-20894][SS] Resolve the checkpoint location in driver and use the resolved path in state store (branch-2.2) #18179

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-20894][SS]Resolve the checkpoint location in driver and use the resolved path in state store#18149

[SPARK-20894][SS]Resolve the checkpoint location in driver and use the resolved path in state store#18149
zsxwing wants to merge 2 commits intoapache:masterfrom
zsxwing:SPARK-20894

zsxwing commented May 30, 2017

Uh oh!

SparkQA commented May 31, 2017

Uh oh!

tdas May 31, 2017 •

edited

Loading

Uh oh!

tdas May 31, 2017 •

edited

Loading

Uh oh!

tdas commented May 31, 2017

Uh oh!

SparkQA commented May 31, 2017

Uh oh!

zsxwing commented Jun 1, 2017 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

zsxwing commented May 30, 2017

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

SparkQA commented May 31, 2017

Uh oh!

tdas May 31, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tdas May 31, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tdas commented May 31, 2017

Uh oh!

SparkQA commented May 31, 2017

Uh oh!

zsxwing commented Jun 1, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tdas May 31, 2017 •

edited

Loading

tdas May 31, 2017 •

edited

Loading

zsxwing commented Jun 1, 2017 •

edited

Loading