[SPARK-5801] [core] Avoid creating nested directories. #4747
Conversation
Cache the value of the local root dirs to use for storing local data, so that the same directory is reused. Also, to avoid an extra level of nesting, use a different env variable to propagate the local dirs from the Worker to the executors. And make the executor directory use a different name.
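For illustration only (not code from this PR), here is a minimal Scala sketch of the caching idea described above; the object, method, and field names are hypothetical and do not mirror the actual Utils.scala code:

```scala
import java.io.File
import java.util.UUID

// Hypothetical sketch: cache the resolved local root dirs so every caller in
// the same JVM reuses the same directories instead of creating new ones.
object LocalDirsSketch {
  @volatile private var cachedRootDirs: Array[String] = null

  // Returns the same directories on every call once they have been created.
  def getOrCreateLocalRootDirs(configuredDirs: Seq[String]): Array[String] = synchronized {
    if (cachedRootDirs == null) {
      cachedRootDirs = configuredDirs.map { root =>
        val dir = new File(root, "spark-" + UUID.randomUUID().toString)
        dir.mkdirs()
        dir.getAbsolutePath
      }.toArray
    }
    cachedRootDirs
  }
}
```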
Test build #27903 has started for PR 4747 at commit
I think these are dumb questions, but why does the fix entail setting a different env variable? And should the dirs be joined with a path separator?
Without this, Utils.scala would always create a subdirectory for the executor under the directory already created by the Worker, so you'd always have "spark-xxxxx/spark-yyyyy" for every executor. It would always stay at that single extra level (no further nesting), but since I'm changing this code anyway, removing it seemed like a good enhancement.
And dirs should be joined with a path separator because that's the right way to do it.
Oh right, this is path.separator rather than file.separator, so `:` and not `/`.
Yes that's exactly the fix that needs to be made.
EDIT: I see it. I swear it didn't show the last file changed earlier. But it was probably user error.
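To make the separator distinction above concrete (an illustration, not code from the PR): the path separator joins several directories into a single list value, such as an environment variable, while the file separator joins components within one path.

```scala
import java.io.File

object SeparatorDemo extends App {
  val localDirs = Seq("/tmp/spark-local-1", "/tmp/spark-local-2")

  // path.separator (":" on Unix) joins multiple paths into one list value.
  println(localDirs.mkString(File.pathSeparator))
  // -> /tmp/spark-local-1:/tmp/spark-local-2

  // file.separator ("/" on Unix) joins components within a single path.
  println(Seq("/tmp/spark-local-1", "executor-dir").mkString(File.separator))
  // -> /tmp/spark-local-1/executor-dir
}
```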
Hold off a little bit on this; let me see how to handle the SparkConf issue (I think Josh added that for the unit tests).
Test build #27903 has finished for PR 4747 at commit
Test FAILed.
retest this please. We really need to fix the
Test build #27908 has started for PR 4747 at commit
Add a way for the test to work around the cache (even though the test was passing without this). Also add a comment explaining the new behavior of `getOrCreateLocalRootDirs()`.
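A sketch of what such a test workaround might look like, continuing the hypothetical LocalDirsSketch object above; the real hook in Utils, and its name, may differ:

```scala
  // Visible for tests: clears the cached dirs so they are re-resolved on the
  // next call. Hypothetical name, shown only to illustrate the workaround.
  def clearLocalRootDirsForTest(): Unit = synchronized {
    cachedRootDirs = null
  }
```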
Test build #27909 has started for PR 4747 at commit
Test build #27908 has finished for PR 4747 at commit
Test PASSed.
Test build #27909 has finished for PR 4747 at commit
Test PASSed.
Maybe we can call this function to delete the local root directory in non-YARN mode when the application exits or the SparkContext is stopped.
Note this method doesn't delete anything. There's a separate PR to clean up the root directories (#4759).
Do we need to register these for cleanup at shutdown? cf. #4759 (comment)
Not needed (see SPARK-4834). That's done by `maybeCleanupApplication` in this file when the application finishes.