[SPARK-36599][CORE] Fix the http class server to work again #33849

yellowflash · 2021-08-26T10:19:46Z

HTTP based class servers no longer work because Spark switched to Hadoop Filesystem based implementation for HTTP class servers and the hadoop http filesystem is quirky in the way it accepts paths.

AmplabJenkins · 2021-08-26T10:51:34Z

Can one of the admins verify this patch?

srowen · 2021-08-28T14:18:04Z

repl/src/main/scala/org/apache/spark/repl/ExecutorClassLoader.scala

  val parentLoader = new ParentClassLoader(parent)

-  // Allows HTTP connect and read timeouts to be controlled for testing / debugging purposes
-  private[repl] var httpUrlConnectionTimeoutMillis: Int = -1


I agree this is unused

srowen · 2021-08-28T14:18:16Z

repl/src/main/scala/org/apache/spark/repl/ExecutorClassLoader.scala

  private def getClassFileInputStreamFromFileSystem(fileSystem: FileSystem)(
      pathInDirectory: String): InputStream = {
-    val path = new Path(directory, pathInDirectory)
+    val path = new Path(new Path(uri), pathInDirectory)


Can you explain a little more what inputs fail without this change?

Hadoop Http filesystem require the paths to be fully qualified URLs.
It does path.toUri.toUrl which fails in our case because the Path is not fully qualified.
So the class loader doesn't work if the class uri is http://..../ I raised a PR on hadoop too,

apache/hadoop#3338
But this is a regression on spark, as it used to work with a very specific implementation for http based class servers and now since it uses the Filesystem api for everything and it doesn't work anymore.

yellowflash · 2021-09-03T05:59:34Z

This hasn't been working since, 5085739

github-actions · 2021-12-13T00:11:15Z

We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable.
If you'd like to revive this PR, please reopen it and ask a committer to remove the Stale tag!

github-actions bot added the SPARK SHELL label Aug 26, 2021

yellowflash force-pushed the fix-empty-path-absolute-uri-http branch from 5ff0a30 to eea16e8 Compare August 26, 2021 10:23

yellowflash changed the title ~~SPARK-36599 Fix the http class server to work again~~ [SPARK-36599][CORE] Fix the http class server to work again Aug 26, 2021

yellowflash force-pushed the fix-empty-path-absolute-uri-http branch from eea16e8 to 807c0d9 Compare August 26, 2021 10:28

yellowflash force-pushed the fix-empty-path-absolute-uri-http branch from 807c0d9 to c3fb027 Compare August 27, 2021 00:42

SPARK-36599 Fix the http class server to work again

3c1ed7b

yellowflash force-pushed the fix-empty-path-absolute-uri-http branch from c3fb027 to 3c1ed7b Compare August 27, 2021 00:47

srowen reviewed Aug 28, 2021

View reviewed changes

github-actions bot added the Stale label Dec 13, 2021

github-actions bot closed this Dec 14, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-36599][CORE] Fix the http class server to work again #33849

[SPARK-36599][CORE] Fix the http class server to work again #33849

Uh oh!

yellowflash commented Aug 26, 2021 •

edited

Loading

Uh oh!

AmplabJenkins commented Aug 26, 2021

Uh oh!

srowen Aug 28, 2021

Uh oh!

srowen Aug 28, 2021

Uh oh!

yellowflash Aug 30, 2021 •

edited

Loading

Uh oh!

yellowflash commented Sep 3, 2021

Uh oh!

github-actions bot commented Dec 13, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[SPARK-36599][CORE] Fix the http class server to work again #33849

[SPARK-36599][CORE] Fix the http class server to work again #33849

Uh oh!

Conversation

yellowflash commented Aug 26, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AmplabJenkins commented Aug 26, 2021

Uh oh!

srowen Aug 28, 2021

Choose a reason for hiding this comment

Uh oh!

srowen Aug 28, 2021

Choose a reason for hiding this comment

Uh oh!

yellowflash Aug 30, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

yellowflash commented Sep 3, 2021

Uh oh!

github-actions bot commented Dec 13, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

yellowflash commented Aug 26, 2021 •

edited

Loading

yellowflash Aug 30, 2021 •

edited

Loading