[SPARK-29637][CORE] Add description to Job SHS web API #26295

gaborgsomogyi · 2019-10-29T14:45:43Z

Why are the changes needed?

Starting from Spark 2.3, the SHS REST API endpoint /applications/<app_id>/jobs/ is not including description in the JobData returned. This is not the case until Spark 2.2.

In this PR I've added the mentioned field.

Does this PR introduce any user-facing change?

Yes.

Old API response:

[ {
  "jobId" : 0,
  "name" : "foreach at <console>:26",
  "submissionTime" : "2019-10-28T12:41:54.301GMT",
  "completionTime" : "2019-10-28T12:41:54.731GMT",
  "stageIds" : [ 0 ],
  "jobGroup" : "test",
  "status" : "SUCCEEDED",
  "numTasks" : 1,
  "numActiveTasks" : 0,
  "numCompletedTasks" : 1,
  "numSkippedTasks" : 0,
  "numFailedTasks" : 0,
  "numKilledTasks" : 0,
  "numCompletedIndices" : 1,
  "numActiveStages" : 0,
  "numCompletedStages" : 1,
  "numSkippedStages" : 0,
  "numFailedStages" : 0,
  "killedTasksSummary" : { }
} ]

New API response:

[ {
  "jobId" : 0,
  "name" : "foreach at <console>:26",
  "description" : "job",                            <= This is the addition here
  "submissionTime" : "2019-10-28T13:37:24.107GMT",
  "completionTime" : "2019-10-28T13:37:24.613GMT",
  "stageIds" : [ 0 ],
  "jobGroup" : "test",
  "status" : "SUCCEEDED",
  "numTasks" : 1,
  "numActiveTasks" : 0,
  "numCompletedTasks" : 1,
  "numSkippedTasks" : 0,
  "numFailedTasks" : 0,
  "numKilledTasks" : 0,
  "numCompletedIndices" : 1,
  "numActiveStages" : 0,
  "numCompletedStages" : 1,
  "numSkippedStages" : 0,
  "numFailedStages" : 0,
  "killedTasksSummary" : { }
} ]

How was this patch tested?

Extended + existing unit tests.

Manually:

Open spark-shell

scala> sc.setJobGroup("test", "job", false); 
scala> val foo = sc.textFile("/user/foo.txt");
foo: org.apache.spark.rdd.RDD[String] = /user/foo.txt MapPartitionsRDD[1] at textFile at <console>:24
scala> foo.foreach(println);

Access REST API http://SHS-host:port/api/v1/applications/<app-id>/jobs/

gaborgsomogyi · 2019-10-29T14:48:46Z

I'm fully aware that in case of bugs we normally create a new test containing the jira ID. In this case I've found a test which can be extended and thought a new test would be an overkill. If one thinks it's still worth to add a separate test just bring it up and we can make the adjustment.

MaxGekk

~~In the Does this PR introduce any user-facing change? section, I don't see the difference between New API response and Old API response or maybe I missed something?~~

gaborgsomogyi · 2019-10-29T15:47:31Z

@MaxGekk "description" : "job",

gaborgsomogyi · 2019-10-29T15:51:58Z

I've added a marker in the JSON to highlight it.

SparkQA · 2019-10-29T17:08:12Z

Test build #112850 has finished for PR 26295 at commit 005c3e5.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

vanzin · 2019-10-29T19:00:32Z

Merging to master / 2.4.

Starting from Spark 2.3, the SHS REST API endpoint `/applications/<app_id>/jobs/` is not including `description` in the JobData returned. This is not the case until Spark 2.2. In this PR I've added the mentioned field. Yes. Old API response: ``` [ { "jobId" : 0, "name" : "foreach at <console>:26", "submissionTime" : "2019-10-28T12:41:54.301GMT", "completionTime" : "2019-10-28T12:41:54.731GMT", "stageIds" : [ 0 ], "jobGroup" : "test", "status" : "SUCCEEDED", "numTasks" : 1, "numActiveTasks" : 0, "numCompletedTasks" : 1, "numSkippedTasks" : 0, "numFailedTasks" : 0, "numKilledTasks" : 0, "numCompletedIndices" : 1, "numActiveStages" : 0, "numCompletedStages" : 1, "numSkippedStages" : 0, "numFailedStages" : 0, "killedTasksSummary" : { } } ] ``` New API response: ``` [ { "jobId" : 0, "name" : "foreach at <console>:26", "description" : "job", <= This is the addition here "submissionTime" : "2019-10-28T13:37:24.107GMT", "completionTime" : "2019-10-28T13:37:24.613GMT", "stageIds" : [ 0 ], "jobGroup" : "test", "status" : "SUCCEEDED", "numTasks" : 1, "numActiveTasks" : 0, "numCompletedTasks" : 1, "numSkippedTasks" : 0, "numFailedTasks" : 0, "numKilledTasks" : 0, "numCompletedIndices" : 1, "numActiveStages" : 0, "numCompletedStages" : 1, "numSkippedStages" : 0, "numFailedStages" : 0, "killedTasksSummary" : { } } ] ``` Extended + existing unit tests. Manually: * Open spark-shell ``` scala> sc.setJobGroup("test", "job", false); scala> val foo = sc.textFile("/user/foo.txt"); foo: org.apache.spark.rdd.RDD[String] = /user/foo.txt MapPartitionsRDD[1] at textFile at <console>:24 scala> foo.foreach(println); ``` * Access REST API `http://SHS-host:port/api/v1/applications/<app-id>/jobs/` Closes #26295 from gaborgsomogyi/SPARK-29637. Authored-by: Gabor Somogyi <[email protected]> Signed-off-by: Marcelo Vanzin <[email protected]> (cherry picked from commit 9c817a8) Signed-off-by: Marcelo Vanzin <[email protected]>

[SPARK-29637][CORE] Add description to Job SHS web API

005c3e5

MaxGekk reviewed Oct 29, 2019

View reviewed changes

MaxGekk approved these changes Oct 29, 2019

View reviewed changes

vanzin closed this in 9c817a8 Oct 29, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-29637][CORE] Add description to Job SHS web API #26295

[SPARK-29637][CORE] Add description to Job SHS web API #26295

Uh oh!

gaborgsomogyi commented Oct 29, 2019 •

edited

Loading

Uh oh!

gaborgsomogyi commented Oct 29, 2019

Uh oh!

MaxGekk left a comment •

edited

Loading

Uh oh!

gaborgsomogyi commented Oct 29, 2019

Uh oh!

gaborgsomogyi commented Oct 29, 2019

Uh oh!

SparkQA commented Oct 29, 2019

Uh oh!

vanzin commented Oct 29, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[SPARK-29637][CORE] Add description to Job SHS web API #26295

[SPARK-29637][CORE] Add description to Job SHS web API #26295

Uh oh!

Conversation

gaborgsomogyi commented Oct 29, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

gaborgsomogyi commented Oct 29, 2019

Uh oh!

MaxGekk left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gaborgsomogyi commented Oct 29, 2019

Uh oh!

gaborgsomogyi commented Oct 29, 2019

Uh oh!

SparkQA commented Oct 29, 2019

Uh oh!

vanzin commented Oct 29, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

gaborgsomogyi commented Oct 29, 2019 •

edited

Loading

MaxGekk left a comment •

edited

Loading