Skip to content

Conversation

@tinhto-000
Copy link
Contributor

What changes were proposed in this pull request?

  1. Added missing match case to SparkUncaughtExceptionHandler, so that it would not halt the process when the exception doesn't match any of the match case statements.
  2. Added log message before halting process. During debugging it wasn't obvious why the Worker process would DEAD (until we set SPARK_NO_DAEMONIZE=1) due to the shell-scripts puts the process into background and essentially absorbs the exit code.
  3. Added SparkUncaughtExceptionHandlerSuite. Basically we create a Spark exception-throwing application with SparkUncaughtExceptionHandler and then check its exit code.

Why are the changes needed?

SPARK-30310, because the process would halt unexpectedly.

How was this patch tested?

All unit tests (mvn test) were ran and OK.

SparkUncaughtExceptionHandlerSuite had wrong name
use toBoolean/toString for converting string to/from boolean
exception throwers now check args.length before using args(0)
@SparkQA
Copy link

SparkQA commented Dec 24, 2019

Test build #4975 has finished for PR 26955 at commit 724e254.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@srowen
Copy link
Member

srowen commented Dec 26, 2019

Jenkins retest this please

@SparkQA
Copy link

SparkQA commented Dec 26, 2019

Test build #115817 has finished for PR 26955 at commit 724e254.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HeartSaVioR
Copy link
Contributor

cc. @gatorsmile @zsxwing

@srowen
Copy link
Member

srowen commented Dec 31, 2019

Ping @tinhto-000

Copy link
Contributor

@HeartSaVioR HeartSaVioR left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM except a nit

@HeartSaVioR
Copy link
Contributor

@srowen Would you mind if I ask about revisiting this? Maybe with "ok to test". Thanks in advance!

@srowen
Copy link
Member

srowen commented Jan 9, 2020

Jenkins, test this please

@SparkQA
Copy link

SparkQA commented Jan 10, 2020

Test build #116422 has finished for PR 26955 at commit 3f9e96f.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@srowen
Copy link
Member

srowen commented Jan 10, 2020

Jenkins, test this please

@HeartSaVioR
Copy link
Contributor

HeartSaVioR commented Jan 10, 2020

It's known failure failing consistently from yesterday. I pinged @HyukjinKwon to kindly ask about taking a look (as I don't have a context on pyspark) but it seems to be better to disable the test first as multiple PRs are affected.

@HyukjinKwon
Copy link
Member

Sorry I missed pings @HeartSaVioR - I am currently stuck at some works :(.

@HeartSaVioR
Copy link
Contributor

HeartSaVioR commented Jan 10, 2020

Oh OK. Never mind about missing ping. What about disabling the test and file an issue then? I've submitted #27158 to disable the test.

@SparkQA
Copy link

SparkQA commented Jan 10, 2020

Test build #116438 has finished for PR 26955 at commit 3f9e96f.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

dbtsai pushed a commit to dbtsai/spark that referenced this pull request Jan 10, 2020
### What changes were proposed in this pull request?

This patch increases the memory limit in the test 'test_memory_limit' from 1m to 8m.
Credit to srowen and HyukjinKwon to provide the idea of suspicion and guide how to fix.

### Why are the changes needed?

We observed consistent Pyspark test failures on multiple PRs (apache#26955, apache#26201, apache#27064) which block the PR builds whenever the test is included.

### Does this PR introduce any user-facing change?

No.

### How was this patch tested?

Jenkins builds passed in WIP PR (apache#27159)

Closes apache#27162 from HeartSaVioR/SPARK-30480.

Authored-by: Jungtaek Lim (HeartSaVioR) <[email protected]>
Signed-off-by: HyukjinKwon <[email protected]>
@HyukjinKwon
Copy link
Member

retest this please

@SparkQA
Copy link

SparkQA commented Jan 10, 2020

Test build #116454 has finished for PR 26955 at commit 3f9e96f.

  • This patch fails due to an unknown error code, -9.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HeartSaVioR
Copy link
Contributor

retest this, please

@srowen
Copy link
Member

srowen commented Jan 13, 2020

Jenkins, retest this please

@SparkQA
Copy link

SparkQA commented Jan 13, 2020

Test build #116645 has finished for PR 26955 at commit 3f9e96f.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@srowen
Copy link
Member

srowen commented Jan 17, 2020

Merged to master

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants