-
Notifications
You must be signed in to change notification settings - Fork 2.8k
[ZEPPELIN-572] pyspark interpreter doesn't work on yarn-client #605
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…ries to workers when master is yarn-client
|
+1 |
|
Tested, +1 |
|
LGTM |
1 similar comment
|
LGTM |
|
Although release branch 'branch-0.5.6' is created by https://issues.apache.org/jira/browse/ZEPPELIN-567, I think this change worth to apply 0.5.6. Shell we merge it into both master and 'branch-0.5.6'? |
|
If there're no more discussions, i'm merging it into 'branch-0.5.6' and 'master' |
|
Sure. |
|
Since 0.5.6-incubating rc1 is in vote alreay, i'm merging it into master only. we can merge it into branch-0.5.6 anytime we want. |
|
Looks great, thank you. It definitely belongs to 0.5.6, will merge it there. |
### What is this PR for? Set `spark.yarn.isPython` to be `true` to distribute pyspark libraries to workers when master is `yarn-client` ### What type of PR is it? Bug Fix ### Is there a relevant Jira issue? [ZEPPELIN-572](https://issues.apache.org/jira/browse/ZEPPELIN-572) ### How should this be tested? You need yarn cluster to test this PR. Simple way to test this PR would be running below code in paragraph and see if it throws error. ``` %pyspark print(sc.parallelize([1, 2]).count()) ``` And you should be able to see that `spark.yarn.isPython` is set to `true` in **Spark UI > Environment > Spark Properties** only when you set spark.master as `yarn-client`. ### Questions: * Does the licenses files need update? No * Is there breaking changes for older versions? No * Does this needs documentation? No Author: Mina Lee <[email protected]> Closes #605 from minahlee/ZEPPELIN-572 and squashes the following commits: 8c99de5 [Mina Lee] Set spark.yarn.isPython to be true to distribute needed pyspark libraries to workers when master is yarn-client
|
This PR have apply in |
|
@darionyaphet 0.5.6 includes this patch |
|
Hi @Leemoonsoo It seem not effective . I build Zeppelin using command : When I using yarl-client to startup pyspark job I found it's not work . |
|
Are you sure you are setting master to yarn-client? |
|
@felixcheung Yep , I'm using |
|
Hi all, Maybe it's better to add it in the documentation? Cheers, |
|
@H4ml3t Sorry for the confusion, I also confirmed that this commit is not included in 0.5.6 binary package by checking source code in http://www.apache.org/dyn/closer.cgi/incubator/zeppelin/0.5.6-incubating/zeppelin-0.5.6-incubating.tgz. Thanks for reporting, I will update the doc |
What is this PR for?
Set
spark.yarn.isPythonto betrueto distribute pyspark libraries to workers when master isyarn-clientWhat type of PR is it?
Bug Fix
Is there a relevant Jira issue?
ZEPPELIN-572
How should this be tested?
You need yarn cluster to test this PR.
Simple way to test this PR would be running below code in paragraph and see if it throws error.
And you should be able to see that
spark.yarn.isPythonis set totruein Spark UI > Environment > Spark Properties only when you set spark.master asyarn-client.Questions: