[SPARK-7806][EC2] Fixes that allow the spark_ec2.py tool to run with Python3 #6336
|
The changes are pretty minimal honestly. I haven't tested 100% of the possible permutations. Much thanks go to Eric Jonas who turned us on to this project. |
|
Jenkins, this is ok to test. |
|
@nchammas is probably the right person to review this. Seems pretty straightforward to me. |
|
Test build #33302 has finished for PR 6336 at commit
|
Good catch.
Yeah, without this it passes a float to the start-instances call, which gets formatted like "1.0" and gets some XML barf back from Amazon.
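A rough illustration of the float problem described above, assuming the count comes from a division. This is not the actual diff from this PR; the function name `instances_per_zone` and the numbers are hypothetical. Under Python 3, `/` is true division and returns a float even when the operands divide evenly, while `//` keeps the result an integer on both Python 2 and 3.

```python
def instances_per_zone(total, num_zones, zone_index):
    """Hypothetical helper: split `total` instances across `num_zones`.

    Earlier zones absorb the remainder, so every count stays a whole number.
    """
    base = total // num_zones  # floor division: an int on Python 2 and Python 3
    extra = 1 if zone_index < total % num_zones else 0
    return base + extra


# True division yields a float under Python 3, even for an exact result.
float_count = 4 / 2
# Floor division keeps an int, so it formats without a trailing ".0".
int_count = 4 // 2

counts = [instances_per_zone(5, 2, i) for i in range(2)]

print(str(float_count), str(int_count), counts)
```

Formatting `float_count` into a request parameter is how a value like "1.0" can end up where an integer count is expected.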
|
Thanks for this @meawoppl. I left some minor comments. |
|
I left my responses inline. There don't seem to be any tests that get applied to this script that I can find. . . am I right about that? |
|
Nope, no tests apart from This patch LGTM. |
|
Cool. Do I need to do anything like sign a contribution agreement, or put something in an authors file? We here at 3scan are going to be making heavy use of spark, and I am happy to allocate some dev-time of myself and my team to get spark/Pyspark running really smoothly in ec2. The next problem I have in mind is the one pertaining to large-cluster startup time. I can't find the JIRA issue off hand, but I suspect we can architect something to improve that situation significantly. Please let me know if there are specific issues that might desire concerted effort. |
Nope, I think you're all set here.
Take a look at these issues:
I put in quite a bit of research into the problem of long launch times and would be more than happy to help you improve spark-ec2 in that area. Let's continue this discussion on the appropriate JIRAs. |
|
LGTM, merge this into master and 1.4 branch. |
…Python3 I have used this script to launch, destroy, start, and stop clusters successfully. Author: meawoppl <[email protected]> Closes #6336 from meawoppl/py3ec2spark and squashes the following commits: 2e87046 [meawoppl] Py3 compat fixes. (cherry picked from commit 8dbe777) Signed-off-by: Davies Liu <[email protected]>
|
Thanks guys! |
|
This script is also in a weird state, as it depends on the spark-mesos tooling as well as some external deps (compiled versions/binaries etc.), so its administration and updating doesn't have a single strong champion across projects. |
|
@meawoppl, there's a discussion of some of those issues on the spark-dev mailing list: http://mail-archives.apache.org/mod_mbox/incubator-spark-dev/201507.mbox/%3CCAOhmDzcnYgswssNP11VbGzSLisOKjGfnuMQMQc7yHiDL5SusmA%40mail.gmail.com%3E |
|
It doesn't appear there was ever a consensus reached there. My major
|
|
Should we chime in on that thread?
|
|
@meawoppl yes please |
|
Done.
|
I have used this script to launch, destroy, start, and stop clusters successfully.