-
Notifications
You must be signed in to change notification settings - Fork 29.1k
[MESOS][SPARK-8873] Clean up external shuffle data on driver exit with Mesos. #7820
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
fb11a33 to
1910bd5
Compare
|
Test build #39186 has finished for PR 7820 at commit
|
|
Test build #39188 has finished for PR 7820 at commit
|
|
retest this please |
|
Test build #39281 has finished for PR 7820 at commit
|
|
retest this please |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
private[spark]
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
also please add a simple java doc, even something like A message sent from the driver to register with this shuffle service. would do.
|
@tnachen looks great. My comments are mostly minor but I think this is very close. |
|
Test build #39344 has finished for PR 7820 at commit
|
1910bd5 to
e0f963b
Compare
|
Test build #39419 has finished for PR 7820 at commit
|
e0f963b to
aca93da
Compare
|
Test build #39421 has finished for PR 7820 at commit
|
aca93da to
fadff89
Compare
|
Test build #39434 has finished for PR 7820 at commit
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This interface is confusing: we pass in both a SparkConf and a TransportConf, even though one is created from another one.
|
@tnachen Thanks for addressing the comments quickly. The latest changes need a bit of clean up, however. Have you had a chance to test the latest changes on a real cluster and verify that the shuffle files did get cleaned up? |
|
Yes I tested with a single master and slave and it worked.
|
|
Can you close this one now that #7881 is already merged? |
…ce is used This patch builds directly on #7820, which is largely written by tnachen. The only addition is one commit for cleaning up the code. There should be no functional differences between this and #7820. Author: Timothy Chen <[email protected]> Author: Andrew Or <[email protected]> Closes #7881 from andrewor14/tim-cleanup-mesos-shuffle and squashes the following commits: 8894f7d [Andrew Or] Clean up code 2a5fa10 [Andrew Or] Merge branch 'mesos_shuffle_clean' of github.com:tnachen/spark into tim-cleanup-mesos-shuffle fadff89 [Timothy Chen] Address comments. e4d0f1d [Timothy Chen] Clean up external shuffle data on driver exit with Mesos.
|
This is already merged. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we still need the old one, start-shuffle-service.sh? It was only used by Mesos as far as I remember.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we need to keep it for backward compatibility unfortunately.
@andrewor14 @dragos