-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-8200] [MLlib] Check for empty RDDs in StreamingLinearAlgorithm #6713
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@pparkkin please review https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark Among other things this needs an actual title. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Use !rdd.isEmpty as it will be notably faster.
|
Thanks for the quick review and comments! Sorry about not following the guide completely. I updated the PR title and description. I'll make the code changes when I get back to work tomorrow morning. |
|
Sorry, I merged another PR to fix the superfluous match construct here. You may need a rebase too now. |
e3e358f to
54ad89e
Compare
|
Done. |
|
OK to test |
|
Jenkins, test this please |
|
Test build #34587 has finished for PR 6713 at commit
|
|
Test build #894 timed out for PR 6713 at commit |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(no need for toString here but don't worry about it)
|
Is this ready -- just checking since there is [WIP] in the title. |
|
Changed title. |
Test cases for both StreamingLinearRegression and StreamingLogisticRegression, and code fix. Edit: This contribution is my original work and I license the work to the project under the project's open source license. Author: Paavo <[email protected]> Closes #6713 from pparkkin/streamingmodel-empty-rdd and squashes the following commits: ff5cd78 [Paavo] Update strings to use interpolation. db234cf [Paavo] Use !rdd.isEmpty. 54ad89e [Paavo] Test case for empty stream. 393e36f [Paavo] Ignore empty RDDs. 0bfc365 [Paavo] Test case for empty stream. (cherry picked from commit b928f54) Signed-off-by: Sean Owen <[email protected]>
Test cases for both StreamingLinearRegression and StreamingLogisticRegression, and code fix. Edit: This contribution is my original work and I license the work to the project under the project's open source license. Author: Paavo <[email protected]> Closes apache#6713 from pparkkin/streamingmodel-empty-rdd and squashes the following commits: ff5cd78 [Paavo] Update strings to use interpolation. db234cf [Paavo] Use !rdd.isEmpty. 54ad89e [Paavo] Test case for empty stream. 393e36f [Paavo] Ignore empty RDDs. 0bfc365 [Paavo] Test case for empty stream.
Test cases for both StreamingLinearRegression and StreamingLogisticRegression, and code fix.
Edit:
This contribution is my original work and I license the work to the project under the project's open source license.