[SPARK-14906][ML] Move VectorUDT and MatrixUDT in PySpark to new ML package #12870
|
Test build #57642 has finished for PR 12870 at commit
|
|
@viirya Thanks for this PR. I'll make a few high-level comments. We need to keep the old pyspark.mllib.linalg APIs. We also want to freeze the pyspark.mllib.linalg APIs, which means we need to keep that old code (not just alias it). Can you please modify the PR to do the following:
Thanks! |
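The difference between keeping a separate copy of the old code and merely aliasing it can be sketched in plain Python. This is an illustrative sketch only: the module names `demo_old_linalg`, `demo_new_aliased`, and `demo_new_copied` and the helper functions are hypothetical stand-ins, not the real PySpark packages. It shows that an aliased class is still the same object and still pickles under its original module path, whereas a copied class is independent and pickles under its own path.

```python
import pickle
import sys
import types


def make_module(name):
    """Create and register an empty in-memory module (hypothetical stand-in)."""
    module = types.ModuleType(name)
    sys.modules[name] = module
    return module


def make_vector_class(module):
    """Define an independent DenseVector class owned by the given module."""
    def __init__(self, values):
        self.values = list(values)

    cls = type(
        "DenseVector",
        (object,),
        {"__init__": __init__, "__module__": module.__name__},
    )
    module.DenseVector = cls
    return cls


# Stand-ins for the old package and two candidate layouts of the new package.
old_linalg = make_module("demo_old_linalg")
new_aliased = make_module("demo_new_aliased")
new_copied = make_module("demo_new_copied")

make_vector_class(old_linalg)

# Option 1: alias. The new name points at the very same class object.
new_aliased.DenseVector = old_linalg.DenseVector

# Option 2: copy. The new package owns a separate class definition.
make_vector_class(new_copied)

# Pickle records the module path stored on the class, not the name used to reach it.
aliased_payload = pickle.dumps(new_aliased.DenseVector([1.0, 2.0]))
copied_payload = pickle.dumps(new_copied.DenseVector([1.0, 2.0]))

print(new_aliased.DenseVector is old_linalg.DenseVector)
print(new_copied.DenseVector is old_linalg.DenseVector)
print(type(pickle.loads(aliased_payload)).__module__)
print(type(pickle.loads(copied_payload)).__module__)
```

Under these assumptions, only the copied layout lets the two packages evolve and serialize independently, which is the property a frozen API needs.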
|
Test build #57723 has finished for PR 12870 at commit
|
|
Test build #57725 has finished for PR 12870 at commit
|
|
@jkbradley Thanks for your comments. I want to confirm what freezing the pyspark.mllib.linalg APIs means here: do I need to change the existing PySpark ml/mllib code that uses the pyspark.mllib.linalg APIs to use the new pyspark.ml.linalg? I have just done that. If you meant we should not touch that code, I will revert the change. Thanks. |
|
Test build #57728 has finished for PR 12870 at commit
|
|
ping @jkbradley @dbtsai @mengxr |
…or-matrix-udt-3- Conflicts: python/pyspark/ml/tests.py python/pyspark/mllib/tests.py
|
Test build #58489 has started for PR 12870 at commit |
|
I will retrigger this build once maintenance is over. |
|
jenkins, test this please |
|
@viirya Sorry for the late response! The changes should be similar to mllib-local. The scope is not moving UDTs but copying the entire linalg package to |
|
Test build #58500 has finished for PR 12870 at commit
|
|
@mengxr Currently I copied only Vector/Matrix and their UDTs to Do you mean that we want to keep |
|
@mengxr @dbtsai Separating the pickle paths for I will submit another PR for this JIRA that only copies |
|
Ping @mengxr |
What changes were proposed in this pull request?
Move the VectorUDT and MatrixUDT in PySpark to new ML package.
How was this patch tested?
Existing tests.