-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-32972][ML] Pass all UTs of mllib module in Scala 2.13
#29857
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Test build #129054 has finished for PR 29857 at commit
|
|
cc @dongjoon-hyun #29861 fix GitHub 2.13 build Action, related to k8s module, I will rebase this pr after it merged. |
|
cc @srowen The remaining failed case is spark/mllib/src/test/scala/org/apache/spark/ml/regression/RandomForestRegressorSuite.scala Lines 171 to 200 in 0bc0e91
Input failed. I found that the following spark/mllib/src/main/scala/org/apache/spark/ml/tree/impl/RandomForest.scala Lines 191 to 215 in 0bc0e91
but I am not familiar with this algorithm and I not find root cause, I think we need an expert to guide how to fix it |
|
spark/mllib/src/test/scala/org/apache/spark/ml/regression/RandomForestRegressorSuite.scala Lines 45 to 54 in 0bc0e91
And I found if we change |
|
@LuciferYang . I closed your #29861 because |
|
You need to rebase this branch . |
|
@dongjoon-hyun Address 4f5eac5 rebase master |
|
Test build #129065 has finished for PR 29857 at commit
|
|
For the random forest test, it's probably reasonably to simply change the tol in the last test to 0.75 like the others. I don't know why it should be higher. If you like you can try making the ordering of the Map in this code deterministic to see if that does it: But, I don't even know if the result the test is complaining about is 'wrong'. |
OK ~ I will try this first and feedback later. But change 0.78 to 0.75 seems simpler, haha :) |
|
Synchronize the test result: spark/mllib/src/main/scala/org/apache/spark/ml/tree/impl/RandomForest.scala Lines 194 to 202 in 0bc0e91
Which do you recommend better @srowen , B or C, B can passed all test case. |
|
I think the latter option, changing the tol, is fine. |
|
ok ~ |
|
Address f2a26c5 fix RandomForestRegressorSuite. |
mllib module in Scala 2.13 except RandomForestRegressorSuitemllib module in Scala 2.13
|
Kubernetes integration test starting |
|
Kubernetes integration test status success |
|
Test build #129109 has finished for PR 29857 at commit
|
|
@srowen Is there any other problem in this pr that needs to be fixed?thx ~ |
|
From the first round of code checking, may be this is the last module to fix Scala version compatibility |
|
Merged to master |
|
thx @srowen ~ |
|
@dongjoon-hyun Should we consider adding some new GitHubActions to check test in Scala 2.13? |
What changes were proposed in this pull request?
The purpose of this pr is to resolve SPARK-32972, total of 51 Scala failed test cases and 3 Java failed test cases were fixed, the main change of this pr as follow:
Specified
Seqtoscala.collection.Seqin case matchSeqscene andx.asInstanceOf[Seq[T]]sceneUse
Row.getSeq[T]instead ofRow.getAs[Seq]Manual call
toMapmethod to convertMapViewtoMapin Scala 2.13Change the tol in the last test to 0.75 to pass
RandomForestRegressorSuite#training with sample weightsin Scala 2.13Why are the changes needed?
We need to support a Scala 2.13 build.
Does this PR introduce any user-facing change?
No
How was this patch tested?
Scala 2.12: Pass the Jenkins or GitHub Action
Scala 2.13: Pass GitHub 2.13 Build Action
Do the follow:
Before
After