[SPARK-28243][PYSPARK][ML] Remove setFeatureSubsetStrategy and setSubsamplingRate from Python TreeEnsembleParams#25046
[SPARK-28243][PYSPARK][ML] Remove setFeatureSubsetStrategy and setSubsamplingRate from Python TreeEnsembleParams#25046huaxingao wants to merge 2 commits intoapache:masterfrom
Conversation
…amplingRate from Python TreeEnsembleParams
|
Test build #107187 has finished for PR 25046 at commit
|
|
@huaxingao I think you're probably right on this but can you remind us here why you also remove subsampling rate? the feature strategy setter is still on the Scala side; is it also meant to just move rather than go away? |
|
@srowen Sorry I didn't make it clear in the PR description. On Scala side, initially, both In this PR, I did the same thing on python side, I moved |
|
OK this is really a follow up of 4aa9ccb#diff-6b8a041f558af2b7bc50d930b1ad2670 then. I wonder if we missed any other setters that were removed in Scala? but this seems OK. CC @mgaido91 FYI |
|
I initially did this #21413 so I know I marked this I will also move all the other deprecated setters in 4aa9ccb#diff-6b8a041f558af2b7bc50d930b1ad2670 too. Do you prefer me to do it in this PR or have a separate PR? @srowen |
|
You can do it here if it's also just the Pyspark part of the change for consistency. Thanks! |
|
thanks for checking! Yes, it would be great to check them all. Thanks. |
|
I made modifications in python to match the changes in 4aa9ccb#diff-6b8a041f558af2b7bc50d930b1ad2670. However, I didn't move the following 4 setters: The reason that I didn't change these 4 setters is because besides |
|
Test build #107865 has finished for PR 25046 at commit
|
|
Merged to master |
…samplingRate from Python TreeEnsembleParams ## What changes were proposed in this pull request? Remove deprecated setFeatureSubsetStrategy and setSubsamplingRate from Python TreeEnsembleParams ## How was this patch tested? Use existing tests. Closes apache#25046 from huaxingao/spark-28243. Authored-by: Huaxin Gao <huaxing@us.ibm.com> Signed-off-by: Sean Owen <sean.owen@databricks.com>
|
@huaxingao @srowen @mgaido91 This is caused by that the There are too many design conflicts between the class hierarchy of scala and py, it's too confusing that can not be maintained easily. Maybe it is time to re-org the py side to keep it in line the scala side. I found this when I'm adding |
Yes, I do agree with you. we could also think to have a script which generates both APIs, in order to be sure that they are in sync. WDYT? |
|
Oh, hm, OK. I am not even sure if |
|
I will modify |
|
@mgaido91 It is a good idea. I think we may start with some script that only check the parity. @srowen @huaxingao I tend to change this part in #25383, maybe by moving |
What changes were proposed in this pull request?
Remove deprecated setFeatureSubsetStrategy and setSubsamplingRate from Python TreeEnsembleParams
How was this patch tested?
Use existing tests.