-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-44524][BUILD] Balancing pyspark-pandas-connect and pyspark-pandas-slow-connect GA testing time #42115
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
cc @HyukjinKwon @zhengruifeng FYI |
zhengruifeng
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM if CI pass
|
Maybe splitting into three is a stable choice. Let's try splitting into two first. Yesterday, splitting into three was successful. |
|
If dividing into two groups is still easy to fail, I do not oppose splitting into three groups :) |
If so, I suggest we'd better find better names for the groups (let's also take existing |
Yeah, let's probably go with #42146 way.
Ok, let me try again. |
|
I think it is fine to split not sure whether they are due to the same cause, but I also see It's kind of dangerous, since the GA recognized them as |
... |
|
@panbingkun yes, it seems the python packaging test is the blocker, please hold on this PR, let's fix it in #42146 first. |
|
@panbingkun after we disable packaging tests in |
Ok, let me turn this PR into the logic of rebalancing? 😄 |
yeah, many thanks! |
|
@HyukjinKwon @zhengruifeng @LuciferYang Line 22 in 748eaff
|
|
@panbingkun I see, what about fixing it after 3.5 is released? |
Okay, fine to me. |
|
@zhengruifeng |
Co-authored-by: Ruifeng Zheng <[email protected]>
|
|
||
| pyspark_pandas_slow_connect = Module( | ||
| name="pyspark-pandas-slow-connect", | ||
| dependencies=[pyspark_connect, pyspark_pandas, pyspark_pandas_slow], |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
should also set dependencies=[pyspark_connect, pyspark_pandas, pyspark_pandas_slow], in the above pyspark_pandas_connect module
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done
…das-slow-connect GA testing time
HyukjinKwon
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am good with this. cc @itholic, @ueshin and @xinrong-meng
|
thanks, merged to master |









What changes were proposed in this pull request?
The pr aims to balancing
pyspark-pandas-connectandpyspark-pandas-slow-connectGA testing time.Why are the changes needed?
After pr: #42146, the difference in testing time between
pyspark-pandas-connectandpyspark-pandas-slow-connectis a bit significant, which affects the overall running time. In order to make GA operation more efficient and stable.Does this PR introduce any user-facing change?
No.
How was this patch tested?