[SPARK-38821][PYTHON] Skip nsmall/nlarge nan test under pandas 1.4.[0,1,2] #36356

Yikun · 2022-04-26T07:52:06Z

What changes were proposed in this pull request?

Skip nsmall/nlarge nan test under pandas 1.4.[0,1,2].

Pandas get wrong results when np.nan in the sorting column since pandas-dev/pandas@16d2f59 (v1.4.0)

I confirmed this issue are fixed by:
pandas-dev/pandas@2886388

Why are the changes needed?

No

Does this PR introduce any user-facing change?

No

How was this patch tested?

CI passed

Yikun · 2022-04-26T07:55:47Z

python/pyspark/pandas/tests/test_dataframe.py

+        if not (LooseVersion("1.4.0") <= LooseVersion(pd.__version__) <= LooseVersion("1.4.2")):
+            self.assert_eq(psdf.nlargest(5, columns="a"), pdf.nlargest(5, columns="a"))
+            self.assert_eq(
+                psdf.nlargest(5, columns=["a", "b"]), pdf.nlargest(5, columns=["a", "b"])
+            )


If you still think I need to compare with real results rather than skip, I'd also like to change. We need to change index=np.random.rand(7) to a certain range, and construct a result df.

Because this is only failed with panda 1.4.0~1.4.2, so I thought skip is enough.

Yikun · 2022-04-26T09:16:38Z

cc @itholic @xinrong-databricks @HyukjinKwon

HyukjinKwon · 2022-04-26T10:14:10Z

Merged to master.

Skip nsmall/nlarge nan test under pandas 1.4.[0,1,2]

e639396

github-actions bot added CORE PYTHON labels Apr 26, 2022

Yikun commented Apr 26, 2022

View reviewed changes

HyukjinKwon approved these changes Apr 26, 2022

View reviewed changes

HyukjinKwon closed this in ac5ec64 Apr 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-38821][PYTHON] Skip nsmall/nlarge nan test under pandas 1.4.[0,1,2] #36356

[SPARK-38821][PYTHON] Skip nsmall/nlarge nan test under pandas 1.4.[0,1,2] #36356

Uh oh!

Yikun commented Apr 26, 2022

Uh oh!

Yikun Apr 26, 2022

Uh oh!

Yikun commented Apr 26, 2022

Uh oh!

HyukjinKwon commented Apr 26, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[SPARK-38821][PYTHON] Skip nsmall/nlarge nan test under pandas 1.4.[0,1,2] #36356

[SPARK-38821][PYTHON] Skip nsmall/nlarge nan test under pandas 1.4.[0,1,2] #36356

Uh oh!

Conversation

Yikun commented Apr 26, 2022

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

Yikun Apr 26, 2022

Choose a reason for hiding this comment

Uh oh!

Yikun commented Apr 26, 2022

Uh oh!

HyukjinKwon commented Apr 26, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants