Skip to content

Comments

[SPARK-28622][SQL][PYTHON] Rename PullOutPythonUDFInJoinCondition to ExtractPythonUDFFromJoinCondition and move to 'Extract Python UDFs'#25358

Closed
HyukjinKwon wants to merge 4 commits intoapache:masterfrom
HyukjinKwon:move-python-join-rule
Closed

[SPARK-28622][SQL][PYTHON] Rename PullOutPythonUDFInJoinCondition to ExtractPythonUDFFromJoinCondition and move to 'Extract Python UDFs'#25358
HyukjinKwon wants to merge 4 commits intoapache:masterfrom
HyukjinKwon:move-python-join-rule

Conversation

@HyukjinKwon
Copy link
Member

@HyukjinKwon HyukjinKwon commented Aug 5, 2019

What changes were proposed in this pull request?

This PR targets to rename PullOutPythonUDFInJoinCondition to ExtractPythonUDFFromJoinCondition and move to 'Extract Python UDFs' together with other Python UDF related rules.

Currently PullOutPythonUDFInJoinCondition rule is alone outside of other 'Extract Python UDFs' rules together.

and the name ExtractPythonUDFFromJoinCondition is matched to existing Python UDF extraction rules.

How was this patch tested?

Existing tests should cover.

@SparkQA

This comment has been minimized.

@HyukjinKwon HyukjinKwon changed the title [WIP] Move PullOutPythonUDFInJoinCondition rule into 'Extract Python UDFs' [SPARK-28622][SQL][PYTHON] Move PullOutPythonUDFInJoinCondition rule into 'Extract Python UDFs' Aug 5, 2019
@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

override def defaultBatches: Seq[Batch] = (preOptimizationBatches ++ super.defaultBatches :+
Batch("Optimize Metadata Only Query", Once, OptimizeMetadataOnlyQuery(catalog)) :+
Batch("Extract Python UDFs", Once,
PullOutPythonUDFInJoinCondition,
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Rename it to ExtractPythonUDFFromJoinCondition ?

Copy link
Member

@gatorsmile gatorsmile Aug 5, 2019

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A separate question. Do we have a test case that ensure the local predicate that only contains PythonUDF will be pushed through Join? cc @cloud-fan

Copy link
Contributor

@cloud-fan cloud-fan Aug 5, 2019

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not sure. maybe we can add one in ExtractPythonUDFsSuite

@HyukjinKwon HyukjinKwon changed the title [SPARK-28622][SQL][PYTHON] Move PullOutPythonUDFInJoinCondition rule into 'Extract Python UDFs' [SPARK-28622][SQL][PYTHON] Rename PullOutPythonUDFInJoinCondition to ExtractPythonUDFFromJoinCondition and move to 'Extract Python UDFs' Aug 5, 2019
@HyukjinKwon HyukjinKwon force-pushed the move-python-join-rule branch from 7408b3a to 4bc07bb Compare August 5, 2019 14:57
@SparkQA

This comment has been minimized.

@SparkQA
Copy link

SparkQA commented Aug 5, 2019

Test build #108673 has finished for PR 25358 at commit e6d22c8.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@gatorsmile
Copy link
Member

LGTM

Merged to master.

@gatorsmile gatorsmile closed this in bab88c4 Aug 6, 2019
@HyukjinKwon HyukjinKwon deleted the move-python-join-rule branch March 3, 2020 01:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants