Skip to content

Conversation

@dongjoon-hyun
Copy link
Member

@dongjoon-hyun dongjoon-hyun commented Jun 18, 2022

What changes were proposed in this pull request?

This PR aims to use IPv6 between Spark and Python Daemon in IPv6-only system.

Unlike spark-shell, pyspark starts Python shell and java-gateway first.
We need a new environment variable, SPARK_PREFER_IPV6=True in pyspark shell, like the following.

SPARK_PREFER_IPV6=True bin/pyspark --driver-java-options=-Djava.net.preferIPv6Addresses=true

Why are the changes needed?

Currently, PySpark uses 127.0.0.1 for inter-communication between Python Daemon and JVM.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Pass the CIs.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is #36905 and I'll rebase this PR after merging #36905.

@dongjoon-hyun dongjoon-hyun changed the title [SPARK-39508][CORE][PYTHON] Use IPv6 between JVM and Python Daemon in PySpark [SPARK-39508][CORE][PYTHON] Support IPv6 between JVM and Python Daemon in PySpark Jun 18, 2022
@dongjoon-hyun
Copy link
Member Author

Rebased to the master in order to bring #36905 .

@dongjoon-hyun
Copy link
Member Author

Hi, @HyukjinKwon . How do you think about this PR? This is a preparation for PySpark UTs.

@HyukjinKwon
Copy link
Member

Will review this today or tomorrow.

@HyukjinKwon
Copy link
Member

Merged to master.

@dongjoon-hyun
Copy link
Member Author

Thank you so much, @HyukjinKwon !

@dongjoon-hyun dongjoon-hyun deleted the SPARK-39508 branch June 20, 2022 03:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants