-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-39863][BUILD] Upgrade Hadoop to 3.3.4 #37281
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
dongjoon-hyun
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice!
|
RC0 is actually gonna be cancelled but still worth testing it here. |
|
cancelled rc0 to get a later version of the reloadj4 in (XXE) and aws sdk upgrade. no direct security issue there, just that it depended on jackson, and, well.... |
|
btw, my build file for validating the rc can not only build spark, it can test the s3/azure/gcs stores through the release. |
|
Some tests are failing with the following error: Will take a look. cc @steveloughran . |
|
Looks OK now except one test failure which is unrelated to the Hadoop upgrade. |
dongjoon-hyun
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The python linter failure is irrelevant to this PR. Could you fill the PR description and convert to a normal PR, @sunchao ?
|
Sure @dongjoon-hyun . Updated. |
dongjoon-hyun
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1, LGTM. Thank you, @sunchao and @steveloughran .
Merged to master for Apache Spark 3.4.0.
| avro-mapred/1.11.0//avro-mapred-1.11.0.jar | ||
| avro/1.11.0//avro-1.11.0.jar | ||
| aws-java-sdk-bundle/1.11.1026//aws-java-sdk-bundle-1.11.1026.jar | ||
| aws-java-sdk-bundle/1.12.132//aws-java-sdk-bundle-1.12.132.jar |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@sunchao Oh, master branch found that this is inconsistent from the official Apache Hadoop 3.3.4.
The staging artifacts in this PR seems to be outdated somehow. Let me make a followup.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
HADOOP-18344 changes AWS SDK at Apache Hadoop 3.3.4 RC1. So, this RC0-based dependency file is not updated. I found that the dependency test is not triggered due to the Python Linter flaky failure.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we are at 1.12.262. there's a CVE out on the aws sdk transfer manager for releases < 1.12.261, which the s3a connector isn't exposed to (it's only for downloads through that class), but which other apps using the same sdk may be.
### What changes were proposed in this pull request? This PR aims to update the dependency manifest for Hadoop 3. [HADOOP-18344](https://issues.apache.org/jira/browse/HADOOP-18344) changes AWS SDK at Apache Hadoop 3.3.4 RC1. ### Why are the changes needed? #37281 missed this inconsistency. ### Does this PR introduce _any_ user-facing change? No. This will recover the dependency check CI job. ### How was this patch tested? Pass the CI on this job. Closes #37447 from dongjoon-hyun/SPARK-39863. Authored-by: Dongjoon Hyun <[email protected]> Signed-off-by: Dongjoon Hyun <[email protected]>
What changes were proposed in this pull request?
This PR aims to upgrade to Hadoop 3.3.4, which was just announced today.
Why are the changes needed?
Hadoop 3.3.4 comes with many bug fixes as well as CVE fixes. Please check release notes and change log.
Does this PR introduce any user-facing change?
No.
How was this patch tested?
Pass the CIs.