-
Notifications
You must be signed in to change notification settings - Fork 1.5k
PARQUET-2158: Upgrade Hadoop dependency to version 3.2.0 #976
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -76,7 +76,7 @@ | |
| <jackson-databind.version>2.13.2.2</jackson-databind.version> | ||
| <japicmp.version>0.14.2</japicmp.version> | ||
| <shade.prefix>shaded.parquet</shade.prefix> | ||
| <hadoop.version>2.10.1</hadoop.version> | ||
| <hadoop.version>3.2.0</hadoop.version> | ||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. hmm why 3.2.0, not 3.3.1/3.3.2?
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. +1 for the question
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I was being unambitious. move to this, the oldest 3.x release working on java11 ensures that anything else on a version >= to this should link properly. if you do want to be more current, well, spark is on 3.3.3, hive is trying to move to 3.3.x and I will be doing a 3.3.4 release in a week's time, which is just some security changes mostly of relevance to servers |
||
| <parquet.format.version>2.9.0</parquet.format.version> | ||
| <previous.version>1.12.0</previous.version> | ||
| <thrift.executable>thrift</thrift.executable> | ||
|
|
@@ -519,6 +519,8 @@ | |
| change to fix a integer overflow issue. | ||
| TODO: remove this after Parquet 1.13 release --> | ||
| <exclude>org.apache.parquet.column.values.dictionary.DictionaryValuesWriter#dictionaryByteSize</exclude> | ||
| <!-- In PARQUET-2158 the return type of PathGlobPattern was changed to be compatible with Hadoop 3 --> | ||
| <exclude>org.apache.parquet.thrift.projection.deprecated.PathGlobPattern</exclude> | ||
| </excludes> | ||
| </parameter> | ||
| </configuration> | ||
|
|
||
Uh oh!
There was an error while loading. Please reload this page.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this may not work for projects like Spark who are using Hadoop shaded client, since the
GlobPattern.compiledis relocated toorg.apache.hadoop.shaded.com.google.re2j.Pattern.It might be easier to just remove the class as it has been marked as deprecated since Parquet 1.8.0, 2015. It is also not used anywhere in the project.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1 for cutting. i will update the patch