-
Notifications
You must be signed in to change notification settings - Fork 2.5k
[MINOR] Support column type evolution for Hive #7129
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
|
@xiarixiaoyao please help to review. |
| </dependency> | ||
|
|
||
| <!-- Hive --> | ||
| <dependency> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why we need hive dependency here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
changed. now dont need to add hive dependency
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is still showing up. Can you remove this dependency as the change is in hudi-hadoop-mr package and hive-exec is already defined as dependency in hudi-hadoop-mr /pom.xml
| @UseFileSplitsFromInputFormat | ||
| public class HoodieParquetInputFormat extends HoodieParquetInputFormatBase { | ||
| public class HoodieParquetInputFormat extends HoodieParquetInputFormatBase implements | ||
| SelfDescribingInputFormatInterface { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it will be better to implements SelfDescribingInputFormatInterface by HoodieParquetInputFormatBase
let's move this change to HoodieParquetInputFormatBase
public abstract class HoodieParquetInputFormatBase extends MapredParquetInputFormat implements Configurable, SelfDescribingInputFormatInterface
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok. has changed
leesf
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, Please the CI failure and fix
|
Because of SelfDescribingInputFormatInterface not exist in hive 1.x, so i have to add dependency in hudi-spark-common. Otherwise, the SelfDescribingInputFormatInterface class cannot be found if Hive 1.x is used during hudi-spark-common compilation. |
|
@hudi-bot run azure |
|
@hudi-bot run azure |
|
@hudi-bot run azure |
|
LGTM as well, though I am still curious to understand in detail how does this interface |
In hive FetechOpertor, will judge needConversion by the inputformot whether implement SelfDescribingInputFormatInterface |
| <artifactId>hive-common</artifactId> | ||
| <version>${hive.version}</version> | ||
| </dependency> | ||
| <dependency> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you remove this dependency as the change is in hudi-hadoop-mr package and hive-exec is already defined as dependency in hudi-hadoop-mr /pom.xml
| </dependency> | ||
|
|
||
| <!-- Hive --> | ||
| <dependency> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is still showing up. Can you remove this dependency as the change is in hudi-hadoop-mr package and hive-exec is already defined as dependency in hudi-hadoop-mr /pom.xml
|
@fsilent : Thanks a lot for the PR. Can you fix the package dependencies and also add a jira ticket in the description. |
yihua
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@bvaradar is this change still needed?

Change Logs
#6989 cannot support column type evolution.
class that implements SelfDescribingInputFormatInterface,
hive will try to do colum type evolution by itself
Impact
support type change
Risk level (write none, low medium or high below)
low.
Documentation Update
N/A
Contributor's checklist