-
Notifications
You must be signed in to change notification settings - Fork 2.5k
Make HoodieDeltaStreamer hive-sync to CDH Hive Servers #563
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
86dfe6f to
efedb63
Compare
|
@n3nash: Ready for review. |
vinothchandar
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
okay makes sense. you just moved all the dependencies bundled from the profiles, directly into the pom..
| <artifactId>commons-configuration2</artifactId> | ||
| </dependency> | ||
|
|
||
| <dependency> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
the hive.version we use would be compatible with the spark's bundled hive? even without shading?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Without shading, there is compatibility issue with spark's bundled hive. So, clients need to use hoodie-spark-bundle (instead of hoodie-spark).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is to even use the DataSource api? Lets chat once f2f
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes. Sounds good.
|
@n3nash @vinothchandar : Can one of you merge this if this looks ok. |
|
LGTM, I can merge this @vinothchandar |
Issue : Hoodie Utilities bundle does not include (and shade) hive jars unlike hoodie spark bundle. I have made changes to fix that.
I have also removed the profile settings and added doc to show users how to compile against hadoop/hive versions that is used in their deployments. This turns out to be more reliable than making Apache Hive-1.1.1 client work in CDH runtime environment. By removing the profile settings, there is no reference of CDH in Hoodie mvn code.
Testing:
Hoodie supports 3 types of environment (Apache Hive 2.x, Apache Hive 1.x, CDH Hive ). The maven profile settings change affects only the CDH version. I have tested hive sync against CDH environment.
We are also currently testing Deltastreamer against CDH environment to ensure they work end to end.