@0xffmeta
Contributor

Add instructions for Hive on Tez.

@pvary
Contributor

pvary commented Jan 21, 2022

Since I am not a native English speaker, I would like @rdblue or @massdosage or someone to review the change.
The technical details are correct.
Thanks,
Peter


#### Hive on Tez configuration

To use Tez engine on Hive, Tez needs to be upgraded to >= `0.10.1` to contain the fix [Tez-4248](https://issues.apache.org/jira/browse/TEZ-4248).
Contributor

Suggested change
To use Tez engine on Hive, Tez needs to be upgraded to >= `0.10.1` to contain the fix [Tez-4248](https://issues.apache.org/jira/browse/TEZ-4248).
To use the Tez engine on Hive, Tez needs to be upgraded to >= `0.10.1` which contains a necessary fix [Tez-4248](https://issues.apache.org/jira/browse/TEZ-4248).

Contributor

Is the above for Hive >=3? If so, probably worth mentioning it. Does this also mean you need to override the Tez jar files that come with the standard Hive installation?

Contributor Author

According to Tez-4100, the Tez 0.10.1 release should work with Hadoop 3.1.3. For a standard Hive installation (where the default Tez version should be 0.9.2), the Tez jar files need to be patched. I'll add a comment about that.

To use Tez engine on Hive, Tez needs to be upgraded to >= `0.10.1` to contain the fix [Tez-4248](https://issues.apache.org/jira/browse/TEZ-4248).

!!! Warning
For Hive `2.3.x`, need to manual build from Tez `branch-0.9` for compatibility issue in Tez `0.10.1`.
Contributor

Suggested change
For Hive `2.3.x`, need to manual build from Tez `branch-0.9` for compatibility issue in Tez `0.10.1`.
For Hive `2.3.x`, you will need to manually build Tez from the `branch-0.9` branch due to a backwards incompatibility issue with Tez `0.10.1`.

Contributor

So if I understand correctly, Hive 2.x only works with Tez 0.9.x and not 0.10.x? And therefore one needs to compile and build a specific version of Tez oneself (from that branch?) and then override the version that comes with Hive 2.3.x?

Contributor Author

Correct.

!!! Warning
For Hive `2.3.x`, need to manual build from Tez `branch-0.9` for compatibility issue in Tez `0.10.1`.

And also set the hive config `tez.mrreader.config.update.properties=hive.io.file.readcolumn.names,hive.io.file.readcolumn.ids`.
Contributor

Suggested change
And also set the hive config `tez.mrreader.config.update.properties=hive.io.file.readcolumn.names,hive.io.file.readcolumn.ids`.
You will also need to set the following property in the Hive configuration: `tez.mrreader.config.update.properties=hive.io.file.readcolumn.names,hive.io.file.readcolumn.ids`.
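
For readers unsure what "set the following property in the Hive configuration" looks like in practice, here is a minimal sketch that renders the property as a Hadoop-style `<property>` element. It assumes the setting goes into `hive-site.xml`, which the thread does not state explicitly, and the helper name `render_property` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Property name and value are taken verbatim from the suggested doc change.
PROPERTY_NAME = "tez.mrreader.config.update.properties"
PROPERTY_VALUE = "hive.io.file.readcolumn.names,hive.io.file.readcolumn.ids"


def render_property(name: str, value: str) -> str:
    """Render one Hadoop-style <property> element as an XML string.

    Hypothetical helper for illustration; the result is meant to be pasted
    inside the <configuration> element of hive-site.xml (an assumption).
    """
    prop = ET.Element("property")
    ET.SubElement(prop, "name").text = name
    ET.SubElement(prop, "value").text = value
    return ET.tostring(prop, encoding="unicode")


print(render_property(PROPERTY_NAME, PROPERTY_VALUE))
```

The printed element would sit alongside the other `<property>` entries in the configuration file.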

@0xffmeta
Contributor Author

Hello @massdosage, thanks for the suggestion, I have changed the wording as suggested. Please take a look.


#### Hive on Tez configuration

To use the Tez engine on Hive(requires version >= `3.1.3`), Tez needs to be upgraded to >= `0.10.1` which contains a necessary fix [Tez-4248](https://issues.apache.org/jira/browse/TEZ-4248).
Contributor

There is no 3.1.3 release ATM. Did you mean 3.1.2?

Contributor Author

Yes. I just changed it to 3.1.2.

Contributor

@pvary pvary left a comment

LGTM, one final check anyone?


#### Hive on Tez configuration

To use the Tez engine on Hive(requires version >= `3.1.2`), Tez needs to be upgraded to >= `0.10.1` which contains a necessary fix [Tez-4248](https://issues.apache.org/jira/browse/TEZ-4248).
Collaborator

The wording is a tiny bit confusing to me. (requires version >= 3.1.2) would indicate to me that Tez can only be used with Hive 3. Shall we reword it like this?
To use the Tez engine on Hive 3.1.2 or later, ...
followed by:
To use the Tez engine on Hive 2.3.x, ...

Contributor Author

Yep. That would be much clearer.
Updated the PR. Please take another look.

Collaborator

Looks good now, thanks
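
The version rules settled on in this thread (Hive `3.1.2` or later needs Tez >= `0.10.1`; Hive `2.3.x` needs a Tez build from `branch-0.9`) can be written down as a small lookup. This is only a sketch of the discussion, not part of the merged docs: the function names are hypothetical, and Hive versions not mentioned in the review are deliberately rejected rather than guessed at.

```python
from typing import Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    """Turn a dotted version string such as '3.1.2' into a tuple of ints."""
    return tuple(int(part) for part in version.split("."))


def tez_requirement(hive_version: str) -> str:
    """Return the Tez requirement discussed in the review for a Hive version.

    Hypothetical helper: only the two cases covered by the thread are encoded.
    """
    parsed = parse_version(hive_version)
    if parsed >= (3, 1, 2):
        # Tez 0.10.1 contains the fix TEZ-4248.
        return "Tez >= 0.10.1 (includes TEZ-4248)"
    if parsed[:2] == (2, 3):
        # Tez 0.10.1 is not backwards compatible with Hive 2.3.x.
        return "Tez built manually from branch-0.9"
    raise ValueError(f"Hive {hive_version} is not covered by this discussion")


print(tez_requirement("3.1.2"))
print(tez_requirement("2.3.9"))
```

Raising for uncovered versions keeps the sketch from implying anything the reviewers did not say.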

@pvary pvary merged commit 1dcb74b into apache:master Jan 25, 2022
@pvary pvary changed the title from "add section to include instructions for hive on tez" to "Docs: Add section to include instructions for Hive on Tez" Jan 25, 2022
@0xffmeta
Contributor Author

Just wondering when this change will be reflected in the website documentation. :)

@pvary
Contributor

pvary commented Jan 26, 2022

I am not sure TBH.

The last discussion on the dev list about the docs site was this: https://lists.apache.org/thread/wq0qzbcwkjc13zp3j8mtkf8op1n5fn62

I am not sure what the status is, but if you have some time, it would be good to check it out.

@massdosage
Contributor

Sorry, I only had a chance to look at it today but I see it's already been approved. LGTM too!

@pvary
Contributor

pvary commented Jan 26, 2022

Thanks for your time @0xffmeta, @massdosage and @marton-bod!
