-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-6511] [Documentation] Explain how to use Hadoop provided builds #6729
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
This provides preliminary documentation pointing out how to use the Hadoop free builds. I am hoping over time this list can grow to include most of the popular Hadoop distributions. Getting more people using these builds will help us long term reduce the number of binaries we build.
|
To the extent of my knowledge, this is correct. More docs are good. |
|
Looks good. You can start going into more and more distro-specific details here if you want, but I'm not sure this document should cover those details. |
|
@vanzin Any specific details in mind? Personally I don't mind having details for different distros. My feeling is if people can't get these hadoop-free builds running easily, people won't use them. So it might be good to include all pertinent information. |
|
To be fair, any extra configuration also applies to the other builds, not just for the "hadoop-free" one. Each distro may have native libraries in different places, for example, so you may need to configure those paths appropriately. (A quick look at a CDH config seems to show that's mostly it, aside from what you already documented. But I'm not familiar with other distros.) |
|
Ah okay - I think it could be good to add those too over time. For instance the MapR one probably wont' work at all unless some native libraries are added since their FS client depends on native code. But sounds like people are okay with this as a first step. |
|
Test build #34529 has finished for PR 6729 at commit
|
|
Jenkins, retest this please. |
|
Test build #34533 has finished for PR 6729 at commit
|
This provides preliminary documentation pointing out how to use the Hadoop free builds. I am hoping over time this list can grow to include most of the popular Hadoop distributions. Getting more people using these builds will help us long term reduce the number of binaries we build. Author: Patrick Wendell <[email protected]> Closes #6729 from pwendell/hadoop-provided and squashes the following commits: 1113b76 [Patrick Wendell] [SPARK-6511] [Documentation] Explain how to use Hadoop provided builds (cherry picked from commit 6e4fb0c) Signed-off-by: Patrick Wendell <[email protected]>
This provides preliminary documentation pointing out how to use the Hadoop free builds. I am hoping over time this list can grow to include most of the popular Hadoop distributions. Getting more people using these builds will help us long term reduce the number of binaries we build. Author: Patrick Wendell <[email protected]> Closes apache#6729 from pwendell/hadoop-provided and squashes the following commits: 1113b76 [Patrick Wendell] [SPARK-6511] [Documentation] Explain how to use Hadoop provided builds
This provides preliminary documentation pointing out how to use the
Hadoop free builds. I am hoping over time this list can grow to
include most of the popular Hadoop distributions.
Getting more people using these builds will help us long term reduce
the number of binaries we build.