-
Notifications
You must be signed in to change notification settings - Fork 9.2k
HADOOP-19019: Parallel Maven Build Support for Apache Hadoop #6373
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
💔 -1 overall
This message was automatically generated. |
|
💔 -1 overall
This message was automatically generated. |
|
@Hexiaoqiao Could you help review this pr? |
|
@JiaLiangC Thanks for your work and involve me here. It is very interesting improvement. I want to know if any time cost save when change to parallel build. Another side, beside hadoop-yarn module, any other modules need to set dependency explicitly? Thanks again. |
|
@Hexiaoqiao |
|
Great! Thanks @JiaLiangC , Let's wait if anymore folks would like to give another review here. |
|
💔 -1 overall
This message was automatically generated. |
steveloughran
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1 from me.
Your compile time is still pretty slow; does this include the test runs, or is it the hadoop client and javadocs taking the time?
|
Who is going to merge this? @Hexiaoqiao? |
|
If no more other concerns, I will check this PR into trunk for a short while. @steveloughran |
Hexiaoqiao
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. +1.
|
Committed to trunk. Thanks @JiaLiangC and @steveloughran . |
…6373). Contributed by JiaLiangC. Signed-off-by: Steve Loughran <[email protected]> Signed-off-by: He Xiaoqiao <[email protected]>
…6373). Contributed by JiaLiangC. Signed-off-by: Steve Loughran <[email protected]> Signed-off-by: He Xiaoqiao <[email protected]>
…6373). Contributed by JiaLiangC. Signed-off-by: Steve Loughran <[email protected]> Signed-off-by: He Xiaoqiao <[email protected]>

Description of PR
https://issues.apache.org/jira/browse/HDFS-17287
Here's the translation of the Hadoop PR description into English:
Hadoop Parallel Compilation Submission Logic
Reasons for Parallel Compilation Failure
But when Hadoop compiles in parallel, for example, compiling
hadoop-yarn-project, the dependencies between modules are correct. The issue arises during the dist package stage.distpackages all other compiled modules.Behavior of
hadoop-yarn-projectin Serial Compilation:hadoop-yarn-project. During theprepare-packagestage, themaven-assembly-pluginplugin is executed for packaging. All packages are repackaged according to the description inhadoop-assemblies/src/main/resources/assemblies/hadoop-yarn-dist.xml.Behavior of
hadoop-yarn-projectin Parallel Compilation:dependency, they are compiled in parallel. According to the dependency definition in the pom ofhadoop-yarn-project, the dependencies are compiled first, followed byhadoop-yarn-project, executing itsmaven-assembly-plugin.hadoop-assemblies/src/main/resources/assemblies/hadoop-yarn-dist.xmlare not all included in thedependencyofhadoop-yarn-project. Therefore, when compilinghadoop-yarn-projectand executingmaven-assembly-plugin, not all required modules are built yet, leading to errors in parallel compilation.Solution:
hadoop-assemblies/src/main/resources/assemblies/hadoop-yarn-dist.xml, and then declare them as dependencies in the pom ofhadoop-yarn-project.How was this patch tested?
manual test on centos8

For code changes:
LICENSE,LICENSE-binary,NOTICE-binaryfiles?