-
Notifications
You must be signed in to change notification settings - Fork 2.5k
[HUDI-1586] [Common Core] [Flink Integration] Reduce the coupling of hadoop. #2540
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
178f13b
2a2da2a
9ef4f4a
2e45430
326399f
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -144,6 +144,7 @@ | |
| <artifactId>*</artifactId> | ||
| </exclusion> | ||
| </exclusions> | ||
| <scope>provided</scope> | ||
| </dependency> | ||
| <dependency> | ||
| <groupId>org.apache.hadoop</groupId> | ||
|
|
@@ -154,6 +155,7 @@ | |
| <dependency> | ||
| <groupId>org.apache.hadoop</groupId> | ||
| <artifactId>hadoop-hdfs</artifactId> | ||
| <scope>provided</scope> | ||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. wondering if the parent pom scope is already
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. +1, I think we can remove all the unnecessary Hadoop dependency from the sub-modules since they already exist in the parent pom. Do you mind removing those in this PR @Zhangchaoming ?
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @garyli1019 My pleasure. |
||
| </dependency> | ||
| <dependency> | ||
| <groupId>org.apache.hadoop</groupId> | ||
|
|
||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -80,9 +80,6 @@ public class FSUtils { | |
| private static final PathFilter ALLOW_ALL_FILTER = file -> true; | ||
|
|
||
| public static Configuration prepareHadoopConf(Configuration conf) { | ||
| conf.set("fs.hdfs.impl", org.apache.hadoop.hdfs.DistributedFileSystem.class.getName()); | ||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. this is some old old code. Have you ensured that this can run on a non HDFS filesystem now? (HDFS is testing in unit and integration tests already)
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @vinothchandar On my machine, the implementation of filesyetem is
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Just to be clear. This code basically says, if you use
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I wanna express that this hard code will ignore users configuration. |
||
| conf.set("fs.file.impl", org.apache.hadoop.fs.LocalFileSystem.class.getName()); | ||
|
|
||
| // look for all properties, prefixed to be picked up | ||
| for (Entry<String, String> prop : System.getenv().entrySet()) { | ||
| if (prop.getKey().startsWith(HOODIE_ENV_PROPS_PREFIX)) { | ||
|
|
@@ -607,8 +604,8 @@ public static HoodieWrapperFileSystem getFs(String path, SerializableConfigurati | |
| * Helper to filter out paths under metadata folder when running fs.globStatus. | ||
| * @param fs File System | ||
| * @param globPath Glob Path | ||
| * @return | ||
| * @throws IOException | ||
| * @return the file status list of globPath exclude the meta folder | ||
| * @throws IOException when having trouble listing the path | ||
| */ | ||
| public static List<FileStatus> getGlobStatusExcludingMetaFolder(FileSystem fs, Path globPath) throws IOException { | ||
| FileStatus[] statuses = fs.globStatus(globPath); | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I guess this should be ok. @n3nash agree?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think so. We explicitly whitelist in bundles anwyay, so does not matter