
Conversation

@Paddy0523 (Contributor)

Change Logs

Fixed hadoop configuration not being applied by org.apache.hudi.source.FileIndex

Impact

FileIndex uses the static HoodieFlinkEngineContext.DEFAULT to resolve partition paths, so the information in the job configuration is never applied.
Since I was connecting to a remote Hadoop cluster, I subsequently got the following error due to the missing configuration:
[error screenshot not preserved]

Risk level (write none, low medium or high below)

none

Documentation Update

none

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

@Paddy0523 (Contributor Author)

PTAL @danny0405

  String[] partitions = getOrBuildPartitionPaths().stream()
      .map(p -> fullPartitionPath(path, p))
      .toArray(String[]::new);
  - FileStatus[] allFiles = FSUtils.getFilesInPartitions(HoodieFlinkEngineContext.DEFAULT, metadataConfig, path.toString(), partitions);
  + FileStatus[] allFiles = FSUtils.getFilesInPartitions(hoodieFlinkEngineContext, metadataConfig, path.toString(), partitions);
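The gist of the change can be illustrated without Hudi's classes. In the sketch below (hypothetical names; a plain `Properties` stands in for Hadoop's `Configuration`), a static DEFAULT context built from an empty config cannot see job-level options such as a remote `fs.defaultFS`, while a context constructed from the job's config can:

```java
import java.util.Properties;

// Hypothetical stand-in for an engine context that carries a Hadoop-style config.
class EngineContext {
    // A shared default built from an empty config -- the pre-fix behavior.
    static final EngineContext DEFAULT = new EngineContext(new Properties());

    final Properties hadoopConf;

    EngineContext(Properties hadoopConf) {
        this.hadoopConf = hadoopConf;
    }

    String fsDefaultFs() {
        // Falls back to a local FS when the option was never applied.
        return hadoopConf.getProperty("fs.defaultFS", "file:///");
    }
}

public class ConfigPropagation {
    public static void main(String[] args) {
        // Job-level options, e.g. pointing at a remote HDFS namenode.
        Properties jobConf = new Properties();
        jobConf.setProperty("fs.defaultFS", "hdfs://remote-cluster:8020");

        // Pre-fix: the static DEFAULT ignores the job config entirely.
        System.out.println(EngineContext.DEFAULT.fsDefaultFs()); // file:///

        // Post-fix: build the context from the job's config instead.
        EngineContext ctx = new EngineContext(jobConf);
        System.out.println(ctx.fsDefaultFs()); // hdfs://remote-cluster:8020
    }
}
```

This mirrors why the PR swaps `HoodieFlinkEngineContext.DEFAULT` for a context derived from the per-job Hadoop configuration.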
Contributor

What kind of hadoop configuration do you want to pass around?

@Paddy0523 (Contributor Author) May 4, 2023

Something like this:


  private FileIndex(Path path, Configuration conf, RowType rowType, DataPruner dataPruner, PartitionPruners.PartitionPruner partitionPruner, int dataBucket) {
    org.apache.hadoop.conf.Configuration hadoopConf = HadoopConfigurations.getHadoopConf(conf);
    // ...
  }

Contributor

Can we keep the hadoop conf as a member instead?

Contributor Author

Yeah, I will keep the hadoop conf as a member.

Contributor Author

I got a warning (screenshot not preserved). Can we ignore it, or do I need to make any other changes?

Contributor

Generating the HoodieFlinkEngineContext on the fly should be fine.
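The agreed design (conf kept as a member, engine context generated on the fly) can be sketched with hypothetical, non-Hudi names; the serializable index holds only the lightweight config and builds the context inside the method that needs it:

```java
import java.io.Serializable;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for an engine context; not a Hudi class.
class EngineCtx {
    final Map<String, String> hadoopConf;

    EngineCtx(Map<String, String> hadoopConf) {
        this.hadoopConf = hadoopConf;
    }
}

class FileIndexSketch implements Serializable {
    // Keep only the plain config as a member, per the review suggestion.
    private final HashMap<String, String> hadoopConf;

    FileIndexSketch(HashMap<String, String> hadoopConf) {
        this.hadoopConf = hadoopConf;
    }

    // Build the engine context on the fly, right where it is needed,
    // instead of caching a heavyweight object as a field.
    String resolveDefaultFs() {
        EngineCtx ctx = new EngineCtx(hadoopConf);
        return ctx.hadoopConf.getOrDefault("fs.defaultFS", "file:///");
    }
}
```

Constructing the context per call keeps the index itself cheap to ship around while still honoring the per-job Hadoop options.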

Contributor Author

You are right.

@danny0405 danny0405 self-assigned this May 4, 2023
@hudi-bot (Collaborator)

hudi-bot commented May 4, 2023

CI report:

Bot commands: @hudi-bot supports the following commands:
  • @hudi-bot run azure: re-run the last Azure build

@danny0405 danny0405 merged commit 053dd4b into apache:master May 4, 2023
yihua pushed a commit to yihua/hudi that referenced this pull request May 15, 2023
…he#8595)

Passed the hadoop config options from per-job to the FileIndex correctly.
yihua pushed a commit to yihua/hudi that referenced this pull request May 17, 2023
…he#8595)

Passed the hadoop config options from per-job to the FileIndex correctly.

Labels

engine:flink Flink integration

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

3 participants