-
Notifications
You must be signed in to change notification settings - Fork 2.5k
[HUDI-5936] Fix serialization problem when FileStatus is not serializable #8190
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@hudi-bot run azure |
hudi-common/src/main/java/org/apache/hudi/metadata/FileSystemBackedTableMetadata.java
Show resolved
Hide resolved
hudi-common/src/main/java/org/apache/hudi/common/fs/HoodieSerializableFileStatus.java
Outdated
Show resolved
Hide resolved
|
Can you rebase with the latest master and re-trigger the test? |
hudi-common/src/main/java/org/apache/hudi/common/fs/HoodieSerializableFileStatus.java
Outdated
Show resolved
Hide resolved
hudi-common/src/main/java/org/apache/hudi/common/fs/HoodieSerializableFileStatus.java
Outdated
Show resolved
Hide resolved
|
There are test failues, can you squash with the latest master, let's see whether the failures could be fixed. |
hudi-common/src/main/java/org/apache/hudi/common/fs/HoodieSerializableFileStatus.java
Outdated
Show resolved
Hide resolved
...-spark-client/src/test/java/org/apache/hudi/common/fs/TestHoodieFileStatusSerialization.java
Outdated
Show resolved
Hide resolved
|
@CTTY You need to rebase with the latest master code to trigger the Azure CI, there had been some changes on the Azure CI conf files. |
danny0405
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1, fine with the change, can you take care of the test failures @CTTY ~
|
@CTTY It looks like Hive query fails in the bundle validation due to: |
Thanks for pointing this out. I guess we still need a new |
# Conflicts: # hudi-common/src/main/java/org/apache/hudi/common/fs/FSUtils.java
|
|
||
| // List all directories in parallel | ||
| engineContext.setJobStatus(this.getClass().getSimpleName(), "Listing all partitions with prefix " + relativePathPrefix); | ||
| List<FileStatus> dirToFileListing = engineContext.flatMap(pathsToList, path -> { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@CTTY looks like in the latest master, we no longer return FileStatus here (the Path instances are used instead). Is this PR still needed?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You are right, I don't think this is needed anymore
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Got it. I'll close this PR.
Change Logs
Fixed a potential serialization issue when Hudi is running on FileSystem implementation whose FileStatus is not serializable.
Impact
no impact
Risk level (write none, low medium or high below)
none
Documentation Update
Describe any necessary documentation update if there is any new feature, config, or user-facing change
ticket number here and follow the instruction to make
changes to the website.
Contributor's checklist