-
Notifications
You must be signed in to change notification settings - Fork 2.5k
[HUDI-9622] Add implementation of MergeHandle backed by the HoodieFileGroupReader #13699
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
nsivabalan
merged 56 commits into
apache:master
from
the-other-tim-brown:cow-merge-handle-to-fgr-3
Aug 15, 2025
Merged
Changes from all commits
Commits
Show all changes
56 commits
Select commit
Hold shift + click to select a range
e1166be
Squash: get baseline testing and cow handle setup
the-other-tim-brown b3ecd1a
fix conflicts
the-other-tim-brown 7ab9e93
allow incoming records loader to specify the record schema which may …
the-other-tim-brown b508a13
move conversion to loader
the-other-tim-brown 3e441d3
fix handling of auto-keygen flow, update test which requires errors
the-other-tim-brown 929aa84
fix schema used in buffered record after projection, remove unnecessa…
the-other-tim-brown e4997d6
update test setup to recreate table if populate meta fields is false
the-other-tim-brown 9c41a8f
pass in the reader context factory so we can use engine specific reco…
the-other-tim-brown f4b86e7
cleanup
the-other-tim-brown 5e957ba
fix expression payload handling (still 2 test failures)
the-other-tim-brown a14f916
add temporary shouldIgnore step
the-other-tim-brown 735eec1
fix sentinel case for index utils
the-other-tim-brown e6fab6e
Add custom merger test
a3edd13
Change custom merger logic to accept lower ordering value records
3400d07
fix test setup
the-other-tim-brown 22dc097
move logic to update processer for skipping in payload case, add comm…
the-other-tim-brown 4da3472
fix update processor check
the-other-tim-brown ded87dc
clean up repeated code
the-other-tim-brown 10259e2
fix delete context in buffer loader to match incoming record schema, …
the-other-tim-brown 70ace82
handle expression payload field rewrite when shouldIgnore is false an…
the-other-tim-brown 5c62e8c
remove changes to compaction flow for index update, ensure consistent…
the-other-tim-brown fb8218d
add support for merger shouldFlush
the-other-tim-brown c182418
fix multi-format writes, fix test serialization issues
the-other-tim-brown 0dbe048
use new static instance in test
the-other-tim-brown c90c00f
fix multi-format on java reader
the-other-tim-brown 404dac8
add concept of used keys to allow duplicate updates
the-other-tim-brown 3fff643
move addKey to common place with null check
the-other-tim-brown 6e74365
fix handle factory expectations to match new defaults, update TestCus…
the-other-tim-brown 8604cbc
address feedback on StreamingFileGroupRecordBufferLoader taking in sc…
the-other-tim-brown 9cf49e9
cleanup FileGroupReaderBasedMergeHandle initialization
the-other-tim-brown 83b63a2
add hasLogFiles method to InputSplit
the-other-tim-brown ba96270
move some utility methods to OrderingValues, fix IOUtils
danny0405 706ba1e
move reader context factory to hoodie table
danny0405 fd30e59
fix close to return existing write status if already closed instead o…
the-other-tim-brown 3adf0b9
fix test setups after changes
the-other-tim-brown f18ff16
limit the reader context factory to just spark COW upsert scenarios
danny0405 51b615d
refresh timeline in test
the-other-tim-brown 2cc89df
handle small file updates on MoR
the-other-tim-brown 8a083cb
remove indentation only changes to files
the-other-tim-brown 3d2b485
update close to return write status if already closed, handle schema …
the-other-tim-brown cea6238
update schema evolution check
the-other-tim-brown 4503d4e
fix cache bug
the-other-tim-brown f6572e3
fix query schema and undo change to InternalSchemaUtils
the-other-tim-brown b150753
fix close sequence on flink if already closed
the-other-tim-brown ff39bef
remvoe dupe key handling
the-other-tim-brown 00569c2
remove shouldFlush handling and test cases
the-other-tim-brown 4f3f3af
use legacy writer class for duplicate data cases
the-other-tim-brown af3a4b6
style
the-other-tim-brown d8c20e5
fix schema used in gcs test
the-other-tim-brown a234757
add requsted comments, add unit tests for update processor, fix NPE w…
the-other-tim-brown 539cf8b
update comment on shouldPreserveRecordMetadata case
the-other-tim-brown 85139a1
mark test as ignored
the-other-tim-brown cc430c9
Adding tests for event time metadata
nsivabalan 04242c5
Address minor comments, fix schema for merger in index utils
the-other-tim-brown 2a9cad3
make operation -> compactionOperation to avoid overlap in naming in F…
the-other-tim-brown 64d3575
use write schema without meta fields for expression payload merge
the-other-tim-brown File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.