Skip to content

Conversation

@jonvex
Copy link
Contributor

@jonvex jonvex commented May 6, 2024

Change Logs

Abstract io reader and writer to de-hadoop

Impact

de-hadooping for trino

Risk level (write none, low medium or high below)

low

Documentation Update

N/A

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

@github-actions github-actions bot added the size:XL PR with lines of changes > 1000 label May 6, 2024
@github-actions github-actions bot added size:L PR with lines of changes in (300, 1000] and removed size:XL PR with lines of changes > 1000 labels May 9, 2024
@jonvex jonvex force-pushed the create_hudi_io_factory branch from 1a3d7a8 to 5bf8143 Compare May 9, 2024 20:52
@github-actions github-actions bot added size:XL PR with lines of changes > 1000 and removed size:L PR with lines of changes in (300, 1000] labels May 9, 2024
@github-actions github-actions bot added size:L PR with lines of changes in (300, 1000] and removed size:XL PR with lines of changes > 1000 labels May 9, 2024
baseInstantTime,
new LocalTaskContextSupplier(),
true)) {
HoodieConfig cfg = new HoodieConfig();
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should this test class be moved to hudi-hadoop-common module? HoodieAvroWriteSupport<IndexedRecord> needs to be tested.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not sure what you are suggesting here

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Basically, making sure that the bloom filter writing is tested with HoodieAvroWriteSupport.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

image Ran a test with the debugger and this shows that bloom filter is used

Copy link
Contributor

@yihua yihua left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also, rename the PR to "Make Hudi reader and writer factory APIs Hadoop-independent"?

@jonvex jonvex changed the title [HUDI-7350] Create hudi io factory [HUDI-7350] Make Hudi reader and writer factory APIs Hadoop-independent May 9, 2024
@jonvex jonvex requested a review from yihua May 9, 2024 22:47
Copy link
Contributor

@yihua yihua left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@hudi-bot
Copy link
Collaborator

hudi-bot commented May 9, 2024

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

@jonvex
Copy link
Contributor Author

jonvex commented May 10, 2024

passed ci:
image

@jonvex jonvex merged commit e42217d into apache:master May 10, 2024
yihua pushed a commit that referenced this pull request May 15, 2024
…nt (#11163)

Abstract io reader and writer to de-hadoop
---------

Co-authored-by: Jonathan Vexler <=>
yihua pushed a commit that referenced this pull request May 15, 2024
…nt (#11163)

Abstract io reader and writer to de-hadoop
---------

Co-authored-by: Jonathan Vexler <=>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

size:L PR with lines of changes in (300, 1000]

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants