Skip to content

Conversation

@codope
Copy link
Owner

@codope codope commented Dec 27, 2021

Class view in RFC.

Copy link

@vinothchandar vinothchandar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The fast path and the core logic looks good to me. want to do an interactive session to go over any simplifications, understand some threading model/design better. Great job!

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

revert this and get rid of the whitespace changes?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We need hudi-hive-sync? for querying?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this wrapping. bit odd to me. but seems like the convention.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

rename hudiBaseFileFormat and lets use this terminology consistently everywhere? including HudiConfig

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Base file format ...

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

how much of this class is code reused from HiveSplitBackgroundLoader

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we do the same trimming around hbase/parquet-avro - we did for Presto, here as well?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

whats the life cycle of this object? Once per query?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Modelled after an existing HiveSplit?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you point me to where the file pruning happens i.e filtering out files that don't match the filter ?

@codope codope force-pushed the hudi-plugin-cleanup branch from e86685b to c988546 Compare January 27, 2022 13:27
Rebase and resolve conflicts

Use cached thread pool for split generation
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants