-
Notifications
You must be signed in to change notification settings - Fork 2.5k
[HUDI-3902] Fallback to HadoopFsRelation in cases non-involving Schema Evolution
#5352
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/DefaultSource.scala
Show resolved
Hide resolved
nsivabalan
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
one clarification. looks good to me.
|
@xiarixiaoyao Can you please skim this PR as well |
…nstead pass the config as param (which would be passed along to fork individual Hadoop conf)
de28a2a to
6f2b012
Compare
|
@alexeykudinkin |
|
@xiarixiaoyao it will most likely solve this particular issue, but we'll continue to be exposed to this becoming a problem again until we upstream the real fix and make |
6f9189b to
b975d32
Compare
…ema Evolution (#5352) Co-authored-by: Raymond Xu <[email protected]>
Tips
What is the purpose of the pull request
Spark unfortunately predicates some of its optimization
Rules (and some other handling) on usage ofHadoopFsRelationwhich leads to those optimizations not being applied when we rely on our customRelationimpls.To work this around for the 0.11, we fallback to
HadoopFsRelationin cases when it's feasible to do so.Brief change log
toHadoopFsRelationmethod forBaseFileOnlyRelationHadoopFsRelationfor non-Schema Evolution use-casesVerify this pull request
This pull request is already covered by existing tests, such as (please describe tests).
Committer checklist
Has a corresponding JIRA in PR title & commit
Commit message is descriptive of the change
CI is green
Necessary doc changes done or have another open PR
For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.