[HUDI-3254] Introduce HoodieCatalog to manage tables for Spark Datasource V2 #4611
@hudi-bot run azure
...rce/hudi-spark3/src/main/scala/org/apache/spark/sql/hudi/analysis/HoodieSpark3Analysis.scala
val tableSchema = HoodieSqlCommonUtils.getTableSqlSchema(metaClient)
if (tableSchema.isDefined && schema.isEmpty) {
  // Fill the schema with the schema from the table
  c.copy(tableSchema = tableSchema.get)
I think we don't need to fill in the table schema here. We can move this step to HoodieCatalog.createHoodieTable.
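The fill-in rule under discussion can be sketched in isolation. The snippet below is only an illustration of the decision shown in the diff (use the stored table schema when the statement declares no columns); `Field`, `TableSpec`, and `resolveSchema` are hypothetical stand-ins, not Hudi or Spark classes, and it does not show how HoodieCatalog.createHoodieTable would actually be wired.

```scala
// Hypothetical stand-ins for illustration only; these are not Hudi or Spark types.
final case class Field(name: String, dataType: String)
final case class TableSpec(name: String, schema: Seq[Field])

object SchemaFillSketch {
  // Keep the declared schema if there is one; otherwise fall back to the
  // schema already stored with the table, when such a schema exists.
  def resolveSchema(spec: TableSpec, stored: Option[Seq[Field]]): TableSpec =
    stored match {
      case Some(fields) if spec.schema.isEmpty => spec.copy(schema = fields)
      case _                                   => spec
    }
}
```

Keeping this rule in a single function means it could be called from the catalog's create path instead of an analysis rule, which is the move the comment proposes.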
xushiyan left a comment
@leesf I had another pass. Overall, the logic makes sense. Do you have a test plan for this change? Feel free to share any test setup you have. It would be much better to see some test results to help verify the end-to-end functionality.
Verified via spark-shell that basic reads and writes are working. cc @leesf
What is the purpose of the pull request
Introduce HoodieCatalog to manage Hudi tables for Spark Datasource V2.
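As a rough sketch of how a session might opt in to such a catalog: Spark resolves a catalog implementation from the `spark.sql.catalog.<name>` setting, so the session catalog can be pointed at a custom class. The fully qualified name `org.apache.spark.sql.hudi.catalog.HoodieCatalog` below is an assumption inferred from the `.../sql/hudi/catalog/` file paths touched in this PR, and the helper object `HoodieCatalogConf` is hypothetical.

```scala
// Hypothetical helper: collects the Spark settings and renders them as
// spark-shell flags. The HoodieCatalog class name is an assumption (see above).
object HoodieCatalogConf {
  val settings: Seq[(String, String)] = Seq(
    "spark.serializer" -> "org.apache.spark.serializer.KryoSerializer",
    "spark.sql.extensions" -> "org.apache.spark.sql.hudi.HoodieSparkSessionExtension",
    "spark.sql.catalog.spark_catalog" -> "org.apache.spark.sql.hudi.catalog.HoodieCatalog"
  )

  // Render each setting as a "--conf key=value" flag, separated by spaces.
  def toShellFlags: String =
    settings.map { case (k, v) => s"--conf $k=$v" }.mkString(" ")
}
```

Passing the output of `HoodieCatalogConf.toShellFlags` to `spark-shell` (with a Hudi bundle on the classpath) would be one way to try the catalog interactively.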